Socket
Socket
Sign inDemoInstall

minimol

Package Overview
Dependencies
Maintainers
2
Alerts
File Explorer

Advanced tools

Socket logo

Install Socket

Detect and block malicious and high-risk dependencies

Install

minimol

Molecular fingerprinting using pre-trained deep nets


Maintainers
2

Minimol architecture

A parameter-efficient molecular featuriser that generalises well to biological tasks thanks to the effective pre-training on biological and quantum mechnical datasets.

The model has been introduced in the paper 𝙼𝚒𝚗𝚒𝙼𝚘𝚕: A Parameter-Efficient Foundation Model for Molecular Learning, published in the ICML workshop on Accessible and Efficient Foundation Models for Biological Discovery in 2024.

Usage

Embeddings can be generated in four lines of code:

from minimol import Minimol
model = Minimol()
smiles = [
    'COc1ccc2cc(C(=O)NC3(C(=O)N[C@H](Cc4ccccc4)C(=O)NCC4CCN(CC5CCOCC5)CC4)CCCC3)sc2c1',
    'Nc1nc(=O)c2c([nH]1)NCC(CNc1ccc(C(=O)NC(CCC(=O)O)C(=O)O)cc1)N2C=O',
    'O=C1CCCN1CCCCN1CCN(c2cc(C(F)(F)F)ccn2)CC1',
    'c1ccc(-c2cccnc2)cc1',
]
model(smiles)
>> A list of 4 tensors of (512,) shape

For a Colab notebook showing how to use Minimol's fingerprints to achieve SoTA results on a downstream task, click here: Open In Colab

Installation

Pip

When used with cuda, use nvcc --version to see which version of the driver is installed on your machine, to select the wheel (cuXXX):

pip install torch-sparse torch-cluster torch-scatter -f https://pytorch-geometric.com/whl/torch-2.3.0+cu124.html
pip install minimol

Local

git clone git@github.com:graphcore-research/minimol.git 
cd minimol
mamba env create -f env.yml -n minimol_venv
mamba activate minimol

To install mamba see the official documentation.

Performance

The model has been evaluated on 22 benchmarks from the ADMET group of Therapeutics Data Commons (TDC). These are the results when comparing to MolE and TOP5 models from the TDC leaderboard (as of June 2024):

TDC DatasetTDC LeaderboardMolEMiniMol (GINE)
NameSizeMetricSoTA ResultResultRankResultRank
Absorption
Caco2 Wang906MAE0.276 ± 0.0050.310 ± 0.01060.350 ± 0.0187
Bioavailability Ma640AUROC0.748 ± 0.0330.654 ± 0.02870.689 ± 0.0205
Lipophilicity AZ4,200MAE0.467 ± 0.0060.469 ± 0.00930.456 ± 0.0081
Solubility AqSolDB9,982MAE0.761 ± 0.0250.792 ± 0.00550.741 ± 0.0131
HIA Hou578AUROC0.989 ± 0.0010.963 ± 0.01970.993 ± 0.0051
Pgp Broccatelli1,212AUROC0.938 ± 0.0020.915 ± 0.00570.942 ± 0.0021
Distribution
BBB Martins1,975AUROC0.916 ± 0.0010.903 ± 0.00570.924 ± 0.0031
PPBR AZ1,797MAE7.526 ± 0.1068.073 ± 0.33567.696 ± 0.1254
VDss Lombardo1,130Spearman0.713 ± 0.0070.654 ± 0.03130.535 ± 0.0277
Metabolism
CYP2C9 Veith12,092AUPRC0.859 ± 0.0010.801 ± 0.00350.823 ± 0.0064
CYP2D6 Veith13,130AUPRC0.790 ± 0.0010.682 ± 0.00860.719 ± 0.0045
CYP3A4 Veith12,328AUPRC0.916 ± 0.0000.867 ± 0.00370.877 ± 0.0014
CYP2C9 Substrate666AUPRC0.441 ± 0.0330.446 ± 0.06220.474 ± 0.0251
CYP2D6 Substrate664AUPRC0.736 ± 0.0240.699 ± 0.01870.695 ± 0.0326
CYP3A4 Substrate667AUROC0.662 ± 0.0310.670 ± 0.01810.663 ± 0.0082
Excretion
Half Life Obach667Spearman0.562 ± 0.0080.549 ± 0.02440.495 ± 0.0426
Clearance Hepatocyte1,102Spearman0.498 ± 0.0090.381 ± 0.03870.446 ± 0.0293
Clearance Microsome1,020Spearman0.630 ± 0.0100.607 ± 0.02760.628 ± 0.0052
Toxicity
LD50 Zhu7,385MAE0.552 ± 0.0090.823 ± 0.01970.585 ± 0.0052
hERG648AUROC0.880 ± 0.0020.813 ± 0.00970.846 ± 0.0164
Ames7,255AUROC0.871 ± 0.0020.883 ± 0.00510.849 ± 0.0045
DILI475AUROC0.925 ± 0.0050.577 ± 0.02170.956 ± 0.0061
Mean Rank:5.23.3

License

Copyright (c) 2024 Graphcore Ltd. Licensed under the MIT License.

The included code is released under the MIT license (see details of the license).

FAQs


Did you know?

Socket

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Install

Related posts

SocketSocket SOC 2 Logo

Product

  • Package Alerts
  • Integrations
  • Docs
  • Pricing
  • FAQ
  • Roadmap
  • Changelog

Packages

npm

Stay in touch

Get open source security insights delivered straight into your inbox.


  • Terms
  • Privacy
  • Security

Made with ⚡️ by Socket Inc