
Security News
Bun 1.2.19 Adds Isolated Installs for Better Monorepo Support
Bun 1.2.19 introduces isolated installs for smoother monorepo workflows, along with performance boosts, new tooling, and key compatibility fixes.
biochemical-data-connectors
Advanced tools
A Python package to extract chemical, biochemical, and bioactivity data from public databases like ORD, ChEMBL and PubChem.
biochemical-data-connectors
is a Python package for extracting chemical, biochemical, and bioactivity data from public databases like ChEMBL, PubChem, BindingDB, and the Open Reaction Database (ORD).
biochemical-data-connectors
provides a simple and consistent interface to query major cheminformatics bioinformatics databases for compounds. It is designed to be a modular and reusable tool for researchers and developers in cheminformatics and drug discovery.
ChEMBLBioactivesExtractor
)PubChemBioactivesExtractor
)OpenReactionDatabaseConnector
) to efficiently process a local copy of the Open Reaction Database.You can install this package locally via:
pip install biochemical-data-connectors
Here is a simple example of how to retrieve all compounds from ChEMBL with a measured Kd of less than or equal to 1000 nM for the EGFR protein (UniProt ID: P00533
).
import logging
from biochemical_data_connectors import ChEMBLConnector
logging.basicConfig(level=logging.INFO)
logger = logging.getLogger(__name__)
# 1. Instantiate the connector for the desired database
chembl_connector = ChEMBLConnector(
bioactivity_measure='Kd',
bioactivity_threshold=1000.0, # in nM
logger=logger
)
# 2. Specify the target's UniProt ID
target_uniprot_id = "P00533" # EGFR
# 3. Get the bioactive compounds
print(f"Fetching bioactive compounds for {target_uniprot_id} from ChEMBL...")
smiles_list = chembl_connector.get_bioactive_compounds(target_uniprot_id)
# 4. Print the results
if smiles_list:
print(f"\nFound {len(smiles_list)} compounds.")
print("First 5 compounds:")
for smiles in smiles_list[:5]:
print(smiles)
else:
print("No compounds found matching the criteria.")
biochemical-data-connectors/
├── pyproject.toml
├── requirements-dev.txt
├── src/
│ └── biochemical_data_connectors/
│ ├── __init__.py
│ ├── constants.py
│ ├── models.py
│ ├── connectors/
│ │ ├── __init__.py
│ │ ├── ord_connectors.py
│ │ └── bioactive_compounds
│ │ ├── __init__.py
│ │ ├── base_bioactives_connector.py
│ │ ├── bindingdb_bioactives_connector.py
│ │ ├── chembl_bioactives_connector.py
│ │ ├── iuphar_bioactives_connector.py
│ │ └── pubchem_bioactives_connector.py
│ └── utils/
│ ├── __init__.py
│ ├── files_utils.py
│ ├── iter_utils.py
│ ├── standardization_utils.py
│ └── api/
│ ├── __init__.py
│ ├── base_api.py
│ ├── bindingbd_api.py
│ ├── chembl_api.py
│ ├── iuphar_api.py
│ ├── mappings.py
│ └── pubchem_api.py
├── tests/
│ └── ...
└── README.md
This project is licensed under the terms of the MIT License.
FAQs
A Python package to extract chemical, biochemical, and bioactivity data from public databases like ORD, ChEMBL and PubChem.
We found that biochemical-data-connectors demonstrated a healthy version release cadence and project activity because the last version was released less than a year ago. It has 1 open source maintainer collaborating on the project.
Did you know?
Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.
Security News
Bun 1.2.19 introduces isolated installs for smoother monorepo workflows, along with performance boosts, new tooling, and key compatibility fixes.
Security News
Popular npm packages like eslint-config-prettier were compromised after a phishing attack stole a maintainer’s token, spreading malicious updates.
Security News
/Research
A phishing attack targeted developers using a typosquatted npm domain (npnjs.com) to steal credentials via fake login pages - watch out for similar scams.