
Security News
npm Adopts OIDC for Trusted Publishing in CI/CD Workflows
npm now supports Trusted Publishing with OIDC, enabling secure package publishing directly from CI/CD workflows without relying on long-lived tokens.
This package allows interface with the AlphaFold Protein Structure Database. This package allows the download of entries' metadata an AlphaFold files (e.g. mmCIF, PAE, PDB...)
AlphaFetcher
facilitates fetching and downloading protein metadata and related files from the
AlphaFold Protein Structure Database
using Uniprot access codes.
Batch Import: Input single or multiple Uniprot access codes seamlessly.
Parallel Processing: Efficiently fetch metadata using multithreading.
Flexible Downloads: Choose among various file types - PDB, CIF, BCIF, PAE image, and PAE data files.
Optimal Performance: Easily adjust the number of workers for threaded tasks.
We recommend PyPI installation:
pip install alphafetcher
from alphafetcher import AlphaFetcher
# Instantiate the fetcher
# The base_savedir parameter allows you to set a base directory where files will be saved.
# Inside this directory, two separate directories for pdb and cif files will be created.
fetcher = AlphaFetcher(base_savedir="my_savedir")
# Add desired Uniprot access codes
fetcher.add_proteins(["A1KXE4", "H0YL14", "B2RXH2", "A8MVW5"])
# Retrieve metadata
fetcher.fetch_metadata(multithread=True, workers=4)
# Metadata available at fetcher.metadata_dict
# Commence download of specified files
fetcher.download_all_files(pdb=True, cif=True, multithread=True, workers=4)
AlphaFetcher(base_savedir: str)
base_savedir
is where the downloaded pdb and cif files will be stored. Inside this directory, two subdirectories will be automatically created: one for pdb files and another for cif files.base_savedir
: The base directory where the pdb and cif files will be saved.add_proteins(proteins: Union[str, List[str]])
fetch_metadata(multithread: bool = False, workers: int = 10)
fetcher.metadata_dict
, assuming the notation of the example
above is followed.download_all_files(uniprot_access: str, pdb: bool = False, cif: bool = False, bcif: bool = False, pae_image:bool = False, pae_data: bool = False)
For a comprehensive guide, users are encouraged to view the docstrings incorporated within the source code.
Always respect the AlphaFold Protein Structure Database terms of service, ensuring not to flood it with excessive concurrent requests. Consider adjusting the number of workers to reduce the requests density.
We welcome your contributions! To collaborate:
This project is licensed under the GNU General Public License v3 (GPLv3).
FAQs
This package allows interface with the AlphaFold Protein Structure Database. This package allows the download of entries' metadata an AlphaFold files (e.g. mmCIF, PAE, PDB...)
We found that AlphaFetcher demonstrated a healthy version release cadence and project activity because the last version was released less than a year ago. It has 1 open source maintainer collaborating on the project.
Did you know?
Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.
Security News
npm now supports Trusted Publishing with OIDC, enabling secure package publishing directly from CI/CD workflows without relying on long-lived tokens.
Research
/Security News
A RubyGems malware campaign used 60 malicious packages posing as automation tools to steal credentials from social media and marketing tool users.
Security News
The CNA Scorecard ranks CVE issuers by data completeness, revealing major gaps in patch info and software identifiers across thousands of vulnerabilities.