TRILL
TRILL (TRaining and Inference using the Language of Life) is a sandbox for creative protein engineering and discovery. As a bioengineer myself, I am greatly interested in deep-learning-based approaches to protein design and analysis. However, many of these deep-learning models are unwieldy because of their sheer size, especially for non-ML practitioners. TRILL not only lets researchers run inference on their proteins of interest with a variety of models, it also democratizes the efficient fine-tuning of large language models. Whether using Google Colab with one GPU or a supercomputer with many, TRILL empowers scientists to leverage models with millions to billions of parameters without worrying (too much) about hardware constraints. As of v1.8.3, TRILL supports the following models:
Command | Function | Available Models |
---|---|---|
Embed | Generates numerical representations or "embeddings" of protein sequences for quantitative analysis and comparison. | ESM2, ProtT5-XL, ProstT5, Ankh, CaLM, mRNA-FM/RNA-FM, SaProt |
Visualize | Creates interactive 2D visualizations of embeddings for exploratory data analysis. | PCA, t-SNE, UMAP |
Finetune | Finetunes protein language models for specific tasks. | ESM2, ProtGPT2, ZymCTRL, ProGen2 |
Language Model Protein Generation | Generates proteins using pretrained language models. | ESM2, ProtGPT2, ZymCTRL, ProGen2 |
Inverse Folding Protein Generation | Designs proteins to fold into specific 3D structures. | ESM-IF1, LigandMPNN, ProstT5 |
Diffusion Based Protein Generation | Uses denoising diffusion models to generate proteins. | RFDiffusion |
Fold | Predicts 3D protein structures. | ESMFold, ProstT5, Chai-1, Boltz-1 |
Dock | Simulates protein-ligand interactions. | DiffDock, Smina, AutoDock Vina, LightDock, GeoDock |
Classify | Predicts protein properties with pretrained models or trains custom classifiers. | TemStaPro, EpHod, ECPICK, LightGBM, XGBoost, Isolation Forest |
Regress | Trains custom regression models. | LightGBM, Linear |
Simulate | Uses molecular dynamics to simulate protein-ligand interactions. | OpenMM |
Score | Scores protein sequences with ESM1v or ESM2, or protein structures with ProteinMPNN/LigandMPNN, in a zero-shot manner. | COMPSS |
Workflow | Automated protein design workflows. | Foldtuning |
Check out the documentation and examples at https://trill.readthedocs.io/en/latest/index.html
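To make the "quantitative analysis and comparison" mentioned for the Embed command more concrete, here is a minimal sketch of one common way to compare two embedding vectors: cosine similarity. This is not TRILL's code or API. The function name `cosine_similarity` and the tiny 4-dimensional vectors are made up for illustration; real embeddings from a protein language model are much longer and would be loaded from TRILL's output files.

```python
import math
from typing import Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Return the cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


# Hypothetical placeholder vectors, NOT real model output.
protein_a = [0.2, -1.0, 0.5, 0.3]
protein_b = [0.1, -0.9, 0.6, 0.2]
protein_c = [-0.7, 0.4, -0.2, 0.9]

sim_ab = cosine_similarity(protein_a, protein_b)
sim_ac = cosine_similarity(protein_a, protein_c)
print(f"A vs B: {sim_ab:.3f}")
print(f"A vs C: {sim_ac:.3f}")
```

A value near 1 means the two vectors point in nearly the same direction, while values near 0 or below suggest the sequences are represented very differently. Whether that corresponds to functional similarity depends on the model and the task.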
FAQs
Sandbox for Computational Protein Design
We found that trill-proteins demonstrated a healthy version release cadence and project activity because the last version was released less than a year ago. It has 1 open source maintainer collaborating on the project.