TRILL
TRILL (TRaining and Inference using the Language of Life) is a sandbox for creative protein engineering and discovery. As a bioengineer myself, I find deep-learning-based approaches to protein design and analysis of great interest. However, many of these models are unwieldy because of their sheer size, especially for non-ML practitioners. TRILL not only lets researchers run inference on their proteins of interest with a variety of models, it also democratizes the efficient fine-tuning of large language models. Whether using Google Colab with one GPU or a supercomputer with many, TRILL empowers scientists to leverage models with millions to billions of parameters without worrying (too much) about hardware constraints. As of v1.8.0, TRILL supports the following commands and models:
Command | Function | Available Models |
---|---|---|
Embed | Generates numerical representations or "embeddings" of protein sequences for quantitative analysis and comparison. | ESM2, ProtT5-XL, ProstT5, Ankh |
Visualize | Creates interactive 2D visualizations of embeddings for exploratory data analysis. | PCA, t-SNE, UMAP |
Finetune | Finetunes protein language models for specific tasks. | ESM2, ProtGPT2, ZymCTRL |
Language Model Protein Generation | Generates proteins using pretrained language models. | ESM2, ProtGPT2, ZymCTRL |
Inverse Folding Protein Generation | Designs proteins to fold into specific 3D structures. | ESM-IF1, LigandMPNN, ProstT5 |
Diffusion Based Protein Generation | Uses denoising diffusion models to generate proteins. | RFDiffusion |
Fold | Predicts 3D protein structures. | ESMFold, ProstT5 |
Dock | Simulates protein-ligand interactions. | DiffDock, Smina, AutoDock Vina, LightDock, GeoDock |
Classify | Predicts protein properties with pretrained models or trains custom classifiers. | TemStaPro, EpHod, ECPICK, LightGBM, XGBoost, Isolation Forest |
Regress | Trains custom regression models. | LightGBM, Linear |
Simulate | Uses molecular dynamics to simulate protein-ligand interactions. | OpenMM |
Score | Uses ESM1v or ESM2 to score protein sequences, or ProteinMPNN to score protein structures, in a zero-shot manner. | COMPSS |
Check out the documentation and examples at https://trill.readthedocs.io/en/latest/index.html
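
To make the idea behind the Embed row concrete, here is a minimal, hedged sketch of what "numerical representations of protein sequences for quantitative comparison" can look like. It does not use TRILL or any protein language model: the hypothetical `composition_embedding` helper is a toy stand-in that maps a sequence to a 20-dimensional residue-frequency vector, and `cosine_similarity` compares two such vectors. The sequences and function names are invented for illustration. Real embeddings from models such as ESM2 are learned and far richer, but they are compared in the same spirit.

```python
from collections import Counter
from math import sqrt

# The 20 standard amino acids, in a fixed order that defines the vector axes.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def composition_embedding(sequence: str) -> list[float]:
    """Toy stand-in for a learned embedding: a residue-frequency vector.

    This is NOT how TRILL or a protein language model embeds sequences;
    it only illustrates turning a sequence into a fixed-length vector.
    """
    counts = Counter(sequence.upper())
    total = sum(counts[aa] for aa in AMINO_ACIDS)
    if total == 0:
        raise ValueError("sequence contains no standard amino acids")
    return [counts[aa] / total for aa in AMINO_ACIDS]


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = sqrt(sum(x * x for x in a))
    norm_b = sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Hypothetical example sequences (made up for this sketch).
sequences = {
    "seq_a": "MKTAYIAKQR",
    "seq_b": "MKTAYIAKQK",  # one substitution relative to seq_a
    "seq_c": "GGGGSGGGGS",  # a glycine/serine linker, unrelated composition
}

embeddings = {name: composition_embedding(s) for name, s in sequences.items()}

sim_ab = cosine_similarity(embeddings["seq_a"], embeddings["seq_b"])
sim_ac = cosine_similarity(embeddings["seq_a"], embeddings["seq_c"])

print(f"seq_a vs seq_b: {sim_ab:.3f}")
print(f"seq_a vs seq_c: {sim_ac:.3f}")
```

Once sequences are vectors, similarity, clustering, and 2D projection (as in the Visualize row) all become ordinary numerical operations. For the actual TRILL commands and their options, rely on the documentation linked above rather than this sketch.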
FAQs
Sandbox for Computational Protein Design
We found that trill-proteins shows a healthy release cadence and project activity: its latest version was released less than a year ago. It has 1 open source maintainer working on the project.