Embed | Generates numerical representations or "embeddings" of protein sequences for quantitative analysis and comparison. | ESM2, ProtT5-XL, ProstT5, Ankh, CaLM, mRNA-FM/RNA-FM, SaProt |
Visualize | Creates interactive 2D visualizations of embeddings for exploratory data analysis. | PCA, t-SNE, UMAP |
Finetune | Finetunes protein language models for specific tasks. | ESM2, ProtGPT2, ZymCTRL, ProGen2 |
Language Model Protein Generation | Generates proteins using pretrained language models. | ESM2, ProtGPT2, ZymCTRL, ProGen2 |
Inverse Folding Protein Generation | Designs proteins to fold into specific 3D structures. | ESM-IF1, LigandMPNN, ProstT5 |
Diffusion Based Protein Generation | Uses denoising diffusion models to generate proteins. | RFDiffusion |
Fold | Predicts 3D protein structures. | ESMFold, ProstT5, Chai-1, Boltz-1 |
Dock | Simulates protein-ligand interactions. | DiffDock, Smina, Autodock Vina, Lightdock, GeoDock |
Classify | Predicts protein properties with pretrained models or train custom classifiers | TemStaPro, EpHod, ECPICK, LightGBM, XGBoost, Isolation Forest |
Regress | Train custom regression models. | LightGBM, Linear |
Simulate | Uses molecular dynamics to simulate protein-ligand interactions. | OpenMM |
Score | Utilize ESM1v or ESM2 to score protein sequences or ProteinMPNN/LigandMPNN to score protein structures in a zero-shot manner. | COMPSS |
Workflow | Automated protein design workflows. | Foldtuning |