littleworth/protgpt2-distilled-small

Activeby littleworth50updated 4 months ago

A compact protein language model distilled from ProtGPT2 using complementary-regularizer distillation---a method that combines uncertainty-aware position weighting with calibration-aware label smoothing to achieve 54% better perplexity than standard knowledge distillation at 9.4x compression.

Sourced from

HuggingFace — littleworth/protgpt2-distilled-small

Related resources

littleworth/protgpt2-distilled-tiny

by littleworth

A compact protein language model distilled from ProtGPT2 using complementary-regularizer distillation---a method that combines uncertainty-aware position weighting with calibration-aware label smoothing to achieve 87% better perplexity than standard knowledge distillation at 20x compression.

Active144 months ago

littleworth/protgpt2-distilled-medium

by littleworth

A compact protein language model distilled from ProtGPT2 using complementary-regularizer distillation---a method that combines uncertainty-aware position weighting with calibration-aware label smoothing to achieve 31% better perplexity than standard knowledge distillation at 3.8x compression.

Active704 months ago

biohub/ESMC-300M

by biohub

fill-mask

ESMC is a state-of-the-art protein language model that has learned the rules of protein biology from training on billions of protein sequences. ESMC provides representations of proteins enabling novel AI applications from therapeutic protein engineering to unlocking basic insights into protein…

Active10.3K1 month ago

biohub/ESMC-600M

by biohub

fill-mask

Active467.8K1 month ago

biohub/ESMC-6B

by biohub

fill-mask

Active1.3M1 month ago

alimotahharynia/DrugGen-2

by alimotahharynia

# DrugGen 2: A disease-aware language model for enhancing drug discovery DrugGen-2 is a disease‑aware language model specialized for generating drug-like SMILES structures based on both disease pathways and protein sequence.

Active1821 day ago