SimpleFold (Apple, arXiv 2025)
github.com/apple/ml-simplefoldFlow-matching protein folding model using only general-purpose transformer layers, scaled to 3B parameters and trained on 8.6M+ distilled structures; challenges the reliance on complex domain-specific architectures and supports PyTorch and MLX backends with model sizes from 100M to 3B parameters (985+ stars, MIT License)
Sourced from
- Awesome AI for Science — github.com/apple/ml-simplefold
Related resources
AlphaFold 3 inference pipeline for unified biomolecular structure prediction of proteins, nucleic acids, small molecules, ions, and post-translational modifications (Google DeepMind, Nature 2024)
Deep learning library for Chemistry based on Tensorflow
Protein structure prediction from ESM models
First fully open-source model achieving AlphaFold3-level accuracy with 1000x faster binding affinity prediction (MIT)
Trainable, memory-efficient PyTorch reproduction and retraining of AlphaFold2 providing new insights into its learning dynamics and out-of-distribution generalization; widely used as the open-source AlphaFold2 backbone underpinning many downstream protein structure prediction and design pipelines (Columbia AlQuraishi Lab & OpenFold Consortium, Nature Methods 2024)
RFdiffusion is an open source method for structure generation, with or without conditional information (a motif, target etc).