RiNALMo (Nature Communications 2025)

github.com/lbcb-sci/rinalmo
Active · 165 stars · updated 1 month ago
Python
Apache-2.0

General-purpose RNA language model with 650M parameters, pretrained on 36M non-coding RNA sequences. It generalizes well across downstream tasks, including secondary structure prediction, splice-site prediction, mean ribosome loading prediction, and ncRNA classification (lbcb-sci, 165+ stars, Apache-2.0)

Sourced from

  • Awesome AI for Science: github.com/lbcb-sci/rinalmo
  • GitHub: github.com/lbcb-sci/rinalmo

Related resources

Unified framework for state-of-the-art pre-trained bio foundation models across genomics and transcriptomics, providing standardized interfaces and pipelines for DNA, RNA, and single-cell models including Evo 2, Geneformer, scGPT, and UCE with streamlined inference, benchmarking, and fine-tuning workflows (213+ stars, 2024-2025)

Active · 215 stars · updated 1 month ago
Python
AGPL-3.0

GenerRNA is a generative pre-trained language model for de novo RNA sequence design. It is a Transformer (decoder-only, GPT-style) model that learns the "language" of RNA from millions of natural sequences and can generate novel, realistic RNA sequences without any structural input, functional…

Active · 0 stars · updated 1 week ago

Deep learning-based variant caller

Active · 3.7K stars · updated 2 months ago
Python
BSD-3-Clause

Single-cell analysis with transformers

Active · 1.6K stars · updated 1 month ago
Jupyter Notebook
MIT

Therapeutics Data Commons: 66 AI-ready datasets across 22 drug discovery tasks with 29 leaderboards, covering target identification, molecular generation, ADMET prediction, and clinical trial outcomes (Harvard MIMS, NeurIPS 2021/2024)

Idle · 1.3K stars · updated 11 months ago
Jupyter Notebook
MIT

Unified Python framework for bulk, single-cell, and spatial RNA-seq multi-omics analysis with deep learning deconvolution (VAE) and graph neural networks, bridging Bindea, scanpy and squidpy ecosystems (Nature Communications 2024)

Active · 1K stars · updated 2 weeks ago
Python
GPL-3.0