Find open-source science resources

A directory of tools, AI models, datasets, and research resources for biotech, bioinformatics, and other scientific fields. Aggregated from curated GitHub awesome-lists, HuggingFace, bio.tools, Bioconductor, and more.

6 of 5,923 resources

Single-cell analysis with transformers

Active1.6K1 month ago
Jupyter Notebook
MIT

First architecture deeply integrating a DNA foundation model with an LLM for multimodal biological reasoning, achieving 98% accuracy on KEGG disease pathway prediction and 15%+ average gains on variant effect prediction with interpretable step-by-step reasoning traces (bowang-lab, 390+ stars)

Active3902 months ago
Jupyter Notebook
Apache-2.0

Foundation models for genomics and transcriptomics pretrained on 3,000+ human genomes and 850+ diverse species, enabling chromatin accessibility prediction, splice site detection, and promoter classification across multiple model scales (InstaDeep, NVIDIA & TUM, Nature Methods 2023)

Active8843 months ago
Jupyter Notebook
NOASSERTION

Teaching Large Language Models the Language of Biology through single-cell transcriptomics (ICML 2024)

Idle8627 months ago
Jupyter Notebook
Apache-2.0

RNA foundation model trained on millions of RNA sequences for generalist RNA sequence understanding, enabling downstream structure prediction, function annotation, and representation learning for non-coding RNAs (ml4bio, 372+ stars)

Idle3741 year ago
Jupyter Notebook
MIT

Generative pre-training for genomics

Stale3202 years ago
Jupyter Notebook