Find open-source science resources

A directory of tools, AI models, datasets, and research resources for biotech, bioinformatics, and other scientific fields. Aggregated from curated GitHub awesome-lists, HuggingFace, bio.tools, Bioconductor, and more.

Chemical reaction network and systems biology interface for scientific machine learning (SciML), enabling high-performance, GPU-parallelized simulation and analysis of complex biochemical systems with O(1) solvers (SciML, 518+ stars, Julia)

Active · 517 stars · 2 days ago
Julia
NOASSERTION

Differentiable tokamak core transport simulator for fusion energy research, coupling PDE solvers with JAX auto-differentiation and neural-network surrogates for fast forward modelling, pulse-design, and trajectory optimization (Google DeepMind, Apache 2.0)

Active · 679 stars · 2 days ago
Python
NOASSERTION

Freely available tools for biological computing in Python, with included cookbook, packaging and thorough documentation. Part of the [Open Bioinformatics Foundation](http://open-bio.org/). Contains the very useful [Entrez](https://biopython.org/DIST/docs/api/Bio.Entrez-module.html) package for API access to the NCBI databases.

Active · 5.1K stars · 4 days ago
Python
NOASSERTION

This package provides a periodic table of the elements with support for mass, density, and X-ray/neutron scattering information.

Active · 172 stars · 1 week ago
Python
NOASSERTION

Diffusion MR imaging.

Active · 822 stars · 1 week ago
Python
NOASSERTION

A molecule manipulation library.

Active · 237 stars · 1 week ago
Python
NOASSERTION

A compressor of common genomic file formats (BAM, CRAM, FASTQ, VCF etc).

Active · 183 stars · 1 week ago
C
NOASSERTION

Physics-informed neural networks in Julia

Active · 1.2K stars · 1 week ago
Julia
NOASSERTION

samtools/bcftools are a suite of tools for manipulating NGS data and can be used to call variants.

Active · 871 stars · 1 week ago
C
NOASSERTION

atomate2 is a library of computational materials science workflows.

Active · 315 stars · 1 week ago
Python
NOASSERTION

Simulations of spiking neural networks.

Active · 1.2K stars · 1 week ago
Python
NOASSERTION

SOTA multimodal document parser with 1.2B parameters, reported to outperform GPT-4o; converts PDFs to LLM-ready Markdown/JSON

Active · 65.9K stars · 1 week ago
Python
NOASSERTION

Ontologies that aim to provide semantic specifications for units of measure, quantity kind, dimensions and data types.

Active · 154 stars · 2 weeks ago
HTML
NOASSERTION

Toolkit for large-scale whole-slide image processing supporting 22+ patch encoders (UNI, CONCH, Virchow, H-Optimus-0, etc.), slide encoders (TITAN, GigaPath, PRISM, CHIEF, Madeleine, Feather), tissue segmentation, and multi-GPU inference with end-to-end pipeline and smart resume for standardized deployment of computational pathology foundation models (Mahmood Lab, Harvard Medical School, 553+ stars)

Active · 567 stars · 2 weeks ago
Python
NOASSERTION

Vision foundation model for the tree of life, pretrained on diverse biological imagery across taxa for zero-shot species identification, trait extraction, and biodiversity research (Ohio State University Imageomics Institute)

Active · 259 stars · 2 weeks ago
Python
NOASSERTION

197 bioinformatics and life science skills for Claude Code and AI agents, achieving 92.0% accuracy on BixBench. Covers RNA-seq, single-cell analysis, drug discovery, proteomics, and more. Powers OmicsHorizon (195+ stars, 2026)

Active · 195 stars · 2 weeks ago
Python
NOASSERTION

A small language for defining pipeline stages and linking them together to make pipelines.

Active · 242 stars · 2 weeks ago
Groovy
NOASSERTION

98B-parameter frontier generative model jointly reasoning over protein sequence, structure, and function, trained on 2.78 billion proteins; generated a novel fluorescent protein (esmGFP) with only 58% sequence identity to known GFPs (EvolutionaryScale, 2024)

Active · 2.4K stars · 2 weeks ago
Jupyter Notebook
NOASSERTION

The modern C++ library for sequence analysis.

Active · 454 stars · 2 weeks ago
C++
NOASSERTION

An issue on the UBERON GitHub Issue tracker

Active · 155 stars · 2 weeks ago
Emacs Lisp
NOASSERTION

Simulation of large-scale brain models

Active · 929 stars · 2 weeks ago
Python
NOASSERTION

An object-oriented, WebGL-based JavaScript library for online molecular visualization.

Active · 973 stars · 3 weeks ago
Jupyter Notebook
NOASSERTION

AlphaFold 3 inference pipeline for unified biomolecular structure prediction of proteins, nucleic acids, small molecules, ions, and post-translational modifications (Google DeepMind, Nature 2024)

Active · 8.1K stars · 3 weeks ago
Python
NOASSERTION

Biological vision foundation model trained on TreeOfLife-200M, yielding extraordinary accuracy on diverse biological visual tasks including habitat classification and trait prediction despite a narrow training objective (Ohio State University Imageomics Institute)

Active · 68 stars · 3 weeks ago
Python
NOASSERTION

A collection of object-oriented software tools for problems involving chemical kinetics, thermodynamics, and transport processes.

Active · 806 stars · 3 weeks ago
C++
NOASSERTION

Julia differential equations suite

Active · 3.1K stars · 1 month ago
Julia
NOASSERTION

Machine learning interatomic potentials

Active · 1.2K stars · 1 month ago
Python
NOASSERTION

SPAdes (St. Petersburg genome assembler) is an assembly toolkit containing various assembly pipelines and the de facto standard for prokaryotic genome assemblies.

Active · 935 stars · 1 month ago
C++
NOASSERTION

Closed-loop multi-agent system from hypothesis to verification across 12 scientific tasks, #1 on MLE-Bench (36.44%)

Active · 1.3K stars · 1 month ago
Python
NOASSERTION

Benchmark quantifying end-to-end autonomous AI research abilities of LLM agents across 20 tasks from SOTA machine learning papers spanning NLP, code, math, biochemical modelling, and time series forecasting, with normalized score metrics against human SOTA and HuggingFace dataset

Active · 94 stars · 1 month ago
Python
NOASSERTION

Machine learning model predicting cellular perturbation response across diverse contexts with State Transition (ST) and State Embedding (SE) variants, featuring CLI tooling, PyPI distribution, and Virtual Cell Challenge integration (575+ stars)

Active · 587 stars · 1 month ago
Python
NOASSERTION

First physics-aligned interactive benchmark for LLM agents in engineering construction, designing rockets, cars, and bridges in a physics simulator with a 3D spatial geometry library

Active · 92 stars · 1 month ago
Python
NOASSERTION

Directed message passing neural networks for property prediction of molecules and reactions with uncertainty and interpretation.

Active · 2.4K stars · 1 month ago
Python
NOASSERTION

Benchmark evaluating AI agents on 75 curated Kaggle-style ML engineering competitions with reproducible Docker-based grading harness, human baselines, and end-to-end task lifecycle, used as a primary benchmark for autonomous ML research agents (e.g., InternAgent #1 at 36.44%)

Active · 1.5K stars · 1 month ago
Python
NOASSERTION

Access to Biological Web Services from Python.

Active · 337 stars · 1 month ago
Python
NOASSERTION

Dataset and benchmarking framework integrating histology and spatial transcriptomics, enabling multimodal analysis of whole-slide images with matched spatial gene expression for advancing computational pathology and tissue microenvironment research (Mahmood Lab, Harvard Medical School, 411+ stars)

Active · 411 stars · 1 month ago
Jupyter Notebook
NOASSERTION

Accessible protein design platform via Google Colab integrating AlphaFold2, RoseTTAFold, and ProteinMPNN for de novo hallucination, fixed backbone design, and binder design (Sergey Ovchinnikov, 2022+)

Active · 913 stars · 2 months ago
Python
NOASSERTION

Agent skill for AI-assisted review of scientific manuscript writing, distilled from Stanford's *Writing in the Sciences* course, performing five sequential editorial audit passes on clarity, voice, structure, consistency, and integrity (2026)

Active · 675 stars · 2 months ago
NOASSERTION

Baidu's open-source reproduction of AlphaFold3 in PaddlePaddle, providing pretrained weights and inference pipelines for unified biomolecular structure prediction across proteins, nucleic acids, ligands, ions, and post-translational modifications within the PaddleHelix biocomputing platform (Baidu, bioRxiv 2024)

Active · 1.1K stars · 2 months ago
Python
NOASSERTION

Ontology, part of the SI Reference Point, covering measurement units (SI base units and SI units with special names) and prefixes.

Active · 15 stars · 2 months ago
NOASSERTION

Genetic variant annotation and effect prediction toolbox.

Active · 308 stars · 3 months ago
Java
NOASSERTION

A Python package for protein dynamics analysis

Active · 546 stars · 3 months ago
Python
NOASSERTION

The Generative Artificial Intelligence Delegation Taxonomy (GAIDeT) assigns identifiers to contributor roles as an extension to the Contributor Roles Taxonomy (CRediT), promoting transparency and accountability in academic publishing when AI contributors are involved in research. It is operationalized in the [GAIDeT Declaration Generator](https://panbibliotekar.github.io/gaidet-declaration/), an interactive tool for researchers to disclose the delegation of tasks to generative AI (GAI) tools in accordance with the GAIDeT taxonomy.

Active · 7 stars · 3 months ago
HTML
NOASSERTION

This package addresses the mean-variance relationship in spatially resolved transcriptomics data. Precision weights are generated for individual observations using Empirical Bayes techniques. These weights are used to rescale the data and covariates, which are then used as input in spatially variable gene detection tools.

Active · 0 stars · 3 months ago
R
NOASSERTION

Multimodal deep learning framework integrating peptide-MHC protein sequence, structure, and biochemical properties to predict class-I immunogenicity for infectious disease epitopes and cancer neoepitopes with cancer-wildtype contrastive learning, enabling personalized vaccine design (Krishnaswamy Lab, Yale University)

Active · 44 stars · 3 months ago
Python
NOASSERTION

GenBio AI's software stack for the AI-Driven Digital Organism, supporting adaptation and finetuning of multiscale biological foundation models across DNA, RNA, protein, structure, and single-cell tasks with reproducible CLIs and pretrained model zoo (2025)

Active · 115 stars · 3 months ago
Python
NOASSERTION

Foundation models for genomics and transcriptomics pretrained on 3,000+ human genomes and 850+ diverse species, enabling chromatin accessibility prediction, splice site detection, and promoter classification across multiple model scales (InstaDeep, NVIDIA & TUM, Nature Methods 2023)

Active · 884 stars · 3 months ago
Jupyter Notebook
NOASSERTION

Universal pretrained neural network potential with charge and magnetic moment awareness, trained on 1.5M+ Materials Project inorganic structures for charge-informed molecular dynamics and phase diagram prediction (Berkeley, Nature Machine Intelligence 2023 Cover)

Active · 383 stars · 3 months ago
Python
NOASSERTION

First fully autonomous open-ended scientific discovery system with official implementation: hypothesis→experiment→writing→review simulation (13.8K+ stars, 2024)

Active · 14K stars · 5 months ago
Jupyter Notebook
NOASSERTION