Find open-source science resources

Google DeepMind's official collection of agentic science skills accelerating scientific workflows with better grounding and higher token efficiency, integrating insights from AlphaGenome, AFDB, UniProt and 30+ other databases and tools (2026)

Active2.1K1 month ago

Literature Management Plugins

PapersGPT for Zotero

Multi-PDF conversation, retrieval, and citation in Zotero with commercial/local models (Ollama), MCP support

Active2.5K1 month ago

JavaScript

AGPL-3.0

Paper2Poster

Poster Generation

Multi-agent system with Parser-Planner-Painter architecture converting `paper.pdf` to editable `poster.pptx`, outperforms GPT-4o with 87% fewer tokens

Active3.8K1 month ago

Autonomous Research Systems (2023-2025 Breakthroughs)

ScienceClaw

Self-evolving AI research colleague built on OpenClaw with 285+ runtime-adaptive skills across 28+ disciplines, persistent cross-session research memory, and zero-hallucination citation protocols; agent autonomously writes new SKILL.md files based on research patterns without redeployment (828+ stars, MIT License, 2026)

Active8641 month ago

TypeScript

MONAI

NVIDIA and King's College London's open-source AI toolkit for healthcare imaging, providing foundational frameworks for medical image annotation (MONAI Label), training (MONAI Core), and deployment (MONAI Deploy) across radiology, pathology, and endoscopy (8K+ stars, Apache 2.0)

Active8.3K1 month ago

JAX

Machine Learning

High-performance ML research

Active35.8K1 month ago

Carbon (Hugging Face, 2026)

Family of causal genomic foundation models trained on 1T tokens (~6T DNA base pairs) from the Carbon Pretraining Corpus, combining eukaryote genes, mRNA transcripts, and prokaryote genomes with a hybrid text/6-mer tokenizer; Carbon-3B matches or beats Evo2-7B on zero-shot DNA evaluations including sequence recovery, variant effect prediction, and perturbations (Apache 2.0, 201+ stars)

Active1991 month ago

Scientific Literature RAG & Analysis

PaperQA2

High-accuracy RAG for scientific PDFs with citation support, agentic RAG, and contradiction detection

Active8.6K1 month ago

Autonomous Research Systems (2023-2025 Breakthroughs)

Aviary

Language agent gymnasium for challenging scientific tasks including DNA manipulation, literature search, and protein engineering

Active2711 month ago

GPN-Star (Song Lab, UC Berkeley, bioRxiv 2025)

Phylogeny-aware genomic language model trained on whole-genome alignments across multiple evolutionary timescales, predicting functional constraints and variant effects for human, mouse, chicken, fly, worm, and Arabidopsis genomes (344+ stars, MIT License)

Active3441 month ago

Autonomous Research Systems (2023-2025 Breakthroughs)

CORAL (arXiv 2026)

Robust, lightweight infrastructure for multi-agent autonomous self-evolution, built for autoresearch; agents run in isolated git worktrees, share knowledge through a common state directory, and are scored by a grader daemon; natively integrated with Claude Code, Codex, Cursor Agent, OpenCode, and Kiro (672+ stars, Apache 2.0)

Active7121 month ago

Helical

Unified framework for state-of-the-art pre-trained bio foundation models across genomics and transcriptomics, providing standardized interfaces and pipelines for DNA, RNA, and single-cell models including Evo 2, Geneformer, scGPT, and UCE with streamlined inference, benchmarking, and fine-tuning workflows (213+ stars, 2024-2025)

Active2191 month ago

AGPL-3.0

RNAPro (NVIDIA, 2026)

State-of-the-art RNA 3D folding model developed with Stanford Das Lab and Kaggle competition winners, featuring a 488M-parameter AF3-like architecture with MSA and template-based modeling, enabling structure-driven drug discovery and RNA therapeutics design (NVIDIA-Digital-Bio, Apache 2.0)

Active861 month ago

Microsoft Biodiversity

Ecological Modeling

Microsoft AI for Good Lab's open-source biodiversity research hub providing AI models, edge devices, and tools for wildlife monitoring and conservation, including MegaDetector (camera trap animal detection), SPARROW (species recognition), PytorchWildlife (conservation AI toolkit), and bioacoustics analysis pipelines (1K+ stars)

Active1K1 month ago

Best of Atomistic Machine Learning

Materials Discovery

High-Performance Document Processing

Curated list of atomistic ML projects for materials science

Active6911 month ago

CC-BY-SA-4.0

PaddleOCR 3.0 (2024/2025)

Advanced OCR with PP-StructureV3 document parsing, 13% accuracy improvement, supports 80+ languages

Active81.3K1 month ago

STARLING (Holehouse Lab, Nature 2026)

Latent-space probabilistic denoising diffusion model for predicting coarse-grained conformational ensembles of intrinsically disordered proteins and regions from sequence, with GPU/CPU inference, trajectory export, and FAISS-based similarity search (67+ stars, LGPL-3.0)

Active671 month ago

CryoDRGN

Neural network-based cryo-EM heterogeneous reconstruction, modeling continuous 3D structure distributions from single-particle images, with CryoDRGN-ET extending to in-cell cryo-electron tomography (MIT CSAIL, Nature Methods 2021/2024)

Active3781 month ago

GPL-3.0

MoleCode

LLM for Chemistry

LLM-native molecular language that represents molecules as explicit graph-based code, enabling LLMs to operate and reason on chemistry directly with 5× lower token cost and ~76-80% accuracy on novel molecules vs ~20% for SMILES; supports small molecules, polymers, and Markush structures with lossless RDKit interconversion and Claude Code/Codex agent skills (AtomFlow, arXiv:2605.16480, 281+ stars, MIT License, 2026)

Active2811 month ago

cellSAM

Foundation model for universal cell segmentation achieving state-of-the-art performance across bacteria, tissue, yeast, cell culture, and diverse imaging modalities (brightfield, fluorescence, phase), with pip-installable inference and Napari plugin (vanvalenlab/Caltech, bioRxiv 2024)

Active2021 month ago

Physics-Informed Neural Networks

NeuralPDE.jl

Physics-informed neural networks in Julia

Active1.2K1 month ago

Julia

Autonomous Research Systems (2023-2025 Breakthroughs)

AutoResearchClaw

Fully autonomous research from idea to paper with multi-agent debate, citation verification, and OpenClaw integration (11K+ stars, 2026)

Active13.5K1 month ago

BiaPy

Open-source deep learning toolbox for bioimage analysis providing a unified, configuration-driven framework for 2D/3D semantic segmentation, instance segmentation, classification, denoising, super-resolution, and self-supervised learning; integrates state-of-the-art architectures including U-Net, Vision Transformers, and ConvNeXt, designed for microscopy and biomedical imaging researchers without extensive coding expertise (MIT License, actively maintained)

Active2011 month ago

Scientific Literature RAG & Analysis

Valsci

Self-hostable scientific claim-verification and literature-review tool combining Semantic Scholar retrieval, bibliometric scoring, and LLM-based evidence synthesis for large-batch validation workflows

Active121 month ago

Interactive Research Environments

GPL-3.0

Jupyter AI (JupyterLab Extension)

Official Jupyter extension with `%%ai` magic commands and sidebar chat assistant, connecting multiple model providers and local inference

Active4.3K1 month ago

Machine Learning for Physics

BSD-3-Clause

JAX-MD

Molecular dynamics in JAX

Active1.4K1 month ago

Domain-Specific Research Agents

Foam-Agent (NeurIPS 2025)

End-to-end composable multi-agent framework for automating OpenFOAM-based CFD simulations from natural language prompts, managing meshing, case setup, execution, error correction, and post-processing; achieves 100% success rate on 110 FoamBench tasks with Claude Opus 4.6 through Architect-Input Writer-Runner-Reviewer agent collaboration with RAG-enhanced generation and MCP tool integration (RPI CSML, 242+ stars, MIT License)

Active2541 month ago

micro-sam

Segment Anything Model for microscopy: interactive and automatic segmentation of light, electron, and fluorescence microscopy images in 2D and 3D, with domain-specific fine-tuning workflows for scientific imaging (1.5K+ stars)

Active6851 month ago

Neuroscience & Behavioral Analysis

braindecode

Deep learning software to decode EEG, ECG or MEG signals, providing standardized neural network models, preprocessing pipelines, and evaluation workflows for brain-computer interfaces and cognitive neuroscience research (1.2K+ stars, BSD 3-Clause, actively maintained)

Active1.2K1 month ago

Autonomous Research Systems (2023-2025 Breakthroughs)

BSD-3-Clause

Denario (AstroPilot-AI, Agents4Science 2025)

Domain-Specific Research Agents

Modular multi-agent scientific research assistant that automates idea generation, literature review, methodology design, code execution in Docker, visualization, LaTeX paper writing, and peer-review simulation across 10+ disciplines; winner of the NeurIPS 2025 Fair Universe Competition (573+ stars, GPL-3.0, 2025-2026)

Active5761 month ago

TeX

GPL-3.0

Lean Copilot

LLMs as copilots for theorem proving in Lean 4, exposing native tactics (`suggest_tactics`, `search_proof`, `select_premises`) that embed language model inference and premise retrieval directly inside the Lean proof environment, supporting local CTranslate2/CUDA inference as well as remote model APIs for interactive and automated proof search (Caltech & NVIDIA, NeurIPS 2024, 1.2K+ stars)

Active1.3K1 month ago

C++

Autonomous Research Systems (2023-2025 Breakthroughs)

EvoScientist

Self-evolving AI scientist with 6 specialized sub-agents (plan/research/code/debug/analyze/write) and persistent memory, #1 on DeepResearch Bench II and AstaBench, supporting multi-provider LLMs and multi-channel deployment (Apache 2.0, 2026)

Active3.3K1 month ago

GeoAI

Climate Modeling

High-level open-source geospatial AI package for satellite/aerial imagery analysis, model training, inference, interactive visualization, and QGIS integration, bridging PyTorch/Transformers with remote sensing workflows (MIT, 2026)

Active3.1K1 month ago

Domain-Specific Research Agents

Biomni

General-purpose biomedical AI agent integrating LLM reasoning with retrieval-augmented planning and code-based execution to autonomously execute diverse biomedical research tasks and generate testable hypotheses (Stanford SNAP, bioRxiv 2025)

Active3.2K1 month ago

Neuroscience & Behavioral Analysis

DeepLabCut

Markerless pose estimation of user-defined features with deep learning for all animals including humans, enabling quantitative behavioral analysis in neuroscience and ethology (Nature Neuroscience 2018, 5.6K+ stars)

Active5.7K1 month ago

LGPL-3.0

SwitchCraft

Programmatic framework for designing state-switching proteins via backpropagation through compositional design constraints parameterized by structure prediction models; enables de novo design of allosteric regulators and fluorescent biosensors for arbitrary small-molecule analytes (79+ stars, MIT License, ICML 2026)

Active791 month ago

Neural Operators & Model Discovery

exponax

Efficient differentiable n-dimensional PDE solvers built on JAX and Equinox, shipping 46+ built-in equations with Fourier spectral methods, exponential time differencing, and full auto-differentiation for physics-based deep learning workflows (MIT, 200+ stars, 2024)

Active2101 month ago

Neuroscience & Behavioral Analysis

CEBRA (Nature 2023)

Learnable latent embeddings for joint behavioral and neural analysis, enabling consistent and interpretable mapping of neural activity to behavior across modalities, species, and experiments (EPFL & Harvard, 1K+ stars)

Active1.1K1 month ago

cellxgene (Chan Zuckerberg Initiative)

Interactive explorer for single-cell transcriptomics data enabling visualization of UMAP/t-SNE embeddings, differential expression analysis, and cross-dataset comparison through a fast web-based interface; widely adopted for exploring atlas-scale single-cell datasets and integrating with AI/ML analysis workflows (773+ stars, MIT License)

Active7731 month ago

JavaScript

Simulation-Based Inference

sbi

Python package for simulation-based inference enabling likelihood-free Bayesian parameter estimation from scientific simulators, with flexible interfaces for neural posterior estimation, sequential methods, and MCMC/variational backends (Mackelab, 825+ stars)

Active8281 month ago

Boltz

First fully open-source model achieving AlphaFold3-level accuracy with 1000x faster binding affinity prediction (MIT)

Active4K1 month ago

BioCLIP (CVPR 2024)

Ecological Modeling

Vision foundation model for the tree of life, pretrained on diverse biological imagery across taxa for zero-shot species identification, trait extraction, and biodiversity research (Ohio State University Imageomics Institute)

Active2591 month ago

Allegro

Materials Discovery

Highly scalable equivariant deep learning interatomic potentials enabling million-atom molecular dynamics simulations with ab initio accuracy, building on E(3)-equivariant architectures for large-scale atomistic modeling (mir-group, MIT License, 480+ stars)

Active4921 month ago

BioReason (NeurIPS 2025)

First architecture deeply integrating a DNA foundation model with an LLM for multimodal biological reasoning, achieving 98% accuracy on KEGG disease pathway prediction and 15%+ average gains on variant effect prediction with interpretable step-by-step reasoning traces (bowang-lab, 390+ stars)

Active3981 month ago

BioSimulators

Ecological Modeling

Biological simulation tools

Active151 month ago

Autonomous Research Systems (2023-2025 Breakthroughs)

EvoMaster (SJTU SAI, arXiv 2026)

Foundational auto-research agent framework for agentic science at scale, providing modular agent construction, run-level self-evolution, and multiple SciMaster domain agents (ML-Master, X-Master, Browse-Master); outperforms general-purpose agents across authoritative benchmarks including the OpenAI Frontier Science Benchmark (206+ stars, Apache 2.0, 2026)

Active2061 month ago

Research Workbench & Plugins

SciAgent-Skills

197 bioinformatics and life science skills for Claude Code and AI agents, achieving 92.0% accuracy on BixBench. Covers RNA-seq, single-cell analysis, drug discovery, proteomics, and more. Powers OmicsHorizon (195+ stars, 2026)

Active1951 month ago

Autonomous Research Systems (2023-2025 Breakthroughs)

AutoScientists (Harvard MIMS, arXiv 2026)

Decentralized self-organizing teams of AI agents for long-running computational scientific experimentation; agents critique each other's proposals before spending compute and share successes/failures to avoid redundant exploration, achieving +8.33% on BioML-Bench, 1.9× faster nanoGPT optimization, and +12.5% on ProteinGym ACE2-Spike (425+ stars, 2026)

Active6781 month ago

IgGM

Generative foundation model for functional antibody and nanobody design, supporting de novo generation, affinity maturation, inverse design, structure prediction, and humanization (Tencent AI4S, ICLR 2025)

Active2011 month ago

AION (arXiv 2025)

Astronomy & Astrophysics

Polymathic AI's large omnimodal foundation model for astronomical surveys, seamlessly integrating 39 distinct data modalities including imaging, spectra, photometry, and catalog entries for similarity search, property prediction, and generative modeling across legacy surveys (MIT)

Active1351 month ago