Find open-source science resources

A directory of tools, AI models, datasets, and research resources for biotech, bioinformatics, and other scientific fields. Aggregated from curated GitHub awesome-lists, HuggingFace, bio.tools, Bioconductor, and more.

82 of 5,923 resources

Showing 5182

This package provides functions for the analysis of data generated by the multiplex substrate profiling by mass spectrometry for proteases (MSP-MS) method. Data exported from upstream proteomics software is accepted as input and subsequently processed for analysis. Tools for statistical analysis, visualization, and interpretation of the data are provided.

Idle16 months ago
R
NOASSERTION

Biocaml aims to be a high-performance user-friendly library for Bioinformatics.

Idle1256 months ago
OCaml
NOASSERTION

Foundation model for universal cell segmentation achieving state-of-the-art performance across bacteria, tissue, yeast, cell culture, and diverse imaging modalities (brightfield, fluorescence, phase), with pip-installable inference and Napari plugin (vanvalenlab/Caltech, bioRxiv 2024)

Idle1957 months ago
Python
NOASSERTION

Another list focuses on Python stuff related to Chemistry.

Idle1.4K8 months ago
NOASSERTION

HOSO is an ontology of informational entities and processes related to healthcare organizations and services.

Idle08 months ago
HTML
NOASSERTION

SIMD C library for global, semi-global, and local pairwise sequence alignments

Idle2849 months ago
C
NOASSERTION

DeepSeek's open-source large language model for formal theorem proving in Lean 4, integrating informal and formal mathematical reasoning through recursive subgoal decomposition and reinforcement learning powered by DeepSeek-V3, with open weights and ProverBench evaluation (2025)

Idle1.3K11 months ago
NOASSERTION

Unified Code for Units of Measure (UCUM) is a code system intended to include all units of measures being contemporarily used in international science, engineering, and business.

Idle9811 months ago
HTML
NOASSERTION

A project supporting the DRAO application ontology, a hierarchy of specific research domains and descriptors which imports subsets of terms from over 40 publicly-available terminologies. (from repository)

Idle212 months ago
Makefile
NOASSERTION

In silico directed evolution framework using few-shot active learning to optimize protein activities, enabling rapid protein engineering with minimal experimental data (352+ stars, 2023)

Idle3601 year ago
Python
NOASSERTION

GRIDSS: the Genomic Rearrangement IDentification Software Suite.

Idle2831 year ago
Java
NOASSERTION

General-purpose pathology foundation model pretrained on 100K+ diagnostic whole-slide images across 20 major tissue types, achieving state-of-the-art transfer learning across 30+ clinical tasks and serving as a universal feature extractor for digital pathology (Mahmood Lab, 722+ stars)

Idle7441 year ago
Jupyter Notebook
NOASSERTION

The eiR package provides utilities for accelerated structure similarity searching of very large small molecule data sets using an embedding and indexing approach.

Idle41 year ago
R
NOASSERTION

A terminology for the skills necessary to make data FAIR and to keep it FAIR.

Idle171 year ago
Makefile
NOASSERTION

The software uses the copy number segments from a text file and identifies all chromosome arms that are globally altered and computes various genome-wide scores. The following HRD scores (characteristic of BRCA-mutated cancers) are included: LST, HR-LOH, nLST and gLOH. the package is tailored for the ThermoFisher Oncoscan assay analyzed with their Chromosome Alteration Suite (ChAS) but can be adapted to any input.

Idle31 year ago
R
NOASSERTION

Universal chart comprehension and reasoning model

Idle1351 year ago
Python
NOASSERTION

The Semantic Web for Earth and Environmental Terminology is a mature foundational ontology that contains over 6000 concepts organized in 200 ontologies represented in OWL. Top level concepts include Representation (math, space, science, time, data), Realm (Ocean, Land Surface, Terrestrial Hydroshere, Atmosphere, etc.), Phenomena (macro-scale ecological and physical), Processes (micro-scale physical, biological, chemical, and mathematical), Human Activities (Decision, Commerce, Jurisdiction, Environmental, Research).

Idle1401 year ago
Turtle
NOASSERTION

[RDKit](http://www.rdkit.org/) and [OSRA](https://cactus.nci.nih.gov/osra/) in the [Bottle](http://bottlepy.org/docs/dev/) on [Tornado](http://www.tornadoweb.org/en/stable/).

Archived502 years ago
Python
NOASSERTION

A VCF Parser for Python.

Stale4192 years ago
Python
NOASSERTION

Educational resource on performing RNA-seq analysis in the cloud using Amazon AWS cloud services. Topics include preparing the data, preprocessing, differential expression, isoform discovery, data visualization, and interpretation.

Stale1.4K3 years ago
R
NOASSERTION
Stale693 years ago
Makefile
NOASSERTION

Learning nonlinear operators

Stale8193 years ago
Python
NOASSERTION

AI for chemical reaction prediction and synthesis planning

Stale4244 years ago
Python
NOASSERTION

FASTQ/A short-reads pre-processing tools: Demultiplexing, trimming, clipping, quality filtering, and masking utilities.

Stale2024 years ago
C
NOASSERTION
Stale04 years ago
NOASSERTION

This proposed vocabulary allows edges in Property Graphs (e.g Neo4j, RDF*) to be augmented with edge properties that specify ontological semantics, including (but not limited) to OWL-DL interpretations. [from GitHub]

Stale355 years ago
Makefile
NOASSERTION

The Reagent Ontology (ReO) adheres to OBO Foundry principles (obofoundry.org) to model the domain of biomedical research reagents, considered broadly to include materials applied “chemically” in scientific techniques to facilitate generation of data and research materials. ReO is a modular ontology that re-uses existing ontologies to facilitate cross-domain interoperability. It consists of reagents and their properties, linking diverse biological and experimental entities to which they are related. ReO supports community use cases by providing a flexible, extensible, and deeply integrated framework that can be adapted and extended with more specific modeling to meet application needs.

Stale06 years ago
Python
NOASSERTION

Flexible circular visualization of genome-associated data with BioPerl and SVG.

Stale466 years ago
Perl
NOASSERTION