Find open-source science resources

A directory of tools, AI models, datasets, and research resources for biotech, bioinformatics, and other scientific fields. Aggregated from curated GitHub awesome-lists, HuggingFace, bio.tools, Bioconductor, and more.

15 of 5,923 resources

A haplotype-resolved assembler for accurate Hifi reads.

Active7791 week ago
C++
MIT

A Flexible Model For Record Linkage

Active12 weeks ago
C++
GPL-3.0-or-later

The modern C++ library for sequence analysis.

Active4542 weeks ago
C++
NOASSERTION

Structural variant discovery by integrated paired-end and split-read analysis.

Active5212 weeks ago
C++
BSD-3-Clause

A collection of object-oriented software tools for problems involving chemical kinetics, thermodynamics, and transport processes.

Active8063 weeks ago
C++
NOASSERTION

A small <720Kb C++ windows utility. That allows you to load Ancestry, 23andMe, FTDNA, or Genes for Good RAW DNA files search them, merge them. covert them to Ancestry format. But also create files from peer reviewed publications to compare with you loaded data to give your genetic disposition for the condition you have entered the data for an statistical risk if OR values are included. Included with the program are example files for Type 2 Diabetes risk factors. (As I have type 2 Diabetes so I could test the results).

Active03 weeks ago
C++
GPL-3.0

SPAdes (St. Petersburg genome assembler) is an assembly toolkit containing various assembly pipelines and the de-facto standard for prokaryotic genome assemblies.

Active9351 month ago
C++
NOASSERTION

Descriptor library containing a variety of fingerprinting techniques, including the Smooth Overlap of Atomic Positions (SOAP).

Active4661 month ago
C++
Apache-2.0

A single molecule sequence assembler for genomes large and small.

Active7003 months ago
C++

A polymorphic bayesian genotyping model with wide applicability.

Active3233 months ago
C++
MIT

Collection of tools for working with BAM files.

Idle4301 year ago
C++
MIT

A system for rapidly aligning entire genomes, whether in complete or draft form.

Idle5611 year ago
C++
Artistic-2.0

Cufflinks assembles transcripts, estimates their abundances, and tests for differential expression and regulation in RNA-Seq samples.

Stale3226 years ago
C++
BSL-1.0

Telseq is a tool for estimating telomere length from whole genome sequence data.

Stale767 years ago
C++
GPL-3.0

maeparser is a parser for Schrodinger Maestro files.