Find open-source science resources
A directory of tools, AI models, datasets, and research resources for biotech, bioinformatics, and other scientific fields. Aggregated from curated GitHub awesome-lists, HuggingFace, bio.tools, Bioconductor, and more.
Filters
Health
Domain
Language
License(1)
Source(1)
Type
12 of 5,893 resources
A compressor of common genomic file formats (BAM, CRAM, FASTQ, VCF etc).
samtools/bcftools are a suite of tools for manipulating NGS data and can be used to call variants.
A small language for defining pipeline stages and linking them together to make pipelines.
The modern C++ library for sequence analysis.
SPAdes (St. Petersburg genome assembler) is an assembly toolkit containing various assembly pipelines and the de-facto standard for prokaryotic genome assemblies.
Access to Biological Web Services from Python.
Genetic variant annotation and effect prediction toolbox.
Biocaml aims to be a high-performance user-friendly library for Bioinformatics.
SIMD C library for global, semi-global, and local pairwise sequence alignments
GRIDSS: the Genomic Rearrangement IDentification Software Suite.
Educational resource on performing RNA-seq analysis in the cloud using Amazon AWS cloud services. Topics include preparing the data, preprocessing, differential expression, isoform discovery, data visualization, and interpretation.
FASTQ/A short-reads pre-processing tools: Demultiplexing, trimming, clipping, quality filtering, and masking utilities.