Find open-source science resources
A directory of tools, AI models, datasets, and research resources for biotech, bioinformatics, and other scientific fields. Aggregated from curated GitHub awesome-lists, HuggingFace, bio.tools, Bioconductor, and more.
Filters
Health
Domain
Language
License
Source(1)
Type
126 of 5,893 resources
Showing 51–100
A fuzzy Bruijn graph approach to long noisy reads assembly
Git repo of useful single line commands.
Educational resource on performing RNA-seq analysis in the cloud using Amazon AWS cloud services. Topics include preparing the data, preprocessing, differential expression, isoform discovery, data visualization, and interpretation.
Easily submitting PBS jobs with script template. Multiple input files supported.
Syntax Highlighting for Computational Biology file formats (SAM, VCF, GTF, FASTA, PDB, etc...) in vim/less/gedit/sublime.
Point and click, cross platform suite for analysing and visualizing next-generation sequencing datasets.
Go Get Data; A command line interface for obtaining genomic data.
FASTQ/A short-reads pre-processing tools: Demultiplexing, trimming, clipping, quality filtering, and masking utilities.
JavaScript library that can be used to generate interactive and highly customizable web-based genome browsers.
BioJS is a library of over hundred JavaScript components enabling you to visualize and process data using current web technologies.
Computation Pipeline library for python widely used in science and bioinformatics.
Easy-to-use DNA sequence visualization tool that turns FASTA files into browser-based visualizations.
Automatic Filtering, Trimming, Error Removing and Quality Control for fastq data.
Cufflinks assembles transcripts, estimates their abundances, and tests for differential expression and regulation in RNA-Seq samples.
Flexible circular visualization of genome-associated data with BioPerl and SVG.
JavaScript library for drawing canvas-based gene diagrams.
Expertly curated genomics papers to get up to speed on genomics, RNA-seq, statistics (used in genomics), software development, and more.
Perl package for circular plots, which are well suited for genomic rearrangements.
Telseq is a tool for estimating telomere length from whole genome sequence data.
List of resources on alternative splicing including software, databases, and other tools.
D3 JavaScript based genome viewer. Constructs SVGs.
International association of users & developers of open source Perl tools for bioinformatics, genomics and life sciences.
Rust implementations of algorithms and data structures useful for bioinformatics.
Java framework for processing biological data.
Easily get SRA download links and other information.
Modular and universal bioinformatics, Bionode provides pipeable UNIX command line tools and JavaScript APIs for bioinformatics analysis workflows.
A wee tool for random access into BGZF files.
Fast FASTQ filtering by matching reads against one or more regex patterns.
Write-once-read-many table for large datasets.
Create an index on a compressed text file.
A cross-system scripting language for working with big data pipelines in computer systems of different sizes and capabilities.
Hadoop Oozie-based workflow system focused on genomics data analysis in cloud environments.
Workflow standard developed by the Broad.
A flexible pipeline, built with Nextflow, for the complete analysis of bacterial genomes.
A generic but comprehensive bacterial annotation pipeline, built with Nextflow, with nice graphical options for investigating results.
Customizable pipeline for differential expression analysis with an intuitive GUI.
A pipeline for preprocessing short and long sequencing reads, built with Nextflow.
Aggregate results from bioinformatics analyses across many samples into a single report.
A cross-platform and ultrafast toolkit for FASTA/Q file manipulation in Golang.
Toolkit for processing sequences in FASTA/Q formats.
Scalable genomic analysis.
Scalable gVCF merging and joint variant calling for population sequencing projects.
An ultrafast and memory-efficient tool for aligning sequencing reads to long reference sequences.
the wavefront alignment algorithm (WFA) which expoit sequence similarity to speed up alignment
Partial-Order Alignment for fast alignment and consensus of multiple homologous sequences.
A software package for estimating gene and isoform expression levels from RNA-Seq data.
Bayesian haplotype-based polymorphism discovery and genotyping.