Find open-source science resources

A directory of tools, AI models, datasets, and research resources for biotech, bioinformatics, and other scientific fields. Aggregated from curated GitHub awesome-lists, HuggingFace, bio.tools, Bioconductor, and more.

126 of 5,893 resources

Showing 51100

A fuzzy Bruijn graph approach to long noisy reads assembly

Stale5302 years ago
C
GPL-3.0

A VCF Parser for Python.

Stale4192 years ago
Python
NOASSERTION

Git repo of useful single line commands.

Stale2K2 years ago

Educational resource on performing RNA-seq analysis in the cloud using Amazon AWS cloud services. Topics include preparing the data, preprocessing, differential expression, isoform discovery, data visualization, and interpretation.

Stale1.4K3 years ago
R
NOASSERTION

Easily submitting PBS jobs with script template. Multiple input files supported.

Stale293 years ago
Python
MIT

Syntax Highlighting for Computational Biology file formats (SAM, VCF, GTF, FASTA, PDB, etc...) in vim/less/gedit/sublime.

Stale2723 years ago
Shell
GPL-3.0

Point and click, cross platform suite for analysing and visualizing next-generation sequencing datasets.

Stale173 years ago
TypeScript
GPL-3.0

Go Get Data; A command line interface for obtaining genomic data.

Stale423 years ago
Python
MIT

FASTQ/A short-reads pre-processing tools: Demultiplexing, trimming, clipping, quality filtering, and masking utilities.

Stale2024 years ago
C
NOASSERTION

JavaScript library that can be used to generate interactive and highly customizable web-based genome browsers.

Stale2814 years ago
JavaScript
Apache-2.0

BioJS is a library of over hundred JavaScript components enabling you to visualize and process data using current web technologies.

Stale5064 years ago
Apache-2.0

Table file index.

Archived924 years ago
C

Computation Pipeline library for python widely used in science and bioinformatics.

Stale1754 years ago
Python
MIT

Easy-to-use DNA sequence visualization tool that turns FASTA files into browser-based visualizations.

Archived424 years ago
Python
MIT

Automatic Filtering, Trimming, Error Removing and Quality Control for fastq data.

Stale2146 years ago
Python
MIT

Cufflinks assembles transcripts, estimates their abundances, and tests for differential expression and regulation in RNA-Seq samples.

Stale3226 years ago
C++
BSL-1.0

Flexible circular visualization of genome-associated data with BioPerl and SVG.

Stale466 years ago
Perl
NOASSERTION

JavaScript library for drawing canvas-based gene diagrams.

Stale767 years ago
JavaScript

Expertly curated genomics papers to get up to speed on genomics, RNA-seq, statistics (used in genomics), software development, and more.

Stale5027 years ago

Perl package for circular plots, which are well suited for genomic rearrangements.

Stale887 years ago
Perl

Telseq is a tool for estimating telomere length from whole genome sequence data.

Stale767 years ago
C++
GPL-3.0

List of resources on alternative splicing including software, databases, and other tools.

Stale588 years ago

D3 JavaScript based genome viewer. Constructs SVGs.

Stale3310 years ago
JavaScript
GPL-2.0

International association of users & developers of open source Perl tools for bioinformatics, genomics and life sciences.

Rust implementations of algorithms and data structures useful for bioinformatics.

Java framework for processing biological data.

Easily get SRA download links and other information.

Modular and universal bioinformatics, Bionode provides pipeable UNIX command line tools and JavaScript APIs for bioinformatics analysis workflows.

A wee tool for random access into BGZF files.

Fast FASTQ filtering by matching reads against one or more regex patterns.

Write-once-read-many table for large datasets.

Create an index on a compressed text file.

A cross-system scripting language for working with big data pipelines in computer systems of different sizes and capabilities.

Hadoop Oozie-based workflow system focused on genomics data analysis in cloud environments.

Workflow standard developed by the Broad.

A flexible pipeline, built with Nextflow, for the complete analysis of bacterial genomes.

A generic but comprehensive bacterial annotation pipeline, built with Nextflow, with nice graphical options for investigating results.

Customizable pipeline for differential expression analysis with an intuitive GUI.

A pipeline for preprocessing short and long sequencing reads, built with Nextflow.

Aggregate results from bioinformatics analyses across many samples into a single report.

A cross-platform and ultrafast toolkit for FASTA/Q file manipulation in Golang.

Toolkit for processing sequences in FASTA/Q formats.

Scalable genomic analysis.

Scalable gVCF merging and joint variant calling for population sequencing projects.

An ultrafast and memory-efficient tool for aligning sequencing reads to long reference sequences.

the wavefront alignment algorithm (WFA) which expoit sequence similarity to speed up alignment

An ultrafast protein aligner for `blastp` and `blastx` like searches.

Partial-Order Alignment for fast alignment and consensus of multiple homologous sequences.

A software package for estimating gene and isoform expression levels from RNA-Seq data.

Bayesian haplotype-based polymorphism discovery and genotyping.