Find open-source science resources

A compressor for common genomic file formats (BAM, CRAM, FASTQ, VCF, etc.).

Active · 183 · 1 week ago

NOASSERTION

bcftools

Variant Calling

samtools/bcftools are a suite of tools for manipulating NGS data and can be used to call variants.

Active · 871 · 1 week ago

NOASSERTION

Fast and accurate protein structure search using a learned 3Di structural alphabet (VQ-VAE) that discretizes tertiary interactions into structural tokens. This enables protein-universe-scale structural alignment at sequence-search speeds (4-5 orders of magnitude faster than DALI/TM-align) and underpins many AI4S tools such as SaProt, ESMAtlas search, and AFDB clustering pipelines (Steinegger Lab, Nature Biotechnology 2023).

Active · 1.2K · 1 month ago

SPALN

Mapping

Genome mapping and spliced alignment of cDNA or amino acid sequences

Active · 113 · 3 months ago

GPL-2.0

BWA-FastAlign

Pairwise

BWA-MEM drop-in replacement: 2-3x faster, 2-5x cheaper, 100% identical output on standard CPUs.

Active · 22 · 3 months ago

Structural variant callers

lumpy

lumpy: a general probabilistic framework for structural variant discovery.

Active · 342 · 3 months ago

pyBigWig

Computational biology

A Python extension, written in C, for quick access to bigBed files and for reading and creating bigWig files.

Active · 244 · 5 months ago

Parasail

Pairwise

SIMD C library for global, semi-global, and local pairwise sequence alignments

Idle · 284 · 9 months ago

NOASSERTION
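
To make the alignment modes named in the Parasail entry concrete, here is a minimal scalar sketch of score-only global (Needleman-Wunsch) and local (Smith-Waterman) alignment. It is not Parasail's API and uses no SIMD; the function name `align_score`, the linear gap penalty, and the scoring values are assumptions chosen for illustration, and semi-global mode is left out.

```python
def align_score(a, b, mode="local", match=2, mismatch=-1, gap=-2):
    """Score-only pairwise alignment with a linear gap penalty (illustrative).

    mode="global": both sequences are aligned end to end.
    mode="local":  the best-scoring pair of substrings is aligned.
    """
    if mode not in ("global", "local"):
        raise ValueError("mode must be 'global' or 'local'")
    local = mode == "local"

    # First DP row: leading gaps are free in local mode, penalized in global mode.
    prev = [0 if local else j * gap for j in range(len(b) + 1)]
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0 if local else i * gap]
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            score = max(diag, prev[j] + gap, curr[j - 1] + gap)
            if local:
                # Local alignment may restart anywhere, so scores never go below 0.
                score = max(score, 0)
                best = max(best, score)
            curr.append(score)
        prev = curr
    return best if local else prev[-1]
```

For example, `align_score("ACGT", "AGT", mode="global")` aligns the full sequences with one gap, while `align_score("AAACGTAAA", "CCACGTCC")` scores only the shared `ACGT` core.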

minigraph

Genomics

Minigraph is a sequence-to-graph mapper and graph constructor. For graph generation, it aligns a query sequence against a sequence graph and incrementally augments an existing graph with long query subsequences diverged from the graph.

Idle · 481 · 10 months ago

BWA

Pairwise

Burrows-Wheeler Aligner for pairwise alignment between DNA sequences.

Idle · 1.7K · 1 year ago
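
The Burrows-Wheeler transform behind BWA's name can be illustrated with a toy sketch. The code below builds the transform by sorting all rotations and counts exact pattern matches with backward search. It is not how BWA is implemented (production aligners use a compressed FM-index rather than naive rotations); the function names are hypothetical, and the text is assumed not to contain the `$` sentinel.

```python
from collections import Counter


def bwt(text):
    """Burrows-Wheeler transform via sorted rotations (toy version, O(n^2 log n))."""
    text += "$"  # sentinel; assumed absent from the input and smaller than any base
    rotations = sorted(text[i:] + text[:i] for i in range(len(text)))
    return "".join(rotation[-1] for rotation in rotations)


def count_occurrences(bwt_str, pattern):
    """Count exact occurrences of pattern in the original text by backward search."""
    # first[c] = index of the first sorted rotation that starts with character c.
    counts = Counter(bwt_str)
    first, total = {}, 0
    for ch in sorted(counts):
        first[ch] = total
        total += counts[ch]

    lo, hi = 0, len(bwt_str)  # current range of matching rotations
    for ch in reversed(pattern):
        if ch not in first:
            return 0
        # Rank queries are done by naive slicing; a real FM-index precomputes them.
        lo = first[ch] + bwt_str[:lo].count(ch)
        hi = first[ch] + bwt_str[:hi].count(ch)
        if lo >= hi:
            return 0
    return hi - lo
```

Usage: `count_occurrences(bwt("banana"), "ana")` counts the two overlapping occurrences of `ana` without scanning the original text.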

Bedtools2

GFF BED File Utilities

A Swiss Army knife for genome arithmetic.

Idle · 1K · 1 year ago
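
"Genome arithmetic" here means set-like operations on genomic intervals. As a rough illustration, the sketch below computes the overlapping segments between two interval lists, similar in spirit to what `bedtools intersect` reports. It is a plain-Python approximation, not Bedtools code; the function name `intersect` is hypothetical, and intervals are assumed to be BED-style `(chrom, start, end)` tuples with 0-based, half-open coordinates.

```python
def intersect(a, b):
    """Return the overlapping segments between two lists of (chrom, start, end).

    Quadratic per chromosome; real tools use sorted sweeps or interval trees.
    """
    # Group the second list by chromosome so only same-chromosome pairs are compared.
    by_chrom = {}
    for chrom, start, end in b:
        by_chrom.setdefault(chrom, []).append((start, end))

    overlaps = []
    for chrom, start, end in a:
        for b_start, b_end in by_chrom.get(chrom, []):
            lo, hi = max(start, b_start), min(end, b_end)
            if lo < hi:  # half-open intervals: touching ends do not overlap
                overlaps.append((chrom, lo, hi))
    return overlaps
```

With half-open coordinates, `("chr2", 0, 5)` and `("chr2", 5, 10)` are adjacent rather than overlapping, so they contribute nothing to the result.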

DAZZ_DB

Computational biology

A database system designed to store, organize, and manage large-scale nucleotide sequencing read data (like PacBio reads) for the Dazzler genome assembler

Idle · 36 · 1 year ago

Other

wtdbg2

Long-read Assembly

A fuzzy Bruijn graph approach to long noisy reads assembly

Stale · 530 · 2 years ago

VerityMap

Mapping

VerityMap is a tool for mapping long reads to assemblies of extra-long tandem repeats, producing SAM files and identifying potential heterozygous sites and assembly errors through analysis of rare k-mers. It supports PacBio HiFi and ONT reads and generates interactive HTML plots for variant analysis.

Stale · 39 · 3 years ago
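
The idea of a "rare k-mer" mentioned in the VerityMap entry can be sketched as follows. This is only an illustration of k-mer counting: the function name `rare_kmers` and the `max_count` threshold are assumptions, and VerityMap's own definition of rarity and its handling of reverse complements may differ.

```python
from collections import Counter


def rare_kmers(seq, k, max_count=1):
    """Return the set of k-mers that occur at most max_count times in seq."""
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    return {kmer for kmer, n in counts.items() if n <= max_count}
```

In a tandem repeat such as `ACGACGACGTTT`, the repeated unit's k-mers are frequent, so only the k-mers near the repeat boundary come back as rare; such k-mers give a mapper unambiguous anchors inside otherwise repetitive sequence.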