gtca/alphagenome_pytorch

https://huggingface.co/gtca/alphagenome_pytorch
Activeby gtca438updated 3 months ago

A PyTorch port of AlphaGenome, the DNA sequence model from Google DeepMind that predicts hundreds of genomic tracks at single base-pair resolution from sequences up to 1M bp.

Sourced from

  • HuggingFacegtca/alphagenome_pytorch

Related resources

Deep learning-based variant caller

Active3.7K2 months ago
Python
BSD-3-Clause

Evo 2 is a state-of-the-art DNA language model trained autoregressively on trillions of DNA tokens.

Active1063 months ago
Stale02 years ago

This is the base model of GenomeOcean-4B. It is trained with Causal Language Modeling (CLM) and uses a BPE tokenizer with 4096 tokens. It supports a maximum sequence length of 10240 tokens (~50kbp).

Idle4.4K1 year ago

Evo 2 is a state of the art DNA language model for long context modeling and design. Evo 2 models DNA sequences at single-nucleotide resolution at up to 1 million base pair context length using the StripedHyena 2 architecture, using Savanna.

Idle08 months ago

GGUF quantizations of HuggingFaceBio/Carbon-3B — a generative DNA foundation model — for efficient inference with llama.cpp.

Active6201 week ago