DOEJGI/GenomeOcean-4B
https://huggingface.co/DOEJGI/GenomeOcean-4BIdleby DOEJGI4.4K10updated 1 year ago
This is the base model of GenomeOcean-4B. It is trained with Causal Language Modeling (CLM) and uses a BPE tokenizer with 4096 tokens. It supports a maximum sequence length of 10240 tokens (~50kbp).
Sourced from
- HuggingFace — DOEJGI/GenomeOcean-4B
Related resources
InstaDeepAI/NTv3_650M_pre
by InstaDeepAIgemma4-12b-bioinfo is a fine-tuned Gemma 4 12B instruction model for bioinformatics, genomics, and computational biology question answering.
A PyTorch port of AlphaGenome, the DNA sequence model from Google DeepMind that predicts hundreds of genomic tracks at single base-pair resolution from sequences up to 1M bp.
Active433 months ago
zhihan1996/DNA_bert_6
by zhihan1996zhihan1996/DNA_bert_4
by zhihan1996vandijklab/C2S-Scale-Gemma-2-27B
by vandijklabGitHub homepage: Cell2Sentence GitHub