Find open-source science resources

littleworth/protgpt2-distilled-small

by littleworth

A compact protein language model distilled from ProtGPT2 using complementary-regularizer distillation---a method that combines uncertainty-aware position weighting with calibration-aware label smoothing to achieve 54% better perplexity than standard knowledge distillation at 9.4x compression.

Active53 months ago

littleworth/protgpt2-distilled-tiny

by littleworth

A compact protein language model distilled from ProtGPT2 using complementary-regularizer distillation---a method that combines uncertainty-aware position weighting with calibration-aware label smoothing to achieve 87% better perplexity than standard knowledge distillation at 20x compression.

Active143 months ago

InstaDeepAI/NTv3_650M_pre

by InstaDeepAI

Active6.5K3 months ago

baichuan-inc/Baichuan-M3-235B-Q4_K_M-GGUF

by baichuan-inc

From Inquiry to Decision: Building Trustworthy Medical AI

Active204 months ago

EQUES/JPharmatron-7B

by EQUES

Active454 months ago

Raziel1234/OSTLM

by Raziel1234

translation

A Neural Machine Translation (NMT) model based on a custom Transformer (Encoder-Decoder) architecture, trained from scratch. This model is designed to translate English sentences into Hebrew using multilingual encoding and specialized layer configurations.

Active274 months ago

unsloth/medgemma-1.5-4b-it-GGUF

by unsloth

Unsloth Dynamic 2.0 achieves superior accuracy & outperforms other leading quants.

Active7.8K4 months ago

OpenMed/OpenMed-PII-SuperClinical-Small-44M-v1

by OpenMed

token-classification

PII Detection Model | 44M Parameters | Open Source

Active27K4 months ago

microsoft/MediPhi-Instruct

by microsoft

The MediPhi Model Collection comprises 7 small language models of 3.8B parameters from the base model Phi-3.5-mini-instruct specialized in the medical and clinical domains. The collection is designed in a modular fashion. Five MediPhi experts are fine-tuned on various medical corpora (i.e.

Active2K5 months ago

nvidia/geneformer_V2_316M

by nvidia

## Description: Geneformer is a foundational transformer model pretrained on a large-scale corpus of single-cell transcriptomes to enable context-specific predictions in settings with limited data in network biology.

Active325 months ago

nvidia/geneformer_V2_104M_CLcancer

by nvidia

## Description: Geneformer is a foundational transformer model pretrained on a large-scale corpus of single-cell transcriptomes to enable context-specific predictions in settings with limited data in network biology. This model version was continually pretrained on ~14 million cancer transcriptomes…

Active165 months ago

nvidia/geneformer_V2_104M

by nvidia

## Description: Geneformer is a foundational transformer model pretrained on a large-scale corpus of single-cell transcriptomes to enable context-specific predictions in settings with limited data in network biology.

Active315 months ago

nvidia/geneformer_V1_10M

by nvidia

## Description: Geneformer is a foundational transformer model pretrained on a large-scale corpus of single-cell transcriptomes to enable context-specific predictions in settings with limited data in network biology.

Active175 months ago

ZJU-AI4H/Hulu-Med-4B

by ZJU-AI4H

Hulu-Med: A Transparent Generalist Model towards Holistic Medical Vision-Language Understanding

Idle19.9K6 months ago

microsoft/llava-med-v1.5-mistral-7b

by microsoft

Large Language and Vision Assistant for bioMedicine (i.e., “LLaVA-Med”) is a large language and vision model trained using a curriculum learning method for adapting LLaVA to the biomedical domain. It is an open-source release intended for research use only to facilitate reproducibility of the…

Idle21.4K6 months ago

gbyuvd/chemembed-chemselfies

by gbyuvd

sentence-similarity

ChemFIE-BED is a sentence-transformers based on gbyuvd/chemselfies-base-bertmlm fine-tuned on around (for now) 2 million pairs of valid molecules' SELFIES (Krenn et al. 2020) taken from COCONUTDB (Sorokina et al. 2021) and ChemBL34 (Zdrazil et al. 2023).

Idle1147 months ago

vandijklab/C2S-Scale-Gemma-2-27B

by vandijklab

GitHub homepage: Cell2Sentence GitHub

Idle9607 months ago

google/medgemma-4b-it

by google

Idle445.6K7 months ago

gbyuvd/chemselfies-base-bertmlm

by gbyuvd

This model is a lightweight model pre-trained on SELFIES (Self-Referencing Embedded Strings) representations of molecules. It is trained on 2.7M unique and valid molecules taken from COCONUTDB and ChemBL34, with 7.3M total generated masked examples.

Idle68 months ago

nvidia/AMPLIFY_350M

by nvidia

> [!NOTE] > This model has been optimized using NVIDIA's TransformerEngine > library. Slight numerical differences may be observed between the original model and the optimized > model. For instructions on how to install TransformerEngine, please refer to the > official documentation.

Idle348 months ago

nvidia/AMPLIFY_120M

by nvidia

> [!NOTE] > This model has been optimized using NVIDIA's TransformerEngine > library. Slight numerical differences may be observed between the original model and the optimized > model. For instructions on how to install TransformerEngine, please refer to the > official documentation.

Idle5838 months ago

lingshu-medical-mllm/Lingshu-7B

by lingshu-medical-mllm

Website    🤖 7B Model    🤖 32B Model    MedEvalKit    Technical Report    Lingshu MCP

Idle4.1K8 months ago

google/medgemma-27b-text-it

by google

Idle35.6K8 months ago

lastmass/Qwen3_Medical_GRPO

by lastmass

中文版说明

Idle779 months ago

S4nfs/Neeto-1.0-8b

by S4nfs

Neeto-1.0-8b is an openly released biomedical large language model (LLM) created by BYOL Academy to assist learners and practitioners with medical exam study, literature understanding, and structured clinical reasoning.

Idle7.7K9 months ago

Zaixi/RNAGenesis

by Zaixi

feature-extraction

Idle519 months ago

sagawa/ReactionT5v2-forward

by sagawa

This is a ReactionT5 pre-trained to predict the products of reactions. You can use the demo here.

Idle2K9 months ago

AdaptLLM/biomed-Qwen2.5-VL-3B-Instruct

by AdaptLLM

This repos contains the biomedicine MLLM developed from Qwen2.5-VL-3B-Instruct in our paper: On Domain-Adaptive Post-Training for Multimodal Large Language Models. The correspoding training dataset is in biomed-visual-instructions.

Idle1529 months ago

OpenMed/OpenMed-NER-ChemicalDetect-ElectraMed-33M

by OpenMed

token-classification

Specialized model for Chemical Entity Recognition - Identifies chemical compounds and substances in biomedical literature

Idle7110 months ago

darkknight25/deepseek-16b-medical-GPT

by darkknight25

darkknight25/deepseek-16b-medical-GPT is a fine-tuned version of deepseek-ai/deepseek-l6b-moe-chat, optimized for medical question answering, reasoning, and clinical summarization using QLoRA and open-access healthcare datasets.

Idle010 months ago

mradermacher/Qwen-3-32B-Medical-Reasoning-i1-GGUF

by mradermacher

For a convenient overview and download list, visit our model page for this model.

Idle3.6K11 months ago

mradermacher/Dans-PersonalityEngine-V1.3.0-24b-i1-GGUF

by mradermacher

For a convenient overview and download list, visit our model page for this model.

Idle42811 months ago

unsloth/medgemma-27b-it-GGUF

by unsloth

Unsloth Dynamic 2.0 achieves superior accuracy & outperforms other leading quants.

Idle7.7K11 months ago

google/medgemma-27b-it

by google

Idle494.9K11 months ago

zero-shot-image-classification

google/medsiglip-448

by google

Idle32.3K11 months ago

helical-ai/helix-mRNA

by helical-ai

feature-extraction

Idle9.8K11 months ago

ibm-research/materials.smi-ted

by ibm-research

feature-extraction

Welcome to IBM's series of large foundation models for sustainable materials. Our models span a variety of representations and modalities, including SMILES, SELFIES, 3D atom positions, 3D density grids, molecular graphs, and other formats.

Idle19011 months ago

zhihan1996/DNA_bert_3

by zhihan1996

Idle2.3K11 months ago

zhihan1996/DNA_bert_4

by zhihan1996

Idle73811 months ago

zhihan1996/DNA_bert_5

by zhihan1996

Idle72911 months ago

zhihan1996/DNA_bert_6

by zhihan1996

Idle6.2K11 months ago

andrewdalpino/ESM2-150M-Protein-Molecular-Function

by andrewdalpino

An Evolutionary-scale Model (ESM) for protein function prediction from amino acid sequences using the Gene Ontology (GO). Based on the ESM2 Transformer architecture, pre-trained on UniRef50, and fine-tuned on the AmiGO dataset, this model predicts the GO subgraph for a particular protein sequence -…

Idle2312 months ago

andrewdalpino/ESM2-150M-Protein-Cellular-Component

by andrewdalpino

An Evolutionary-scale Model (ESM) for protein function prediction from amino acid sequences using the Gene Ontology (GO). Based on the ESM2 Transformer architecture, pre-trained on UniRef50, and fine-tuned on the AmiGO dataset, this model predicts the GO subgraph for a particular protein sequence -…

Idle1212 months ago

andrewdalpino/ESM2-150M-Protein-Biological-Process

by andrewdalpino

An Evolutionary-scale Model (ESM) for protein function prediction from amino acid sequences using the Gene Ontology (GO). Based on the ESM2 Transformer architecture, pre-trained on UniRef50, and fine-tuned on the AmiGO dataset, this model predicts the GO subgraph for a particular protein sequence -…

Idle712 months ago

andrewdalpino/ESM2-35M-Protein-Molecular-Function

by andrewdalpino

An Evolutionary-scale Model (ESM) for protein function prediction from amino acid sequences using the Gene Ontology (GO). Based on the ESM2 Transformer architecture, pre-trained on UniRef50, and fine-tuned on the AmiGO dataset, this model predicts the GO subgraph for a particular protein sequence -…

Idle51 year ago