Find open-source science resources
A directory of tools, AI models, datasets, and research resources for biotech, bioinformatics, and other scientific fields. Aggregated from curated GitHub awesome-lists, HuggingFace, bio.tools, Bioconductor, and more.
Filters
Health
Domain
Language
License
Source(1)
Type
288 of 5,893 resources
Showing 251–288
# ibm/biomed.sm.mv-te-84m-MoleculeNet-ligand_scaffold-MUV-101 biomed.sm.mv-te-84m is a multimodal biomedical foundation model for small molecules created using MMELON (Multi-view Molecular Embedding with Late Fusion), a flexible approach to aggregate multiple views (sequence, image, graph) of…
mradermacher/Palmyra-Med-70B-GGUF
by mradermacherIf you are unsure how to use GGUF files, refer to one of TheBloke's READMEs for more details, including on how to concatenate multi-part files.
ChemFIE-SA is a BERT-like sequence classifier for predicting synthesis accessibility given a SELFIES string of a compound, fine-tuned from gbyuvd/chemselfies-base-bertmlm on DeepSA's expanded dataset from Wang et al. 2023.
This model is a BERT-like sequence classifier for 221 human protein drug targets, fine-tuned from gbyuvd/chemselfies-base-bertmlm on a dataset derived ChemBL34 (Zdrazil et al. 2023). It predicts potential drug targets using chemical structures represented as SELFIES (Self-Referencing Embedded…
sagawa/ReactionT5v1-forward
by sagawaThis is a ReactionT5 pre-trained to predict the products of reactions.
minwoosun/uce-100m
by minwoosunUniversal Cell Embeddings (UCE) is a foundation model designed for single-cell RNA sequencing data analysis. UCE generates a universal representation of cells that captures the molecular diversity across different cell types, tissues, and species.
johnsnowlabs/JSL-MedLlama-3-8B-v2.0
by johnsnowlabs# JSL-MedLlama-3-8B-v2.0
!image/png
This model is a fine-tuned version of DeBERTa on the PubMED Dataset.
knowledgator/SMILES2IUPAC-canonical-base
by knowledgatorSMILES2IUPAC-canonical-base was designed to accurately translate SMILES chemical names to IUPAC standards.
songlab/tokenizer-dna-mlm
by songlabrootstrap-org/Alzheimer-Classifier-Demo
by rootstrap-org### Model Description A machine learning model for waste classification
TachyHealth/Thealth_Mixtral-8x7B
by TachyHealthAmelieSchreiber/esm_interact
by AmelieSchreiberThis model was finetuned on concatenated pairs of interacting proteins in much the same way as PepMLM. It is meant to generate interaction partners for proteins using the masked language modeling capabilities of ESM-2. The model is not well tested, so use with caution.
Rostlab/ProstT5
by RostlabProstT5 is a protein language model (pLM) which can translate between protein sequence and structure. !ProstT5 pre-training and inference
Galahad3x/QAModelForPatho
by Galahad3xQuestion Answering Model for the PathoTHREAT Project
Intae/mymodel
by IntaeThis is a Japanese RoBERTa base model pre-trained on academic articles in medical sciences collected by Japan Science and Technology Agency (JST).
I present a demo showcasing retinal vessel segmentation using the U-Net model, which is a well-known and widely used model in medical image segmentation. The model was trained on the DRIVE dataset, and the training process was conducted on Google Colab.
datasets: - UMLS
In recent years, pre-trained language models (PLMs) achieve the best performance on a wide range of natural language processing (NLP) tasks. While the first models were trained on general domain data, specialized ones have emerged to more effectively treat specific domains.
Dr-BERT/DrBERT-4GB-CP-CamemBERT
by Dr-BERTIn recent years, pre-trained language models (PLMs) achieve the best performance on a wide range of natural language processing (NLP) tasks. While the first models were trained on general domain data, specialized ones have emerged to more effectively treat specific domains.
Dr-BERT/DrBERT-4GB
by Dr-BERTIn recent years, pre-trained language models (PLMs) achieve the best performance on a wide range of natural language processing (NLP) tasks. While the first models were trained on general domain data, specialized ones have emerged to more effectively treat specific domains.
Dr-BERT/DrBERT-7GB
by Dr-BERTIn recent years, pre-trained language models (PLMs) achieve the best performance on a wide range of natural language processing (NLP) tasks. While the first models were trained on general domain data, specialized ones have emerged to more effectively treat specific domains.
tinnerofkors/kors
by tinnerofkorsopenskies2009/PeternewModelhere
by openskies2009K8778/universe
by K8778SamKenX-Hub-Community/SamKenXAI-engine-compiting
by SamKenX-Hub-CommunityThis modelcard aims to be a base template for new models. It has been generated using this raw template.
csimonmeunier/test-model
by csimonmeuniercosmobaby/ka
by cosmobabySevenlee/kkk
by Sevenlee# ChemGPT 1.2B ChemGPT is based on the GPT-Neo model and was introduced in the paper Neural Scaling of Deep Chemical Models.
# ChemGPT 19M ChemGPT is based on the GPT-Neo model and was introduced in the paper Neural Scaling of Deep Chemical Models.
# ChemGPT 4.7M ChemGPT is based on the GPT-Neo model and was introduced in the paper Neural Scaling of Deep Chemical Models.
Deep learning for chemistry and materials science remains a novel field with lots of potiential. However, the popularity of transfer learning based methods in areas such as NLP and computer vision have not yet been effectively developed in computational chemistry + machine learning.