Find open-source science resources

A directory of tools, AI models, datasets, and research resources for biotech, bioinformatics, and other scientific fields. Aggregated from curated GitHub awesome-lists, HuggingFace, bio.tools, Bioconductor, and more.

29 of 5,923 resources

MITE (Minimum Information about a Tailoring Enzyme) is a data repository and associated data standard designed to capture the reaction- and substrate-specificities of tailoring enzymes. Community-driven and fully expert-reviewed, it represents enzymatic reactions using reaction SMARTS and links to established resources such as UniProt, NCBI GenPept, Rhea, and MIBiG. MITE serves as a knowledgebase for enzyme and pathway annotation, in silico biosynthesis, and machine learning applications.

Active35 days ago
Python
CC0-1.0

METPO (Microbial Ecophysiological Trait and Phenotype Ontology) provides standardized terms for describing microbial phenotypes, growth characteristics, and culture conditions. It includes classes for growth media, temperature tolerances, pH tolerances, and relationships like "grows in" and "does not grow in".

Active11 week ago
Python
CC-BY-4.0

SSSOM is a Simple Standard for Sharing Ontological Mappings, providing - a TSV-based representation for ontology term mappings - a comprehensive set of standard metadata elements to describe mappings and - a standard translation between the TSV and the Web Ontology Language (OWL). Most metadata elements, such as "sssom:mapping_justification" are defined in the sssom namespace.

Active2012 weeks ago
Python
BSD-3-Clause

A data model for managing information about chemical entities, ranging from atoms through molecules to complex mixtures.

Active233 weeks ago
Python
CC0-1.0
Active3753 weeks ago
Python
Active633 weeks ago
Python

The Chromosome Ontology is an automatically derived ontology of chromosomes and chromosome parts.

Active163 weeks ago
Python
BSD-3-Clause

The Common Core Ontologies (CCO) comprise twelve ontologies that are designed to represent and integrate taxonomies of generic classes and relations across all domains of interest. CCO is a mid-level extension of Basic Formal Ontology (BFO), an upper-level ontology framework widely used to structure and integrate ontologies in the biomedical domain (Arp, et al., 2015). BFO aims to represent the most generic categories of entity and the most generic types of relations that hold between them, by defining a small number of classes and relations. CCO then extends from BFO in the sense that every class in CCO is asserted to be a subclass of some class in BFO, and that CCO adopts the generic relations defined in BFO (e.g., has_part) (Smith and Grenon, 2004). Accordingly, CCO classes and relations are heavily constrained by the BFO framework, from which it inherits much of its basic semantic relationships.

Active3313 weeks ago
Python
CC-BY-4.0

The Simplified Upper Level Ontology (SULO) is ontology with a minimal set of classes and relations to guide the development of a personal health knowledge graph. [from homepage]

Active163 weeks ago
Python
MIT

The EVORAO Ontology provides a structured and harmonized vocabulary for describing shareable pathogens as characterized biological materials, along with their derived products and associated services, organized into collections. Developed within the EVORA project, it supports consistent metadata annotation across research infrastructures, promoting findability, accessibility, interoperability, and reusability (FAIR). By aligning with relevant standards and ontologies, EVORAO facilitates cross-domain collaboration, integration, and sharing of pathogenic resources and services to enhance pandemic preparedness and response. While initially focused on virology, EVORAO is designed to be extensible and also supports metadata harmonization for other pathogens. [from repository]

Active01 month ago
Python
CC0-1.0
Active51 month ago
Python

The submission-centric metadata schema for the German Human Genome-Phenome Archive (GHGA).

Active161 month ago
Python
Apache-2.0

An extension of Schema.org to annotate metadata on software projects

Active3481 month ago
Python
Apache-2.0

The Context and Measurement Ontology (COMO) contains ontological terms to describe the context for various types of experimental data and measurements. It is useful in its current state for several different environmental microbiology projects. This ontology is used in multiple CORAL (Contextual Ontology-based Repository Analysis Library) deployments.

Active82 months ago
Python
AGPL-3.0

An EMMO-based domain ontology for atomistic and electronic modelling.

Active12 months ago
Python
CC-BY-4.0

The Graphic Descriptor Ontology (GDO) is intended for use in describing graphics that represent the form of objects. It uses the language of visual communication, illustration, and technical drawing. The GDO is rooted in the Basic Formal Ontology (BFO) and uses several classes from the Information Entity Ontology of the Common Core Ontologies as a mid-level ontology. [from https://gdo.endlessforms.info/about]

Active02 months ago
Python
CC-BY-4.0

This ontology integrates cell type markers for cells in the Cell Ontology from various sources along with details of marker context (anatomical context, assay), confidence (where available) and provenance. [from repository]

Active13 months ago
Python

The Bibframe vocabulary consists of RDF classes and properties used for the description of items cataloged principally by libraries, but may also be used to describe items cataloged by museums and archives. Classes include the three core classes - Work, Instance, and Item - in addition to many more classes to support description. Properties describe characteristics of the resource being described as well as relationships among resources. For example: one Work might be a "translation of" another Work; an Instance may be an "instance of" a particular Bibframe Work. Other properties describe attributes of Works and Instances. For example: the Bibframe property "subject" expresses an important attribute of a Work (what the Work is about), and the property "extent" (e.g. number of pages) expresses an attribute of an Instance.

Idle546 months ago
Python
CC0-1.0

Assigns identifiers to knowledge graphs (KGs) that are used and/or maintained within any NFDI consortium.

Idle09 months ago
Python
CC0-1.0
Idle611 months ago
Python

An ontology of qualifications, distinctions, and certifications that uses the Phenotype And Trait Ontology term quality (PATO:0000001) as a root term.

Idle111 months ago
Python
MIT

MIBiG (Minimum Information about a Biosynthetic Gene Cluster) is a data repository and associated data standard designed to describe biosynthetic gene clusters involved in the production of specialized metabolites. It also stores data on measured biological activities and links to other resources such as NCBI, NPAtlas, and ChEBI. MIBiG is used as a reference database, knowledgebase, and training dataset for machine learning.

Stale102 years ago
Python

The Science Data Discovery Ontology (sddo) is being developed to provide a semantic foundation for the discovery of information managed by NASA's Science Mission Directorate. This information spans many scientific disciplines, fields and subfields, including heliophysics, earth science, planetary science, astrophysics, biology, astrobiology, and physical science. [from repository]

Stale23 years ago
Python

Algorithm Metadata Vocabulary is a vocabulary for capturing and storing the metadata about the algorithms (a procedure or a set of rules that is followed step-by-step to solve a problem, especially by a computer). There are uncountable algorithms present in every area (e.g., Computer Science, Mathematics), which makes it hard for specialists, academicians, application engineers, and so forth to discover, distinguish, select, and reuse them. [from repository]

Stale03 years ago
Python
CC0-1.0

An ontology transcription of definitions in the Functional Mock-up Interface (FMI) standard document from https://fmi-standard.org/ that enables representing Functional Mock-up Units (FMUs) in RDF

Stale24 years ago
Python

The Reagent Ontology (ReO) adheres to OBO Foundry principles (obofoundry.org) to model the domain of biomedical research reagents, considered broadly to include materials applied “chemically” in scientific techniques to facilitate generation of data and research materials. ReO is a modular ontology that re-uses existing ontologies to facilitate cross-domain interoperability. It consists of reagents and their properties, linking diverse biological and experimental entities to which they are related. ReO supports community use cases by providing a flexible, extensible, and deeply integrated framework that can be adapted and extended with more specific modeling to meet application needs.

Stale06 years ago
Python
NOASSERTION
Stale68 years ago
Python

Selventa legacy chemical namespace used with the Biological Expression Language

Archived08 years ago
Python
Apache-2.0