Find open-source science resources
A directory of tools, AI models, datasets, and research resources for biotech, bioinformatics, and other scientific fields. Aggregated from curated GitHub awesome-lists, HuggingFace, bio.tools, Bioconductor, and more.
Terms approved for use by BODC to describe the measurement units for data held in its repositories.
Assigns identifiers to collections of datasets indexed by CELLxGENE. CELLxGENE is an interactive data visualization and exploration tool developed by the Chan Zuckerberg Initiative that enables researchers to analyze and share single-cell genomics datasets. It provides a user-friendly interface for biologists and computational scientists to interrogate gene expression patterns across different cell types.
Names of organisations providing metadata for CESSDA Data Catalogue.
This vocabulary holds the definitions and descriptions of the collections included in the CESSDA Data Catalogue.
Lists the types of persistent identifiers that CESSDA accepts as study level PIDs in its data catalogue.
Nomenclature consortium for chicken genes (analogous to the HGNC for humans).
The goal of the CODATA Research Data Management Terminology is to gather the key terms needed for a common understanding of the research data management domain. The RDMT was revised by the CODATA RDM Terminology Working Group, shared for public review, and then confirmed and finalised in 2023. The RDMT grew out of the CASRAI Research Data Management Glossary, which was intended as a practical reference for individuals and groups concerned with the improvement of research data management (RDM). In 2020, CASRAI requested that CODATA assume responsibility for the curation of this valued resource. To that end, the RDM Terminology Working Group uses a lightweight and pragmatic biennial process to review the resource now restructured as the CODATA RDM Terminology and suggest any edits, additions and removals that are required in order to develop and improve this important reference resource.
CRediT (Contributor Roles Taxonomy) is a high-level taxonomy of 14 roles that can be used to represent the roles typically played by contributors to scientific scholarly output. The roles describe each contributor’s specific contribution to the scholarly output.
A thesaurus of terms useful for digital archaeology.
DCAT is an RDF vocabulary designed to facilitate interoperability between data catalogs published on the Web.
Identifies the type of aggregation used to combine related categories, usually within a common branch of a hierarchy, to provide information at a broader level than the level at which detailed observations are taken. (From: The OECD Glossary of Statistical Terms)
Describes the entity being analyzed in the study or variable. This vocabulary can also be used to describe the unit of observation, which is the unit being observed or from which data are collected. The unit of observation can be the same as, or different from, the unit of analysis.
Standard set of characters upon which many character encodings are based (Wikipedia).
Describes the degree of similarity between two items or schemes (collections of items).
Identifies the type of data, which has a bearing on the acceptable data values, the operations that can be performed with the data, and the ways in which the data are stored.
Describes the physical format(s) of the data documented in the logical product(s) of a study unit.
Describes the level of proficiency of an individual in a natural language.
Specifies the event happening over the data life cycle that is considered significant enough to document.
The procedure, technique, or mode of inquiry used to attain the data.
Indicates the entity that provided the information carried by the variable.
Indicates the statistical software package used in the production/processing/dissemination of the data. Data collection software is not covered in this list.
Specifies the type of summary statistic. A summary statistic is a single-number representation of the characteristics of a set of values.
Time zone specification as an offset from UTC (Coordinated Universal Time) in terms of hours and minutes.
Identifies the type of address entered as contact information for an individual or an organization.
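As an illustration of the time zone entry above (an offset from UTC in hours and minutes), the following minimal sketch parses such an offset with the Python standard library. The `+HH:MM` / `-HH:MM` string form and the helper name `parse_utc_offset` are assumptions made for this example; the vocabulary itself may encode offsets differently.

```python
import re
from datetime import datetime, timedelta, timezone

# Assumed textual form: a sign, two-digit hours, a colon, two-digit minutes.
_OFFSET = re.compile(r"^([+-])(\d{2}):(\d{2})$")


def parse_utc_offset(text: str) -> timezone:
    """Turn a string such as '+05:30' into a fixed-offset tzinfo object."""
    match = _OFFSET.match(text)
    if match is None:
        raise ValueError(f"not a UTC offset: {text!r}")
    sign, hours, minutes = match.groups()
    delta = timedelta(hours=int(hours), minutes=int(minutes))
    # A leading minus sign means the zone is behind UTC.
    return timezone(-delta if sign == "-" else delta)


# Convert noon UTC into the zone five and a half hours ahead of UTC.
tz = parse_utc_offset("+05:30")
noon_utc = datetime(2024, 1, 1, 12, 0, tzinfo=timezone.utc)
local = noon_utc.astimezone(tz)
print(local.isoformat())  # 2024-01-01T17:30:00+05:30
```

A fixed offset carries no daylight-saving rules, so it identifies a displacement from UTC rather than a named region.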