Find open-source science resources
A directory of tools, AI models, datasets, and research resources for biotech, bioinformatics, and other scientific fields. Aggregated from curated GitHub awesome-lists, HuggingFace, bio.tools, Bioconductor, and more.
Filters
Health
Domain
Language
License
Source
Type
5,940 resources indexed
Showing 4,051–4,100
A Python program to compute quasi-harmonic thermochemical data from Gaussian frequency calculations.
Calculate mass, elemental composition, and mass distribution spectrum of a molecule given by its chemical formula, relative element weights, or sequence.
A toolkit for visualizations in materials informatics.
A library for processing, analyzing and modeling spectroscopic data.
Library of descriptors to aid in the data-mining of materials properties, created by the Lawrence Berkeley National Laboratory.
Aims to provide useful high-level interfaces that make ML for materials science as easy as possible.
Library for fast calculations of **mo**lecula**r** **fe**at**u**re**s** from 3D structures for machine learning with a focus on steric descriptors.
Ensemble of automated machine learning protocols that can be run sequentially through a single command line. The program works for regression and classification problems.
Self-Referencing Embedded Strings (SELFIES): A 100% robust molecular string representation.
A benchmarking platform for molecular generation models.
Makes alchemical free energy calculations easier by leveraging the full power and flexibility of the PyData stack.
A tool and library for creating quantum chemistry input files.
A package to 'build' collections of materials properties from the output of computational materials calculations.
The "FRamewOrk for Molecular AGgregate Excitations" enables localised QM/QM' excited state calculations in a solid state environment.
High-Throughput Molecular Dynamics: Programming Environment for Molecular Discovery.
Open source PEM (Proton Exchange Membrane) fuel cell simulation tool.
Python Library for Automating Molecular Simulation: input preparation, job execution, file management, output processing and building data workflows.
Usage-Instructions) - A program to visualize reaction networks.
An open, extensible Python framework for GPU-accelerated alchemical free energy calculations.
Convert AMBER forcefields from ANTECHAMBER to GROMACS format.
A Chemical Knowledge Graph and Toolkit, writting in IUPAC/SMILES/SMARTS, for common small molecules from diverse communities to aid users in selecting compounds for forcefield parametirization.
A benchmark for ML-guided high-throughput materials discovery.
A [Jupyter](https://jupyter.org/) widget to interactively view molecular structures and trajectories.
High-throughput PubChem client for batch queries with caching, validation, rate-limit-aware retries, and a simple CLI.
Scientific Computing for Chemists with Python is a Jupyter book teaching basic python in chemistry skills, including relevant libraries, and applies them to solving chemical problems.
Parallel computing with task scheduling.
k-mer counting, filtering, and graph traversal.
Neuro-imaging file formats.
Behavioral and neuroimaging experiments.
Spike detection and clustering-based spike sorting.
by Sebastian Raschka.
Robert Johansson.
Umbrella package is available for the entire Automated Affymetrix Array Analysis suite of package.
Base utility functions are available for the Automated Affymetrix Array Analysis set of packages.
Functionalities for classification of Affymetrix microarray data, integrating within the Automated Affymetrix Array Analysis set of packages.
Utility functions for the Automated Affymetrix Array Analysis set of packages.
Utility functions to pre-process data for the Automated Affymetrix Array Analysis set of packages.
Utility functions to facilitate the reporting of the Automated Affymetrix Array Analysis Reporting set of packages.
Automated pipline to perform gene expression analysis for Applied Biosystems Genome Survey Microarray (AB1700) data format. Functions include data preprocessing, filtering, control probe analysis, statistical analysis in one single function. A GUI interface is also provided. The raw data, processed data, graphics output and statistical results are organized into folders according to the analysis settings used.
Inferring differential expression genes by absolute counts difference between two groups, utilizing Negative binomial distribution and moderating fold-change according to heterogeneity of dispersion across expression level.
This package provides a multivariate inferential analysis method for detecting differentially expressed genes in gene expression data. It uses artificial components, close to the data's principal components but with an exact interpretation in terms of differential genetic expression, to identify differentially expressed genes while controlling the false discovery rate (FDR). The methods on this package are described in the vignette or in the article 'Multivariate Method for Inferential Identification of Differentially Expressed Genes in Gene Expression Experiments' by J. P. Acosta, L. Lopez-Kleine and S. Restrepo (2015, pending publication).
Functions for reading aCGH data from image analysis output files and clone information files, creation of aCGH S3 objects for storing these data. Basic methods for accessing/replacing, subsetting, printing and plotting aCGH objects.
ADAM is a GSEA R package created to group a set of genes from comparative samples (control versus experiment) belonging to different species according to their respective functions (Gene Ontology and KEGG pathways as default) and show their significance by calculating p-values referring togene diversity and activity. Each group of genes is called GFAG (Group of Functionally Associated Genes).
ADAMgui is a Graphical User Interface for the ADAM package. The ADAMgui package provides 2 shiny-based applications that allows the user to study the output of the ADAM package files through different plots. It's possible, for example, to choose a specific GFAG and observe the gene expression behavior with the plots created with the GFAGtargetUi function. Features such as differential expression and foldchange can be easily seen with aid of the plots made with GFAGpathUi function.
ADAPT carries out differential abundance analysis for microbiome metagenomics data in phyloseq format. It has two innovations. One is to treat zero counts as left censored and use Tobit models for log count ratios. The other is an innovative way to find non-differentially abundant taxa as reference, then use the reference taxa to find the differentially abundant ones.
Processes MS2 data to identify potentially adducted peptides from spectra that has been corrected for mass drift and retention time drift and quantifies MS1 level mass spectral peaks.
Single-cell RNA sequencing (scRNA-seq) methods are typically unable to quantify the expression levels of all genes in a cell, creating a need for the computational prediction of missing values (‘dropout imputation’). Most existing dropout imputation methods are limited in the sense that they exclusively use the scRNA-seq dataset at hand and do not exploit external gene-gene relationship information. Here we propose two novel methods: a gene regulatory network-based approach using gene-gene relationships learnt from external data and a baseline approach corresponding to a sample-wide average. ADImpute can implement these novel methods and also combine them with existing imputation methods (currently supported: DrImpute, SAVER). ADImpute can learn the best performing method per gene and combine the results from different methods into an ensemble.
This package implements clustering of microarray gene expression profiles according to functional annotations. For each term genes are annotated to, splits into two subclasses are computed and a significance of the supporting gene set is determined.