Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
http://metagenomics.iiserb.ac.in/mp3/
Software tool for prediction of pathogenic proteins in genomic and metagenomic data. Used for identification of partial pathogenic proteins predicted from short (100-150 bp) metagenomic reads and also performs on complete protein sequences., THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025.
Proper citation: MP3 tool (RRID:SCR_019282) Copy
http://www.sanger.ac.uk/science/tools/ssaha2-0
A program designed for the efficient mapping of sequence reads onto genomic references. The software is capable of reading most sequencing platforms and giving a range of outputs are supported.
Proper citation: Sequence Search and Alignment by Hashing Algorithm (RRID:SCR_000544) Copy
http://alchemy.sourceforge.net/
ALCHEMY is a genotype calling algorithm for Affymetrix and Illumina products which is not based on clustering methods. Features include explicit handling of reduced heterozygosity due to inbreeding and accurate results with small sample sizes. ALCHEMY is a method for automated calling of diploid genotypes from raw intensity data produced by various high-throughput multiplexed SNP genotyping methods. It has been developed for and tested on Affymetrix GeneChip Arrays, Illumina GoldenGate, and Illumina Infinium based assays. Primary motivations for ALCHEMY''s development was the lack of available genotype calling methods which can perform well in the absence of heterozygous samples (due to panels of inbred lines being genotyped) or provide accurate calls with small sample batches. ALCHEMY differs from other genotype calling methods in that genotype inference is based on a parametric Bayesian model of the raw intensity data rather than a generalized clustering approach and the model incorporates population genetic principles such as Hardy-Weinberg equilibrium adjusted for inbreeding levels. ALCHEMY can simultaneously estimate individual sample inbreeding coefficients from the data and use them to improve statistical inference of diploid genotypes at individual SNPs. The main documentation for ALCHEMY is maintained on the sourceforge-hosted MediaWiki system. Features * Population genetic model based SNP genotype calling * Simultaneous estimation of per-sample inbreeding coefficients, allele frequencies, and genotypes * Bayesian model provides posterior probabilities of genotype correctness as quality measures * Growing number of scripts and supporting programs for validation of genotypes against control data and output reformating needs * Multithreaded program for parallel execution on multi-CPU/core systems * Non-clustering based methods can handle small sample sets for empirical optimization of sample preparation techniques and accurate calling of SNPs missing genotype classes ALCHEMY is written in C and developed on the GNU/Linux platform. It should compile on any current GNU/Linux distribution with the development packages for the GNU Scientific Library (gsl) and other development packages for standard system libraries. It may also compile and run on Mac OS X if gsl is installed.
Proper citation: ALCHEMY (RRID:SCR_005761) Copy
http://bamview.sourceforge.net/
A free interactive display of read alignments in BAM data files that can be launched with Java Web Start or downloaded. This interactive Java application for visualizing the large amounts of data stored for sequence reads which are aligned against a reference genome sequence can be used in a number of contexts including SNP calling and structural annotation. It has been integrated into Artemis so that the reads can be viewed in the context of the nucleotide sequence and genomic features. The source code is available as part of the Artemis code which can be downloaded from GitHub.
Proper citation: BamView (RRID:SCR_004207) Copy
http://noble.gs.washington.edu/proj/genomedata/
A format for efficient storage of multiple tracks of numeric data anchored to a genome. The format allows fast random access to hundreds of gigabytes of data, while retaining a small disk space footprint. They have also developed utilities to load data into this format. Retrieving data from this format is more than 2900 times faster than a naive approach using wiggle files. A reference implementation in Python and C components is available here under the GNU General Public License. The software has only been tested on Linux and Mac systems.
Proper citation: Genomedata (RRID:SCR_004544) Copy
http://www.bioinf.uni-leipzig.de/Software/RNAplex/
Software tool to rapidly search for short interactions between two long RNAs.
Proper citation: RNAplex (RRID:SCR_002763) Copy
http://web.cmb.usc.edu/people/alber/Software/tomominer/
Software platform for large-scale cryo electron subtomogram classification, alignment, and averaging.
Proper citation: TomoMiner (RRID:SCR_015045) Copy
https://www.hgmd.cf.ac.uk/ac/introduction.php?lang=english
Curated database of known (published) gene lesions responsible for human inherited disease.
Proper citation: Human Gene Mutation Database (RRID:SCR_001621) Copy
A system providing resolvable persistent Uniform Resource Identifiers (URIs) used to identify data for the scientific community, with a current focus on the Life Sciences domain. The provision of resolvable identifiers (URLs) fits well with the Semantic Web vision, and the Linked Data initiative. It provides direct access to the identified data using one chosen physical location (or resource). If more than one physical locations providing the data are recorded in the Registry, then you can access them via the top banner or by using a profile.
Proper citation: Identifiers.org (RRID:SCR_003735) Copy
http://cran.r-project.org/web/packages/circlize/
Software package that implements and enhances circular visualization in R. Due to natural born feature of R to draw statistical graphics, this package can provide more general and flexible way to visualize huge information in circular style.
Proper citation: circlize (RRID:SCR_002141) Copy
A suite of software tools for analyzing and manipulating next-generation sequencing datasets, such as FASTQ, BED and BAM format files. These tools provide a stable and modular platform for data management and analysis.
Proper citation: NGSUtils (RRID:SCR_001236) Copy
Software library and suite of command line tools for working with DNA sequence that takes a k-mer-centric approach to sequence analysis. It is primarily aimed at short-read sequencing data such as that produced by the Illumina platform.
Proper citation: khmer (RRID:SCR_001156) Copy
http://bioconductor.org/packages/release/bioc/html/nondetects.html
Software R package to model and impute non-detects in results of qPCR experiments.Used to directly model non-detects as missing data.
Proper citation: nondetects (RRID:SCR_001702) Copy
http://clip.med.yale.edu/presto/
Software toolkit for processing raw reads from high-throughput sequencing of lymphocyte repertoires.
Proper citation: pRESTO (RRID:SCR_001782) Copy
http://www.yandell-lab.org/software/maker.html
Software genome annotation pipeline. Portable and easily configurable genome annotation pipeline. Used to allow smaller eukaryotic and prokaryotic genomeprojects to independently annotate their genomes and to create genome databases. MAKER identifies repeats, aligns ESTs and proteins to genome, produces ab-initio gene predictions and automatically synthesizes these data into gene annotations having evidence based quality values.
Proper citation: MAKER (RRID:SCR_005309) Copy
A multiplatform open-source software to assist molecular biologists without much expertise in bioinformatics to manage, analyze and visualize their data. UGENE integrates widely used bioinformatics tools within a common user interface. The toolkit supports multiple biological data formats and allows the retrieval of data from remote data sources. It provides visualization modules for biological objects such as annotated genome sequences, Next Generation Sequencing (NGS) assembly data, multiple sequence alignments, phylogenetic trees and 3D structures. Most of the integrated algorithms are tuned for maximum performance by the usage of multithreading and special processor instructions. UGENE includes a visual environment for creating reusable workflows that can be launched on local resources or in a High Performance Computing (HPC) environment. UGENE is written in C++ using the Qt framework. The built-in plugin system and structured UGENE API make it possible to extend the toolkit with new functionality.
Proper citation: Unipro UGENE (RRID:SCR_005579) Copy
https://github.com/Gregor-Mendel-Institute/poolhap
Software tool for inferring haplotypes from pooled sequencing. Enables to infer strain numbers and haplotype frequencies in silico from sequences of pooled samples.
Proper citation: PoolHap (RRID:SCR_012129) Copy
https://imdevsoftware.wordpress.com/imdev/
A software application of RExcel that integrates R into Excel as an embedded additon for omics tasks and analysis. It can be used specifically for tasks concerning multivariate data visualization, exploration, and analysis. imDev has interactive modules for dimensional reduction, prediction, feature selection, analysis of correlation, and generation of networked structures, all of which provide an integrated environment for systems level analysis of multivariate data.
Proper citation: imDEV (RRID:SCR_014674) Copy
A package of over twenty mass spectrometry-based tools primarily geared toward proteomic data analysis and database mining. It can be run from the command line, but is primarily used through a web browser, and there is a public website that allows anyone to use the software without local installation. Tandem mass spectrometry analysis tools are used for database searching and identification of peptides, including post-translationally modified peptides and cross-linked peptides. Support for isotope and label-free quantification from this type of data is provided. MS-Viewer software allows sharing and displaying of annotated spectra from many different tandem mass spectrometry data analysis packages. Other tools include software for analyzing peptide mass fingerprinting data (MS-Fit); prediction of theoretical fragmentation of peptides (MS-Product); theoretical chemical or enzymatic digestion of proteins (MS-Digest); and theoretical modeling of the isotope distribution of any chemical, including peptides (MS-Isotope). Searches using amino acid sequence can be used to identify homologous peptides in a database (MS-Pattern); the use of the combination of amino acid sequence and masses can be used for homologous peptide and protein identification using MS-Homology. Tandem mass spectrometry peak list files can be filtered for the presence of certain peaks or neutral losses using MS-Filter. Given a list of proteins, MS-Bridge can report all potential cross-linked peptide combinations of a specified mass. Given a precursor peptide mass and information about known amino acid presence, absence, or modifications, MS-Comp can report all amino acid combinations that could lead to the observed mass.
Proper citation: Protein Prospector (RRID:SCR_014558) Copy
https://github.com/smajidian/phaseme
Software tool set to assess quality of per read phasing information and help to reduce errors during this process.
Proper citation: PhaseME (RRID:SCR_018739) Copy
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the NIF Resources search. From here you can search through a compilation of resources used by NIF and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that NIF has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on NIF then you can log in from here to get additional features in NIF such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into NIF you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the sources that were queried against in your search that you can investigate further.
Here are the categories present within NIF that you can filter your data on
Here are the subcategories present within this category that you can filter your data on
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.