Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
Collection of genome databases for vertebrates and other eukaryotic species with DNA and protein sequence search capabilities. Used to automatically annotate genome, integrate this annotation with other available biological data and make data publicly available via web. Ensembl tools include BLAST, BLAT, BioMart and the Variant Effect Predictor (VEP) for all supported species.
Proper citation: Ensembl (RRID:SCR_002344) Copy
http://droog.gs.washington.edu/polyphred/
Software program that compares fluorescence-based sequences across traces obtained from different individuals to identify heterozygous sites for single nucleotide substitutions. Its functions are integrated with the use of three other programs: Phred (Brent Ewing and Phil Green), Phrap (Phil Green), and Consed (David Gordon and Phil Green). PolyPhred identifies potential heterozygotes using the base calls and peak information provided by Phred and the sequence alignments provided by Phrap. Potential heterozygotes identified by PolyPhred are marked for rapid inspection using the Consed tool.
Proper citation: PolyPhred (RRID:SCR_002337) Copy
A service that provides low cost DNA sequencing. They utilize microfluidic technology.
Proper citation: Functional Biosciences (RRID:SCR_000943) Copy
THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 23,2022. Open source viewer / browser software for the SAM / BAM format commonly used in the assembly tasks of Next Generation Sequencing data.
Proper citation: GenoViewer (RRID:SCR_001203) Copy
http://www.broadinstitute.org/science/programs/genome-biology/computational-rd/somaticcall-manual
Software program that finds single-base differences (substitutions) between sequence data from tumor and matched normal samples. It is designed to be highly stringent, so as to achieve a low false positive rate. It takes as input a BAM file for each sample, and produces as output a list of differences (somatic mutations). Note: This software package is no longer supported and information on this page is provided for archival purposes only.
Proper citation: SomaticCall (RRID:SCR_001196) Copy
http://www.bioinformatics.org/peakanalyzer/wiki/
A set of standalone software programs for the automated processing of any genomic loci, with an emphasis on datasets consisting of ChIP-derived signal peaks. The software is able to identify individual binding / modification sites from enrichment loci, retrieve peak region sequences for motif discovery, and integrate experimental data with different classes of annotated elements throughout the genome. PeakAnalyzer requires a peak file and a feature annotation file in BED or GTF format. Complete annotation files for the current builds of the human (HG19) and mouse (MM9) genomes are provided with the software distribution.
Proper citation: PeakAnalyzer (RRID:SCR_001194) Copy
A database of hierarchical classification of enzymes that relates specific sequence-structure features to specific chemical capabilities. The SFLD classifies evolutionarily related enzymes according to shared chemical functions and maps these shared functions to conserved active site features. The classification is hierarchical, where broader levels encompass more distantly related proteins with fewer shared features. It thus serves as the analysis and archive site for superfamilies targeted by the Enzyme Function Initiative, and is developed by the Babbitt Laboratory in collaboration with the UCSF Resource for Biocomputing, Visualization, and Informatics. The resource also provides a collection of tools and data for investigating sequence-structure-function relationships and hypothesizing function.
Proper citation: Structure-function linkage database (RRID:SCR_001375) Copy
http://ishtar.sourceforge.net/
A program for designing primer pairs that amplify multiple target sequences using DNA thermodynamics and one class support vector machines. Written in Python.
Proper citation: Ishtar (RRID:SCR_000538) Copy
http://sourceforge.net/projects/denovogear/
A software for detecting de novo mutations using sequencing data. It utilizes likelihood-based error modeling to reduce the false positive rate of mutative discovery in exome analysis. It also uses fragment information to identify the parental origin of germ-line mutations.
Proper citation: DeNovoGear (RRID:SCR_000670) Copy
http://compbio.soe.ucsc.edu/yeast_introns.html
Database of information about the spliceosomal introns of the yeast Saccharomyces cerevisiae. Listed are known spliceosomal introns in the yeast genome and the splice sites actually used are documented. Through the use of microarrays designed to monitor splicing, they are beginning to identify and analyze splice site context in terms of the nature and activities of the trans-acting factors that mediate splice site recognition. In version 3.0, expression data that relates to the efficiency of splicing relative to other processes in strains of yeast lacking nonessential splicing factors is included. These data are displayed on each intron page for browsing and can be downloaded for other types of analysis.
Proper citation: Yeast Intron Database (RRID:SCR_007144) Copy
http://net.icgeb.org/benchmark/
It was created in order to create standard datasets on which the performance of machine learning methods can be compared. The collection contains datasets of sequences and structures, each subdivided into positive/negative training/test sets. Such a subdivision is called a classification task. Typical tasks include the classification of structural domains in the SCOP and CATH databases based on their sequences, as fell as various functional and taxonomic classification tasks. Running a performance evaluation test on an entire database can include many different classification tasks. These ensembles of classification tasks are encoded in a simple matrix format - called the cast matrix or membership table - that specifies the role of each sequence (or structure) in the different calculations. Each column of this matrix is a subdivision of the objects (rows) into positive/negative training/test sets. Typically, a database record contains such an ensemble of classification tasks, encoded in a single cast matrix. In addition, there is a collection of distance matrices that contain an all vs. all comparison of the datasets using methods as BLAST, Smith-Waterman, 3D-comparisons etc. Evaluation of a method on a given database consists of calculating a performance measure such as a receiver operating curve (ROC) AUC value. Results of evaluation are deposited along with the data, each dataset is evaluated at least by one classification method, such as 1NN (nearest neighbour) or SVM (support vector machines), ANN (artificial neural networks), RF (random forests) etc.. There are small datasets meant for program developers, as well as downloadable programs for various classification algorithms.
Proper citation: Protein Classification Benchmark Collection (RRID:SCR_007561) Copy
http://cmckb.cellmigration.org
It is a database of keys facts about proteins, families, and complexes involved in cell migration. This ongoing project provides a large amount of automated and curated data, collected from numerous online resources that are updated monthly. These data include names, synonyms, sequence information, summaries, CMC research data, reagents, structures, as well as protein family and complex details. CMKB''s ultimate goal is to create a database that will enable the cell migration community to conveniently access significant information about molecules of interest. This will also serve as a stepping stone to pathway analysis and demonstrate how these molecules coordinate with one another during cell adhesion and movement. Sponsors: This resource is supported by the Cell Migration Consortium.
Proper citation: CMKB (RRID:SCR_007229) Copy
http://ecoliwiki.net/colipedia/index.php/T4-like_genome_database
THIS RESOURCE IS NO LONGER IN SERVICE, documented August 22, 2016. A database of information on bacterial phages. It contains multiple phage genomes, which users can BLAST and MegaBLAST, and also hosts a Phage Forum in which users can discuss phage data. Interactive browsing of completed phage genomes is available using the program. The browser allows users to scan the genome for particular features and to download sequence information plus analyses of those features. Views of the genome are generated showing named genes BLAST similarities to other phages predicted tRNAs and other sequence features.
Proper citation: T4-like genome database (RRID:SCR_005367) Copy
http://genome.jgi.doe.gov/programs/plants/index.jsf
The goal of the DOE JGI Plant Genome Program is to shed light on the fundamental biology of photosynthesis and transduction of solar to chemical energy. Other areas of interest include characterizing: * Ecosystems and the role of terrestrial plants and oceanic phytoplankton-in carbon sequestration. * The role of plants in coping with toxic pollutants in soils by hyper-accumulation and detoxification. * Feedstocks for biofuels, e.g., biodiesel from soybean; cellulosic ethanol from perennial grasses. * The ability to respond to environmental change (e.g., loss of diversity from monoculture produces vulnerabilities; nitrogen fixing nodules in legumes reduce fertilizer need). * The generation of useful secondary metabolites (produced largely for disease resistance)- for positive/negative control in agriculture, with attendant influence on global carbon cycle. The Plant Genome Program accomplishes the above through the following activities: # Sequence. Produce genome sequences of key plant (and algal) species to accelerate biofuel development and understand response to climate change. # Function. Develop datasets (and synthetic biology tools) to elucidate functional elements in plant genomes, with special focus on handful of flagship genomes. # Variation. Characterize natural genomic variation in plants (and their associated microbiomes), and relate to biofuel sustainability and adaptation to climate change. # Integration. Provide a centralized hub for the retrieval and deep integrated analysis of plant genome datasets.
Proper citation: Plant Genome Resource at JGI (RRID:SCR_005315) Copy
Database of known and predicted protein interactions. The interactions include direct (physical) and indirect (functional) associations and are derived from four sources: Genomic Context, High-throughput experiments, (Conserved) Coexpression, and previous knowledge. STRING quantitatively integrates interaction data from these sources for a large number of organisms, and transfers information between these organisms where applicable. The database currently covers 5''214''234 proteins from 1133 organisms. (2013)
Proper citation: STRING (RRID:SCR_005223) Copy
http://bioinfo.iitk.ac.in/MIPModDB/
This is a database of comparative protein structure models of MIP (Major Intrinsic Protein) family of proteins. The nearly completed sets of MIPs have been identified from the completed genome sequence of organisms available at NCBI. The structural models of MIP proteins were created by defined protocol. The database aims to provide key information of MIPs in particular based on sequence as well as structures. This will further help to decipher the function of uncharacterized MIPs. For each MIP entry, this database contains information about the source, gene structure, sequence features, substitutions in the conserved NPA motifs, structural model, the residues forming the selectivity filter and channel radius profile. For selected set of MIPs, it is possible to derive structure-based sequence alignment and evolutionary relationship. Sequences and structures of selected MIPs can be downloaded from MIPModDB database.
Proper citation: MIPModDB (RRID:SCR_006058) Copy
http://operons.ibt.unam.mx/OperonPredictor/
The Prokaryotic Operon DataBase (ProOpDB) constitutes one of the most precise and complete repository of operon predictions in our days. Using our novel and highly accurate operon algorithm, we have predicted the operon structures of more than 1,200 prokaryotic genomes. ProOpDB offers diverse alternatives by which a set of operon predictions can be retrieved including: i) organism name, ii) metabolic pathways, as defined by the KEGG database, iii) gene orthology, as defined by the COG database, iv) conserved protein motifs, as defined by the Pfam database, v) reference gene, vi) reference operon, among others. In order to limit the operon output to non-redundant organisms, ProOpDB offers an efficient protocol to select the more representative organisms based on a precompiled phylogenetic distances matrix. In addition, the ProOpDB operon predictions are used directly as the input data of our Gene Context Tool (GeConT) to visualize their genomic context and retrieve the sequence of their corresponding 5�� regulatory regions, as well as the nucleotide or amino acid sequences of their genes. The prediction algorithm The algorithm is a multilayer perceptron neural network (MLP) classifier, that used as input the intergenic distances of contiguous genes and the functional relationship scores of the STRING database between the different groups of orthologous proteins, as defined in the COG database. Nevertheless, the operon prediction of our method is not restricted to only those genes with a COG assignation, since we successfully defined new groups of orthologous genes and obtained, by extrapolation, a set of equivalent STRING-like scores based on conserved gene pairs on different genomes. Since the STRING functional relationships scores are determined in an un-bias manner and efficiently integrates a large amount of information coming from different sources and kind of evidences, the prediction made by our MLP are considerably less influenced by the bias imposed in the training procedure using one specific organism.
Proper citation: ProOpDB (RRID:SCR_006111) Copy
http://prorepeat.bioinformatics.nl/
ProRepeat is an integrated curated repository and analysis platform for in-depth research on the biological characteristics of amino acid tandem repeats. ProRepeat collects repeats from all proteins included in the UniProt knowledgebase, together with 85 completely sequenced eukaryotic proteomes contained within the RefSeq collection. It contains non-redundant perfect tandem repeats, approximate tandem repeats and simple, low-complexity sequences, covering the majority of the amino acid tandem repeat patterns found in proteins. The ProRepeat web interface allows querying the repeat database using repeat characteristics like repeat unit and length, number of repetitions of the repeat unit and position of the repeat in the protein. Users can also search for repeats by the characteristics of repeat containing proteins, such as entry ID, protein description, sequence length, gene name and taxon. ProRepeat offers powerful analysis tools for finding biological interesting properties of repeats, such as the strong position bias of leucine repeats in the N-terminus of eukaryotic protein sequences, the differences of repeat abundance among proteomes, the functional classification of repeat containing proteins and GC content constrains of repeats' corresponding codons.
Proper citation: ProRepeat (RRID:SCR_006113) Copy
High quality ribosomal RNA databases providing comprehensive, quality checked and regularly updated datasets of aligned small (16S/18S, SSU) and large subunit (23S/28S, LSU) ribosomal RNA (rRNA) sequences for all three domains of life (Bacteria, Archaea and Eukarya). Supplementary services include a rRNA gene aligner, online tools for probe and primer evaluation and optimized browsing, searching and downloading on the website. The extensively curated SILVA taxonomy and the new non-redundant SILVA datasets provide an ideal reference for high-throughput classification of data from next-generation sequencing approaches. Alignment tool, SINA, is available for download as well as available for use online.
Proper citation: SILVA (RRID:SCR_006423) Copy
ViralZone is a SIB Swiss Institute of Bioinformatics web-resource for all viral genus and families, providing general molecular and epidemiological information, along with virion and genome figures. Each virus or family page gives an easy access to UniProtKB/Swiss-Prot viral protein entries. ViralZone project is handled by the virus program of SwissProt group. Proteins popups were developed in collaboration with Prof. Christian von Mering and Andrea Franceschini, Bioinformatics Group , Institute of Molecular Life Sciences, University of Zurich, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland, funded in part by the SIB Swiss Institute of bioinformatics. All pictures in ViralZone are copyright of the SIB Swiss Institute of Bioinformatics.
Proper citation: ViralZone (RRID:SCR_006563) Copy
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the RRID Resources search. From here you can search through a compilation of resources used by RRID and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that RRID has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on RRID then you can log in from here to get additional features in RRID such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into RRID you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the sources that were queried against in your search that you can investigate further.
Here are the categories present within RRID that you can filter your data on
Here are the subcategories present within this category that you can filter your data on
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.