SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
Publicly available database of the genes, proteins, experimentally-verified interactions and signaling pathways involved in the innate immune response of humans, mice and bovines to microbial infection. The database captures coverage of the innate immunity interactome by integrating known interactions and pathways from major public databases together with manually-curated data into a centralized resource. The database can be mined as a knowledgebase or used with the integrated bioinformatics and visualization tools for systems-level analysis of the innate immune response. Although InnateDB curation focuses on innate immunity-relevant interactions and pathways, it also incorporates detailed annotation on the entire human, mouse and bovine interactomes by integrating data (178,000+ interactions and 3,900+ pathways) from several of the major public interaction and pathway databases. InnateDB also has integrated human, mouse and bovine orthology predictions generated using Ortholuge software. Ortholuge uses a phylogenetic distance-based method to identify possible paralogs in high-throughput orthology predictions. Integrated human and mouse conserved gene order and synteny information has also been determined to provide further support for orthology predictions. InnateDB Capabilities:
* View statistics for manually-curated innate immunity relevant molecular interactions. New manually curated interactions are submitted weekly.
* Search for genes and proteins of interest.
* Search for experimentally-verified molecular interactions by gene/protein name, interaction type, cell type, etc.
* Search genes/interactions belonging to 3,900 pathways.
* Visualize interactions using an intuitive subcellular localization-based layout in Cerebral.
* Upload your own list of genes along with associated gene expression data (from up to 10 experimental conditions) to interactively analyze this data in a molecular interaction network context. Once you have uploaded your data, you will be able to interactively visualize interaction networks with expression data overlaid; carry out Pathway, Gene Ontology and Transcription Factor Binding Site over-representation analyses; construct orthologous interaction networks in other species; and much more.
* Access curated interaction data via a dedicated PSICQUIC webservice.
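PSICQUIC services commonly return interactions in the tab-delimited PSI-MI TAB 2.5 (MITAB 2.5) format, which has 15 columns per record. The sketch below shows one way such a record could be parsed once it has been downloaded. It is a minimal illustration, not InnateDB's own client: the names `Interaction`, `split_field` and `parse_mitab25_line` are hypothetical, the sample identifiers in the usage note are made up, and the naive split on `|` assumes that no field value contains a quoted pipe character.

```python
from typing import List, NamedTuple

# Column order of a MITAB 2.5 record (15 tab-separated columns).
MITAB25_COLUMNS = (
    "id_a", "id_b", "alt_ids_a", "alt_ids_b", "aliases_a", "aliases_b",
    "detection_methods", "first_authors", "publications",
    "taxid_a", "taxid_b", "interaction_types", "source_dbs",
    "interaction_ids", "confidence",
)


class Interaction(NamedTuple):
    """Hypothetical container holding a few fields of one record."""
    id_a: str
    id_b: str
    detection_methods: List[str]
    publications: List[str]


def split_field(field: str) -> List[str]:
    """Split a multi-valued field on '|'; a lone '-' marks an empty field.

    Assumption: values do not contain quoted '|' characters.
    """
    return [] if field == "-" else field.split("|")


def parse_mitab25_line(line: str) -> Interaction:
    """Parse one MITAB 2.5 line into an Interaction."""
    values = line.rstrip("\n").split("\t")
    if len(values) < len(MITAB25_COLUMNS):
        raise ValueError(
            f"expected at least {len(MITAB25_COLUMNS)} columns, got {len(values)}"
        )
    record = dict(zip(MITAB25_COLUMNS, values))
    return Interaction(
        id_a=record["id_a"],
        id_b=record["id_b"],
        detection_methods=split_field(record["detection_methods"]),
        publications=split_field(record["publications"]),
    )
```

For example, a line whose first two columns are `uniprotkb:A` and `uniprotkb:B` (placeholder identifiers) and whose publication column is `pubmed:1|pubmed:2` would yield an `Interaction` with two publication entries.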
Proper citation: InnateDB (RRID:SCR_006714)
Curated protein-protein and genetic interaction repository of raw protein and genetic interactions from major model organism species, with data compiled through comprehensive curation efforts.
Proper citation: Biological General Repository for Interaction Datasets (BioGRID) (RRID:SCR_007393)
http://diseases.jensenlab.org/
Database that integrates evidence on disease-gene associations from automatic text mining, manually curated literature, cancer mutation data, and genome-wide association studies. It also assigns confidence scores that facilitate comparison of the different types and sources of evidence.
Proper citation: DISEASES (RRID:SCR_015664)
http://bioweb.ensam.inra.fr/esther
Database and tools for analysis of protein and nucleic acid sequences belonging to the superfamily of alpha/beta hydrolases homologous to cholinesterases. Covers multiple species, including human, mouse, Caenorhabditis and Drosophila. THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16, 2025.
Proper citation: ESTHER (RRID:SCR_002621)
THIS RESOURCE IS NO LONGER IN SERVICE. Documented on January 14, 2026. Integrated database of genomic, expression and protein data for Drosophila, Anopheles, C. elegans and other organisms. You can run flexible queries, export results and analyze lists of data. FlyMine presents data in categories, with each providing information on a particular type of data (for example Gene Expression or Protein Interactions). Template queries, as well as the QueryBuilder itself, allow you to perform searches that span data from more than one category. Advanced users can use a flexible query interface to construct their own data mining queries across the multiple integrated data sources, to modify existing template queries or to create their own template queries. FlyMine data can also be accessed via an Application Programming Interface (API), with client libraries provided in Perl, Python, Ruby and Java.
Proper citation: FlyMine (RRID:SCR_002694)
http://archive.ics.uci.edu/ml/datasets/EEG+Database
Data set from a large study to examine EEG correlates of genetic predisposition to alcoholism. It contains measurements from 64 electrodes placed on the scalp, sampled at 256 Hz (3.9-msec epoch) for 1 second. There were two groups of subjects: alcoholic and control. Each subject was exposed to either a single stimulus (S1) or to two stimuli (S1 and S2), which were pictures of objects chosen from the 1980 Snodgrass and Vanderwart picture set. When two stimuli were shown, they were presented in either a matched condition where S1 was identical to S2 or in a non-matched condition where S1 differed from S2. There were 122 subjects and each subject completed 120 trials where different stimuli were shown. The electrode positions were located at standard sites (Standard Electrode Position Nomenclature, American Electroencephalographic Association 1990). Zhang et al. (1995) describes the data collection process in detail. There are three versions of the EEG data set.
* The Small Data Set (smni97_eeg_data.tar.gz) contains data for 2 subjects, alcoholic a_co2a0000364 and control c_co2c0000337. For each of the 3 matching paradigms, c_1 (one presentation only), c_m (match to previous presentation) and c_n (no-match to previous presentation), 10 runs are shown.
* The Large Data Set (SMNI_CMI_TRAIN.tar.gz and SMNI_CMI_TEST.tar.gz) contains data for 10 alcoholic and 10 control subjects, with 10 runs per subject per paradigm. The test data used the same 10 alcoholic and 10 control subjects as the training data, but with 10 out-of-sample runs per subject per paradigm.
* The Full Data Set contains all 120 trials for 122 subjects. The entire set of data is about 700 MBytes.
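The recording parameters quoted above imply some simple sizes, which the following sketch works out. It is only arithmetic on the stated figures (64 electrodes, 256 Hz, 1 second, 122 subjects, 120 trials each); the function names are hypothetical, and it assumes every trial is complete, which the description does not guarantee.

```python
# Figures taken from the data set description.
N_ELECTRODES = 64
SAMPLING_RATE_HZ = 256
TRIAL_DURATION_S = 1
N_SUBJECTS = 122
TRIALS_PER_SUBJECT = 120


def sample_interval_ms(rate_hz: int) -> float:
    """Time between consecutive samples, in milliseconds."""
    return 1000.0 / rate_hz


def samples_per_trial(rate_hz: int, duration_s: int) -> int:
    """Number of samples recorded per electrode in one trial."""
    return rate_hz * duration_s


def values_per_trial(n_electrodes: int, rate_hz: int, duration_s: int) -> int:
    """Total measurements in one trial across all electrodes."""
    return n_electrodes * samples_per_trial(rate_hz, duration_s)


if __name__ == "__main__":
    # 1000 / 256 = 3.90625 ms, which the description rounds to 3.9 msec.
    print(sample_interval_ms(SAMPLING_RATE_HZ))
    # 64 electrodes x 256 samples per trial.
    print(values_per_trial(N_ELECTRODES, SAMPLING_RATE_HZ, TRIAL_DURATION_S))
    # Upper bound on trials in the full data set, assuming none are missing.
    print(N_SUBJECTS * TRIALS_PER_SUBJECT)
```

Under these assumptions a single trial can be held as a 64 by 256 array, one row per electrode.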
Proper citation: EEG Database (RRID:SCR_001581)
The EBI genomes pages give access to a large number of complete genomes including bacteria, archaea, viruses, phages, plasmids, viroids and eukaryotes. Methods using whole genome shotgun data are used to gain a large amount of genome coverage for an organism. WGS data for a growing number of organisms are being submitted to DDBJ/EMBL/GenBank. Genome entries have been listed in their appropriate category which may be browsed using the website navigation tool bar on the left. While organelles are all listed in a separate category, any from Eukaryota with chromosome entries are also listed in the Eukaryota page. Within each page, entries are grouped and sorted at the species level with links to the taxonomy page for that species separating each group. Within each species, entries whose source organism has been categorized further are grouped and numbered accordingly. Links are made to: * taxonomy * complete EMBL flatfile * CON files * lists of CON segments * Project * Proteomes pages * FASTA file of Proteins * list of Proteins
Proper citation: EBI Genomes (RRID:SCR_002426)
http://bpg.utoledo.edu/~afedorov/lab/eid.html
Data sets of protein-coding intron-containing genes that contain gene information from humans, mice, rats, and other eukaryotes, as well as genes from species whose genomes have not been completely sequenced. This is a comprehensive and convenient dataset of sequences for computational biologists who study exon-intron gene structures and pre-mRNA splicing. The database is derived from GenBank release 112, and it contains protein-coding genes that harbor introns, along with extensive descriptions of each gene and its DNA and protein sequences, as well as splice motif information. They have created subdatabases of genes whose intron positions have been experimentally determined. The collection also contains data on untranslated regions of gene sequences and intron-less genes. For species with entirely sequenced genomes, species-specific databases have been generated. A novel Mammalian Orthologous Intron Database (MOID) has been introduced which includes the full set of introns that come from orthologous genes that have the same positions relative to the reading frames.
Proper citation: EID: Exon-Intron Database (RRID:SCR_002469)
http://www.sci.unisannio.it/docenti/rampone/
Data set of Homo sapiens exons, introns and splice regions extracted from GenBank Rel.123, intended as standardized material for training computational approaches to gene identification and characterization and for assessing their prediction accuracy. From the complete GenBank (Primate Sequences Division) Rel.123 (162,557 entries), entries of human nuclear DNA including a complete CDS and more than one exon have been selected, and 4523 exons and 3802 introns have been extracted from these entries. Details about extracted exons and introns are reported (locus, number, start and end position in the entry, sequence, length, G+C content, presence of non-AGCT data (nucleotide scan check)). Statistics are also reported (overall nucleotides, average G+C content, nucleotide scan check results, number of introns not starting with GT or not ending with AG, minimum / maximum / average length, length standard deviation). 3799+3799 donor and acceptor sites have been extracted as windows of 140 nucleotides around each splice site. After discarding sequences not including canonical GT-AG junctions (65+74), including insufficient data (not enough material for a 140 nucleotide window) (686+589), including non-AGCT bases (29+30), and redundant (218+226), there are 2796+2880 windows. Finally, there are 271,937+332,296 windows of false splice sites, selected by searching for canonical GT-AG pairs in non-splicing positions. The false sites in a range of +/- 60 from a true splice site are marked as proximal.
Proper citation: HS3D - Homo Sapiens Splice Sites Dataset (RRID:SCR_002939)
Curated lists of genes associated with speech / language phenotypes and with structural or functional abnormalities observed in patient populations. Entrez ID gene information, as well as gene expression profiles from the Allen Brain Atlas, are available. You can also download expression data for a given gene in JSON or XML format.
Proper citation: Speech Language Disorders Database (RRID:SCR_003655)
http://www.linked-neuron-data.org/
Neuroscience data and knowledge from multiple scales and multiple data sources that has been extracted, linked, and organized to support comprehensive understanding of the brain. The core is the CAS Brain Knowledge base, a very large scale brain knowledge base based on automatic knowledge extraction and integration from various data and knowledge sources. The LND platform provides services for neuron data and knowledge extraction, representation, integration, visualization, semantic search and reasoning over the linked neuron data. Currently, LND extracts and integrates semantic data and knowledge from the following resources: PubMed, INCF-CUMBO, Allen Reference Atlas, NIF, NeuroLex, MeSH, DBPedia/Wikipedia, etc.
Proper citation: Linked Neuron Data (RRID:SCR_003658)
http://aws.amazon.com/1000genomes/
A dataset containing the full genomic sequence of 1,700 individuals, freely available for research use. The 1000 Genomes Project is an international research effort coordinated by a consortium of 75 companies and organizations to establish the most detailed catalogue of human genetic variation. The project has grown to 200 terabytes of genomic data, including DNA sequenced from more than 1,700 individuals, that researchers can now access on AWS for use in disease research free of charge. The data is available to all via Amazon S3 and can be found at: http://s3.amazonaws.com/1000genomes The 1000 Genomes Project aims to include the genomes of more than 2,662 individuals from 26 populations around the world, and the NIH will continue to add the remaining genome samples to the data collection this year. Public Data Sets on AWS provide a centralized repository of public data hosted on Amazon Simple Storage Service (Amazon S3). The data can be seamlessly accessed from AWS services such as Amazon Elastic Compute Cloud (Amazon EC2) and Amazon Elastic MapReduce (Amazon EMR), which provide organizations with the highly scalable compute resources needed to take advantage of these large data collections. AWS is storing the public data sets at no charge to the community. Researchers pay only for the additional AWS resources they need for further processing or analysis of the data. All 200 TB of the latest 1000 Genomes Project data is available in a publicly available Amazon S3 bucket. You can access the data via simple HTTP requests, or take advantage of the AWS SDKs in languages such as Ruby, Java, Python, .NET and PHP. Researchers can use the Amazon EC2 utility computing service to dive into this data without the usual capital investment required to work with data at this scale. 
AWS also provides a number of orchestration and automation services to help teams make their research available to others to remix and reuse. Making the data available via a bucket in Amazon S3 also means that customers can crunch the information using Hadoop via Amazon Elastic MapReduce, and take advantage of the growing collection of tools for running bioinformatics job flows, such as CloudBurst and Crossbow.
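As a rough illustration of the "simple HTTP requests" route mentioned above, the sketch below builds a listing URL for the bucket address given in the description and parses the XML that the S3 REST API returns for a bucket listing. The function names are hypothetical, the `prefix` value in the usage note is a placeholder rather than a known path, and the sketch assumes the bucket still permits anonymous listing at that address.

```python
import urllib.parse
import xml.etree.ElementTree as ET
from typing import List, Tuple

# Bucket address quoted in the description above.
BUCKET_URL = "http://s3.amazonaws.com/1000genomes"

# XML namespace used by S3 bucket-listing responses.
S3_NS = "{http://s3.amazonaws.com/doc/2006-03-01/}"


def build_listing_url(prefix: str = "", max_keys: int = 100) -> str:
    """Return a URL asking S3 to list up to max_keys objects under prefix."""
    query = urllib.parse.urlencode({"prefix": prefix, "max-keys": max_keys})
    return f"{BUCKET_URL}?{query}"


def parse_listing(xml_text: str) -> List[Tuple[str, int]]:
    """Extract (key, size in bytes) pairs from a bucket-listing response."""
    root = ET.fromstring(xml_text)
    return [
        (item.findtext(f"{S3_NS}Key"), int(item.findtext(f"{S3_NS}Size")))
        for item in root.iter(f"{S3_NS}Contents")
    ]
```

A caller could fetch `build_listing_url("some/prefix/")` with `urllib.request.urlopen` and pass the decoded body to `parse_listing`; individual objects are then reachable at the bucket URL followed by `/` and the object key.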
Proper citation: 1000 Genomes Project and AWS (RRID:SCR_008801)
http://www.sgn.cornell.edu/bulk/input.pl?modeunigene
Allows users to download Unigene or BAC information using a list of identifiers, or complete datasets via FTP. THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16, 2025.
Proper citation: Sol Genomics Network - Bulk download (RRID:SCR_007161)
http://linux1.softberry.com/spldb/SpliceDB.html
Database of canonical and non-canonical mammalian splice sites. The information about verified splice site sequences for canonical and non-canonical sites is presented with the supporting evidence. Weight matrices were built for the major splice groups, which can be incorporated into gene prediction programs.
Proper citation: SpliceDB (RRID:SCR_006262)
http://proline.bic.nus.edu.sg/dedb/
Database of Drosophila melanogaster exons presented in splicing graph form. Data is based on release 3.2 of the Drosophila melanogaster genome annotations available at FlyBase. The gene structure information extracted from the annotations was checked, clustered and transformed into splicing graphs. The splicing graph form of the gene constructs was then used to classify the various types of alternative splicing events. In addition, Pfam domains were mapped onto the gene structure. Users can query the database from the query page using BLAST, FlyBase Gene Name, FlyBase Gene Symbol, Pfam Accession Number and Pfam Identifier. This allows users to find the Drosophila melanogaster homolog of their gene using a BLAST search and to visualize the alternative splicing variants, if any. Users can also find genes containing a particular domain using the Pfam Accession Numbers and Identifiers.
Proper citation: Drosophila melanogaster Exon Database (RRID:SCR_013441)
http://rarge.psc.riken.jp/rartf/
Database of complete sets of Arabidopsis transcription factors with a variety of information on Arabidopsis thaliana transcription factor families, including full-length cDNA sequences, Ds-tagged mutants, multiple sequence alignments of family members, phylogenetic trees, functional motifs, and so on. In addition, expression profiles of all transcription factor genes are available.
Proper citation: RARTF (RRID:SCR_013457)
http://www.informatics.jax.org/genes.shtml
Searchable database of mouse genes, DNA segments, cytogenetic markers and QTLs. MGI provides access to integrated data on mouse genes and genome features, from sequences and genomic maps to gene expression and disease models.
Proper citation: Genes, Genome Features and Maps (RRID:SCR_017524)
C. elegans RNAi feeding library distributed by Source BioScience Ltd. Designed for genome wide study of gene function in C. elegans through loss of function studies.
Proper citation: C. elegans RNAi Collection (Ahringer) (RRID:SCR_017064)
http://mirwalk.umm.uni-heidelberg.de/
Software tool that stores predicted and experimentally validated microRNA (miRNA)-target interaction pairs. Predictions cover the complete sequence of genes in the human, mouse, and rat genomes. Integrates a comparative platform of miRNA-binding sites resulting from ten different prediction datasets.
Proper citation: miRWalk (RRID:SCR_016509)
https://www.alzforum.org/alzpedia
Collection of brief summaries of various genes and proteins implicated in pathophysiology of Alzheimer’s disease and other neurodegenerative disorders. It will be expanded over time and updated periodically in order to reflect current state of knowledge.
Proper citation: ALZPEDIA (RRID:SCR_017548)