RRID | Searching in Community Resources

InnateDB

RRID:SCR_006714

http://www.innatedb.com

Publicly available database of the genes, proteins, experimentally-verified interactions and signaling pathways involved in the innate immune response of humans, mice and bovines to microbial infection. The database captures coverage of the innate immunity interactome by integrating known interactions and pathways from major public databases together with manually-curated data into a centralized resource. The database can be mined as a knowledgebase or used with the integrated bioinformatics and visualization tools for the systems level analysis of the innate immune response. Although InnateDB curation focuses on innate immunity-relevant interactions and pathways, it also incorporates detailed annotation on the entire human, mouse and bovine interactomes by integrating data (178,000+ interactions & 3,900+ pathways) from several of the major public interaction and pathway databases. InnateDB also has integrated human, mouse and bovine orthology predictions generated using Ortholgue software. Ortholgue uses a phylogenetic distance-based method to identify possible paralogs in high-throughput orthology predictions. Integrated human and mouse conserved gene order and synteny information has also been determined to provide further support for orthology predictions. InnateDB Capabilities: * View statistics for manually-curated innate immunity relevant molecular interactions. New manually curated interactions are submitted weekly. * Search for genes and proteins of interest. * Search for experimentally-verified molecular interactions by gene/protein name, interaction type, cell type, etc. * Search genes/interactions belonging to 3,900 pathways. * Visualize interactions using an intuitive subcellular localization-based layout in Cerebral. * Upload your own list of genes along with associated gene expression data (from up to 10 experimental conditions) to interactively analyze this data in a molecular interaction network context. Once you have uploaded your data, you will be able to interactively visualize interaction networks with expression data overlaid; carry out Pathway, Gene Ontology and Transcription Factor Binding Site over-representation analyses; construct orthologous interaction networks in other species; and much more. * Access curated interaction data via a dedicated PSICQUIC webservice.

Proper citation: InnateDB (RRID:SCR_006714) Copy

Source: SciCrunch Registry

Biological General Repository for Interaction Datasets (BioGRID)

RRID:SCR_007393

http://www.thebiogrid.org/

Curated protein-protein and genetic interaction repository of raw protein and genetic interactions from major model organism species, with data compiled through comprehensive curation efforts.

Proper citation: Biological General Repository for Interaction Datasets (BioGRID) (RRID:SCR_007393) Copy

Source: SciCrunch Registry

DISEASES

RRID:SCR_015664

http://diseases.jensenlab.org/

Database that integrates evidence on disease-gene associations from automatic text mining, manually curated literature, cancer mutation data, and genome-wide association studies. It also assigns confidence scores that facilitate comparison of the different types and sources of evidence.

Proper citation: DISEASES (RRID:SCR_015664) Copy

Source: SciCrunch Registry

ESTHER

RRID:SCR_002621

http://bioweb.ensam.inra.fr/esther

Database and tools for analysis of protein and nucleic acid sequences belonging to superfamily of alpha/beta hydrolases homologous to cholinesterases. Covers multiple species, including human, mouse caenorhabditis and drosophila., THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025.

Proper citation: ESTHER (RRID:SCR_002621) Copy

Source: SciCrunch Registry

FlyMine

RRID:SCR_002694

http://www.flymine.org/

THIS RESOURCE IS NO LONGER IN SERVICE. Documented on January 14,2026. Integrated database of genomic, expression and protein data for Drosophila, Anopheles, C. elegans and other organisms. You can run flexible queries, export results and analyze lists of data. FlyMine presents data in categories, with each providing information on a particular type of data (for example Gene Expression or Protein Interactions). Template queries, as well as the QueryBuilder itself, allow you to perform searches that span data from more than one category. Advanced users can use a flexible query interface to construct their own data mining queries across the multiple integrated data sources, to modify existing template queries or to create your own template queries. Access our FlyMine data via our Application Programming Interface (API). We provide client libraries in the following languages: Perl, Python, Ruby and & Java API

Proper citation: FlyMine (RRID:SCR_002694) Copy

Source: SciCrunch Registry

EEG Database

RRID:SCR_001581

http://archive.ics.uci.edu/ml/datasets/EEG+Database

Data set from a large study to examine EEG correlates of genetic predisposition to alcoholism. It contains measurements from 64 electrodes placed on the scalp sampled at 256 Hz (3.9-msec epoch) for 1 second. There were two groups of subjects: alcoholic and control. Each subject was exposed to either a single stimulus (S1) or to two stimuli (S1 and S2) which were pictures of objects chosen from the 1980 Snodgrass and Vanderwart picture set. When two stimuli were shown, they were presented in either a matched condition where S1 was identical to S2 or in a non-matched condition where S1 differed from S2. There were 122 subjects and each subject completed 120 trials where different stimuli were shown. The electrode positions were located at standard sites (Standard Electrode Position Nomenclature, American Electroencephalographic Association 1990). Zhang et al. (1995) describes in detail the data collection process. There are three versions of the EEG data set. * The Small Data Set (smni97_eeg_data.tar.gz) contains data for the 2 subjects, alcoholic a_co2a0000364 and control c_co2c0000337. For each of the 3 matching paradigms, c_1 (one presentation only), c_m (match to previous presentation) and c_n (no-match to previous presentation), 10 runs are shown. * The Large Data Set (SMNI_CMI_TRAIN.tar.gz and SMNI_CMI_TEST.tar.gz) contains data for 10 alcoholic and 10 control subjects, with 10 runs per subject per paradigm. The test data used the same 10 alcoholic and 10 control subjects as with the training data, but with 10 out-of-sample runs per subject per paradigm. * The Full Data Set contains all 120 trials for 122 subjects. The entire set of data is about 700 MBytes.

Proper citation: EEG Database (RRID:SCR_001581) Copy

Source: SciCrunch Registry

EBI Genomes

RRID:SCR_002426

http://www.ebi.ac.uk/genomes

The EBI genomes pages give access to a large number of complete genomes including bacteria, archaea, viruses, phages, plasmids, viroids and eukaryotes. Methods using whole genome shotgun data are used to gain a large amount of genome coverage for an organism. WGS data for a growing number of organisms are being submitted to DDBJ/EMBL/GenBank. Genome entries have been listed in their appropriate category which may be browsed using the website navigation tool bar on the left. While organelles are all listed in a separate category, any from Eukaryota with chromosome entries are also listed in the Eukaryota page. Within each page, entries are grouped and sorted at the species level with links to the taxonomy page for that species separating each group. Within each species, entries whose source organism has been categorized further are grouped and numbered accordingly. Links are made to: * taxonomy * complete EMBL flatfile * CON files * lists of CON segments * Project * Proteomes pages * FASTA file of Proteins * list of Proteins

Proper citation: EBI Genomes (RRID:SCR_002426) Copy

Source: SciCrunch Registry

EID: Exon-Intron Database

RRID:SCR_002469

http://bpg.utoledo.edu/~afedorov/lab/eid.html

Data sets of protein-coding intron-containing genes that contain gene information from humans, mice, rats, and other eukaryotes, as well as genes from species whose genomes have not been completely sequenced. This is a comprehensive and convenient dataset of sequences for computational biologists who study exon-intron gene structures and pre-mRNA splicing. The database is derived from GenBank release 112, and it contains protein-coding genes that harbor introns, along with extensive descriptions of each gene and its DNA and protein sequences, as well as splice motif information. They have created subdatabases of genes whose intron positions have been experimentally determined. The collection also contains data on untranslated regions of gene sequences and intron-less genes. For species with entirely sequenced genomes, species-specific databases have been generated. A novel Mammalian Orthologous Intron Database (MOID) has been introduced which includes the full set of introns that come from orthologous genes that have the same positions relative to the reading frames.

Proper citation: EID: Exon-Intron Database (RRID:SCR_002469) Copy

Source: SciCrunch Registry

HS3D - Homo Sapiens Splice Sites Dataset

RRID:SCR_002939

http://www.sci.unisannio.it/docenti/rampone/

Data set of Homo Sapiens Exons, Introns and Splice regions extracted from GenBank Rel.123 with an aim of giving standardized material to train and to assess the prediction accuracy of computational approaches for gene identification and characterization. From the complete GenBank (Primate Sequences Division) Rel.123 (162,557 entries), entries of Human Nuclear DNA including Complete CDS and more than one Exon have been selected, and 4523 exons and 3802 introns have been extracted from these entries. Details about extracted exons and introns are reported (Locus, number, Start and End position in the entry, sequence, length, G+C content, presence of not AGCT data (nucleotide scan check)). Statistics are also reported (overall nucleotides, average G+C content, nucleotide scan check results, number of not GT starting / AG ending introns, minimum / maximum / average length, length standard deviation). 3799+3799 donor and acceptor sites, as windows of 140 nucleotides around each splice site have been extracted. After discarding sequences not including canonical GTAG junctions (65+74), including insufficient data (not enough material for a 140 nucleotide window) (686+589), including not AGCT bases (29+30), and redundant (218+226) there are 2796+ 2880 windows. Finally, there are 271,937 + 332,296 windows of false splice sites, selected by searching canonical GTAG pairs in not splicing positions. The false sites in a range of +/- 60 from a true splice site are marked as proximal.

Proper citation: HS3D - Homo Sapiens Splice Sites Dataset (RRID:SCR_002939) Copy

Source: SciCrunch Registry

Speech Language Disorders Database

RRID:SCR_003655

http://qnl.bu.edu/SLDB

Curated lists of genes associated to speech / language phenotypes and structural or functional abnormalities observed in patient populations. Entrez ID gene information, as well as gene expression profiles from the Allen Brain Atlas are available. You can also download expression data for a given gene in JSON or XML format.

Proper citation: Speech Language Disorders Database (RRID:SCR_003655) Copy

Source: SciCrunch Registry

Linked Neuron Data

RRID:SCR_003658

http://www.linked-neuron-data.org/

Neuroscience data and knowledge from multiple scales and multiple data sources that has been extracted, linked, and organized to support comprehensive understanding of the brain. The core is the CAS Brain Knowledge base, a very large scale brain knowledge base based on automatic knowledge extraction and integration from various data and knowledge sources. The LND platform provides services for neuron data and knowledge extraction, representation, integration, visualization, semantic search and reasoning over the linked neuron data. Currently, LND extracts and integrates semantic data and knowledge from the following resources: PubMed, INCF-CUMBO, Allen Reference Atlas, NIF, NeuroLex, MeSH, DBPedia/Wikipedia, etc.

Proper citation: Linked Neuron Data (RRID:SCR_003658) Copy

Source: SciCrunch Registry

1000 Genomes Project and AWS

RRID:SCR_008801

http://aws.amazon.com/1000genomes/

A dataset containing the full genomic sequence of 1,700 individuals, freely available for research use. The 1000 Genomes Project is an international research effort coordinated by a consortium of 75 companies and organizations to establish the most detailed catalogue of human genetic variation. The project has grown to 200 terabytes of genomic data including DNA sequenced from more than 1,700 individuals that researchers can now access on AWS for use in disease research free of charge. The dataset containing the full genomic sequence of 1,700 individuals is now available to all via Amazon S3. The data can be found at: http://s3.amazonaws.com/1000genomes The 1000 Genomes Project aims to include the genomes of more than 2,662 individuals from 26 populations around the world, and the NIH will continue to add the remaining genome samples to the data collection this year. Public Data Sets on AWS provide a centralized repository of public data hosted on Amazon Simple Storage Service (Amazon S3). The data can be seamlessly accessed from AWS services such Amazon Elastic Compute Cloud (Amazon EC2) and Amazon Elastic MapReduce (Amazon EMR), which provide organizations with the highly scalable compute resources needed to take advantage of these large data collections. AWS is storing the public data sets at no charge to the community. Researchers pay only for the additional AWS resources they need for further processing or analysis of the data. All 200 TB of the latest 1000 Genomes Project data is available in a publicly available Amazon S3 bucket. You can access the data via simple HTTP requests, or take advantage of the AWS SDKs in languages such as Ruby, Java, Python, .NET and PHP. Researchers can use the Amazon EC2 utility computing service to dive into this data without the usual capital investment required to work with data at this scale. AWS also provides a number of orchestration and automation services to help teams make their research available to others to remix and reuse. Making the data available via a bucket in Amazon S3 also means that customers can crunch the information using Hadoop via Amazon Elastic MapReduce, and take advantage of the growing collection of tools for running bioinformatics job flows, such as CloudBurst and Crossbow.

Proper citation: 1000 Genomes Project and AWS (RRID:SCR_008801) Copy

Source: SciCrunch Registry

Sol Genomics Network - Bulk download

RRID:SCR_007161

http://www.sgn.cornell.edu/bulk/input.pl?modeunigene

Allows users to download Unigene or BAC information using a list of identifiers or complete datasets with FTP., THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025.

Proper citation: Sol Genomics Network - Bulk download (RRID:SCR_007161) Copy

Source: SciCrunch Registry

SpliceDB

RRID:SCR_006262

http://linux1.softberry.com/spldb/SpliceDB.html

Database of canonical and non-canonical mammalian splice sites. The information about verified splice site sequences for canonical and non-canonical sites is presented with the supporting evidence. Weight matrices were built for the major splice groups, which can be incorporated into gene prediction programs.

Proper citation: SpliceDB (RRID:SCR_006262) Copy

Source: SciCrunch Registry

Drosophila melanogaster Exon Database

RRID:SCR_013441

http://proline.bic.nus.edu.sg/dedb/

Database on Drosophila melanogaster exons presented in a splicing graph form. Data is based on release 3.2 of the Drosophila melanogaster genome annotations available at FlyBase. The gene structure information extracted from the annotations were checked, clustered and transformed into splicing graph. The splicing graph form of the gene constructs were then used for classification of the various types of alternative splicing events. In addition, Pfam domains were mapped onto the gene structure. Users can query the database using the query page using BLAST, FlyBase Gene Name, FlyBase Gene Symbol, Pfam Accession Number and Pfam Identifier. This allows users to determine the Drosophila melanogaster homology of their gene using a BLAST search and to visualize the alternative splicing variants if any. Users can also determine genes containing a particular domain using the Pfam Accession Numbers and Identifiers.

Proper citation: Drosophila melanogaster Exon Database (RRID:SCR_013441) Copy

Source: SciCrunch Registry

RARTF

RRID:SCR_013457

http://rarge.psc.riken.jp/rartf/

Database of complete sets of Arabidopsis transcription factors with a variety of information on Arabidopsis thaliana transcription factor families including: full-length cDNA sequences, Ds-tagged mutants, multiple sequences alignments of family members, phylogenic trees, functional motifs, and so on. In addition, expression profiles of all transcription factor genes are available.

Proper citation: RARTF (RRID:SCR_013457) Copy

Source: SciCrunch Registry

Genes, Genome Features and Maps

RRID:SCR_017524

http://www.informatics.jax.org/genes.shtml

Searchable database of mouse genes, DNA segments, cytogenetic markers and QTLs. MGI provides access to integrated data on mouse genes and genome features, from sequences and genomic maps to gene expression and disease models.

Proper citation: Genes, Genome Features and Maps (RRID:SCR_017524) Copy

Source: SciCrunch Registry

C. elegans RNAi Collection (Ahringer)

RRID:SCR_017064

https://www.sourcebioscience.com/products/life-sciences-research/clones/rnai-resources/c-elegans-rnai-collection-ahringer/

C. elegans RNAi feeding library distributed by Source BioScience Ltd. Designed for genome wide study of gene function in C. elegans through loss of function studies.

Proper citation: C. elegans RNAi Collection (Ahringer) (RRID:SCR_017064) Copy

Source: SciCrunch Registry

miRWalk

RRID:SCR_016509

http://mirwalk.umm.uni-heidelberg.de/

Software tool to store the predicted and the experimentally validated microRNA (miRNA)-target interaction pairs. Predictions within the complete sequence of genes of human, mouse, and rat genomes. Integrates a comparative platform of miRNA-binding sites resulting from ten different prediction datasets.

Proper citation: miRWalk (RRID:SCR_016509) Copy

Source: SciCrunch Registry

ALZPEDIA

RRID:SCR_017548

https://www.alzforum.org/alzpedia

Collection of brief summaries of various genes and proteins implicated in pathophysiology of Alzheimer’s disease and other neurodegenerative disorders. It will be expanded over time and updated periodically in order to reflect current state of knowledge.

Proper citation: ALZPEDIA (RRID:SCR_017548) Copy

Source: SciCrunch Registry

Searching the RRID Resource Information Network

Our searching services are busy right now. Please try again later

Log in

Leaving Community

About

Community Resources

More Resources

Literature

Log in

Tools Select Another Resource Report Type

Options

Current Facets and Filters

Facets

Recent searches

RRID:SCR_006714

RRID:SCR_007393

RRID:SCR_015664

RRID:SCR_002621

RRID:SCR_002694

RRID:SCR_001581

RRID:SCR_002426

RRID:SCR_002469

RRID:SCR_002939

RRID:SCR_003655

RRID:SCR_003658

RRID:SCR_008801

RRID:SCR_007161

RRID:SCR_006262

RRID:SCR_013441

RRID:SCR_013457

RRID:SCR_017524

RRID:SCR_017064

RRID:SCR_016509

RRID:SCR_017548

RRID Portal Resources

Navigation

Logging in and Registering

Searching

Save Your Search

Query Expansion

Collections

Sources

Categories

Subcategories

Further Questions

Category Graph

About

Recent News Entries

Contact Us

SciCrunch