Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
A database of human, chimpanzee, mouse, and rat proteases and protease inhibitors, as well as as the growing number of hereditary diseases caused by mutations in protease genes. Analysis of the human and mouse genomes has allowed us to annotate 581 human, 580 chimpanzee, 667 mouse, and 655 rat protease genes. Proteases are classified in five different classes according to their mechanism of catalysis. Proteases are a diverse and important group of enzymes representing >2% of the human, chimpanzee, mouse and rat genomes. This group of enzymes is implicated in numerous physiological processes. The importance of proteases is illustrated by the existence of 99 different hereditary diseases due to mutations in protease genes. Furthermore, proteases have been implicated in multiple human pathologies, including vascular diseases, rheumatoid arthritis, neurodegenerative processes, and cancer. During the last ten years, our laboratory has identified and characterized more than 60 human protease genes. Due to the importance of proteolytic enzymes in human physiology and pathology, we have recently introduced the concept of Degradome, as the complete repertoire of proteases expressed by a tissue or organism. Thanks to the recent completion of the human, chimpanzee, mouse, and rat genome sequencing projects, we were able to analyze and compare for the first time the complete protease repertoire in those mammalian organisms, as well as the complement of protease inhibitor genes. This webpage also contains the Supplementary Material of Human and mouse proteases: a comparative genomic approach Nat Rev Genet (2003) 4: 544-558, Genome sequence of the brown Norway rat yields insights into mammalian evolution Nature (2004) 428: 493-521, A genomic analysis of rat proteases and protease inhibitors Genome Res. (2004) 14: 609-622, and Comparative genomic analysis of human and chimpanzee proteases Genomics (2005) 86: 638-647.
Proper citation: Mammalian Degradome Database (RRID:SCR_007624) Copy
Datasets and tools for comparative analysis and annotation of all publicly available genomes from three domains of life in a uniquely integrated context. Plasmids that are not part of a specific microbial genome sequencing project and phage genomes are also included in order to increase its genomic context for comparative analysis. The user interface (see User Interface Map) allows navigating the microbial genome data space along its three key dimensions (genes, genomes, and functions), and groups together the main comparative analysis tools. Microbial genome data analysis in IMG usually starts with the definition of an analysis context in terms of selected genomes, functional annotations, and/or genes, followed by the individual or comparative analysis of genomes, functional annotations, or genes.
Proper citation: IMG (RRID:SCR_007733) Copy
https://leger2.helmholtz-hzi.de/cgi-bin/expLeger.pl
Knowledge database and visualization tool for comparative genomics of pathogenic and non-pathogenic Listeria species.Provides information on gene functions (as annotated or supposed by literature from homologous organisms) , protein expression levels under defined experimental conditions ,subcellular localization of proteins (expected and/or experimentally validated) , biological meaning of genes and proteins based on KEGG, InterPro and Gene Ontology.
Proper citation: LEGER: the post-genome Database for Listeria Research (RRID:SCR_007760) Copy
http://phylomedb.bioinfo.cipf.es
Database for phylomes, that is, complete collections of phylogenetic trees for all proteins encoded in a given genome. It aims at providing a repository of high-quality phylogenies and alignments for proteins encoded in model species. To derive a phylome, each protein encoded in a given genome is used as a seed to retrieve its homologs in other complete genomes. These sequences are aligned and processed to derive reliable phylogenies using several phylogenetic methods. Besides providing the evolutionary history of the gene families, phylomeDB includes phylogeny based predictions of orthology and paralogy relationships., THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025.
Proper citation: PhylomeDB (RRID:SCR_007850) Copy
http://papilio.ab.a.u-tokyo.ac.jp/genome/index.html
Silkbase''s objective is to build a foundation for the complete genome analysis of Bombyx mori.
Proper citation: Silkworm Genome Database (RRID:SCR_008242) Copy
http://cmbi.bjmu.edu.cn/cmbidata/cgf/CGF_Database/cytokine.medic.kumamoto-u.ac.jp/
THIS RESOURCE IS NO LONGER IN SERVICE, documented on August 26, 2016. A collection of cDNA, gene and protein records of cytokines deposited in public databases provides various information about the cytokine members of vertebrates in other databases including NCBI GenBank, Swiss-Prot, UniGene, TIGR (The Institute for Genomic Research) Gene Indices, Ensembl, Entrez Gene, Mouse Genome Informatics (MGI) and Rat Genome Database (RGD). It also provides orthologous relationship of cytokine members and includes novel members identified in the databases.
Proper citation: Cytokine Family Database (RRID:SCR_008134) Copy
http://animal.dna.affrc.go.jp/agp/index.html
Database of comparative gene mapping between species to assist the mapping of the genes related to phenotypic traits in livestock. The linkage maps, cytogenetic maps, polymerase chain reaction primers of pig, cattle, mouse and human, and their references have been included in the database, and the correspondence among species have been stipulated in the database. AGP is an animal genome database developed on a Unix workstation and maintained by a relational database management system. It is a joint project of National Institute of Agrobiological Sciences (NIAS) and Institute of the Society for Techno-innovation of Agriculture, Forestry and Fisheries (STAFF-Institute), under cooperation with other related research institutes. AGP also contains the Pig Expression Data Explorer (PEDE), a database of porcine EST collections derived from full-length cDNA libraries and full-length sequences of the cDNA clones picked from the EST collection. The EST sequences have been clustered and assembled, and their similarity to sequences in RefSeq, and UniGene determined. The PEDE database system was constructed to store sequences and similarity data of swine full-length cDNA libraries and to make them available to users. It provides interfaces for keyword and ID searches of BLAST results and enables users to obtain sequence data and names of clones of interest. Putative SNPs in EST assemblies have been classified according to breed specificity and their effect on coding amino acids, and the assemblies are equipped with an SNP search interface. The database contains porcine nucleotide sequences and cDNA clones that are ready for analyses such as expression in mammalian cells, because of their high likelihood of containing full-length CDS. PEDE will be useful for researchers who want to explore genes that may be responsible for traits such as disease susceptibility. The database also offers information regarding major and minor porcine-specific antigens, which might be investigated in regard to the use of pigs as models in various medical research applications.
Proper citation: Animal Genome Database (RRID:SCR_008165) Copy
THIS RESOURCE IS NO LONGER IN SERVICE, documented on August 20,2019.The COG-database has become a powerful tool in the field of comparative genomics. The construction of this data-base is based on sequence homologies of proteins from different completely sequenced genomes. Highly homologous proteins are assigned to clusters of orthologous groups. The updated collection of orthologous protein sets for prokaryotes and eukaryotes is expected to be a useful platform for functional annotation of newly sequenced genomes, including those of complex eukaryotes, and genome-wide evolutionary studies. The availability of multiple, essentially complete genome sequences of prokaryotes and eukaryotes spurred both the demand and the opportunity for the construction of an evolutionary classification of genes from these genomes. Such a classification system based on orthologous relationships between genes appears to be a natural framework for comparative genomics and should facilitate both functional annotation of genomes and large-scale evolutionary studies. Here is a major update of the previously developed system for delineation of Clusters of Orthologous Groups of proteins (COGs) from the sequenced genomes of prokaryotes and unicellular eukaryotes and the construction of clusters of predicted orthologs for 7 eukaryotic genomes, which we named KOGs after eukaryotic orthologous groups. The COG collection currently consists of 138,458 proteins, which form 4873 COGs and comprise 75% of the 185,505 (predicted) proteins encoded in 66 genomes of unicellular organisms. The eukaryotic orthologous groups (KOGs) include proteins from 7 eukaryotic genomes: three animals (the nematode Caenorhabditis elegans, the fruit fly Drosophila melanogaster and Homo sapiens), one plant, Arabidopsis thaliana, two fungi (Saccharomyces cerevisiae and Schizosaccharomyces pombe), and the intracellular microsporidian parasite Encephalitozoon cuniculi. The current KOG set consists of 4852 clusters of orthologs, which include 59,838 proteins, or approximately 54% of the analyzed eukaryotic 110,655 gene products. Compared to the coverage of the prokaryotic genomes with COGs, a considerably smaller fraction of eukaryotic genes could be included into the KOGs; addition of new eukaryotic genomes is expected to result in substantial increase in the coverage of eukaryotic genomes with KOGs. Examination of the phyletic patterns of KOGs reveals a conserved core represented in all analyzed species and consisting of approximately 20% of the KOG set. This conserved portion of the KOG set is much greater than the ubiquitous portion of the COG set (approximately 1% of the COGs). In part, this difference is probably due to the small number of included eukaryotic genomes, but it could also reflect the relative compactness of eukaryotes as a clade and the greater evolutionary stability of eukaryotic genomes.
Proper citation: Phylogenetic Clusters of Orthologous Groups Ranking (RRID:SCR_008223) Copy
http://www.nisc.nih.gov/projects/comp_seq.html
Generates data for use in developing and refining computational tools for comparing genomic sequence from multiple species. The NISC Comparative Sequencing Program's goal is to establish a data resource consisting of sequences for the same set of targeted genomic regions derived from multiple animal species. The broader program includes plans for a diverse set of analytical studies using the generated sequence and the publication of a series of papers describing the results of those analysis in peer-reviewed journals in a timely fashion. Experimentally, this project involves the shotgun sequencing of mapped BAC clones. For each BAC, an assembly is first performed when a sufficient number of sequence reads have been generated to provide full shotgun coverage of the clone. At that time, the assembled sequence is submitted to the HTGS division of GenBank. Subsequent refinements of the sequence, including the generation of higher-accuracy finished sequence, results in the updating of the sequence record in GenBank. By immediately submitting our BAC-derived sequences to GenBank, it makes their data available as a public service to allow colleagues to speed up their research, consistent with the now well-established routine of sequencing centers participating in the Human Genome Project. However, at the same time, it has made considerable investment in acquiring these mapping and sequence data, including sizable efforts of graduate students, postdoctoral fellows, and other trainees. Furthermore, in most cases, large data sets involving multiple BAC sequences from multiple species must first be generated, often taking many months to accumulate, before the planned analysis can be performed and the resulting papers written and submitted for publication.
Proper citation: Comparative Vertebrate Sequencing (RRID:SCR_008213) Copy
The aim of the PEROXISOME database (PeroxisomeDB) is to gather, organize and integrate curated information on peroxisomal genes, their encoded proteins, their molecular function and metabolic pathway they belong to, and their related disorders. PeroxisomeDB contains the complete peroxisomal proteome of Homo sapiens (encoded by 85 genes) and Saccharomyces cerevisiae (encoded by 61 genes). Now, we have included 34 new organism genomes with the acquisition of 2426 new peroxisomal homolog proteins. PeroxisomeDB 2.0 integrates the peroxisomal metabolome of whole microbody family by the new incorporation of the glycosome proteomes of trypanosomatids and the glyoxysome proteome of Arabidopsis thaliana. The site also provides a Peroxisome Metabolome of peroxisomal genes and proteins, their molecular interactions and metabolic pathways, tools for comparative genomics, predictive tools. Sponsors: Preoxisome Database is funded by Institut de Gntique et deBiologie Molculaire et Cellulaire.
Proper citation: Peroxisome Database (RRID:SCR_008352) Copy
http://cssb.biology.gatech.edu/skolnick/files/gpcr/gpcr.html
THIS RESOURCE IS NO LONGER IN SERVICE, documented on August 19,2019.Database of tertiary structural modeling results of threading assembly refinement (TASSER) method for all 907 G protein-coupled receptors (GPCRs) in human genome. All sequences were collected from GPCR database http://www.gpcr.org/7tm/ and http://www.expasy.org/cgi-bin/lists?7tmrlist.txt. Unlike traditional homology modeling approaches, TASSER modeling does not require solved homologous template structures; moreover, it often refines the structures closer to native. G protein-coupled receptors (GPCRs), encoded by about 5% of human genes, comprise the largest family of integral membrane proteins and act as cell surface receptors responsible for the transduction of endogenous signal into a cellular response. Although tertiary structural information is crucial for function annotation and drug design, there are few experimentally determined GPCR structures. To address this issue, we employ the recently developed threading assembly refinement (TASSER) method to generate structure predictions for all 907 putative GPCRs in the human genome. Unlike traditional homology modeling approaches, TASSER modeling does not require solved homologous template structures; moreover, it often refines the structures closer to native. These features are essential for the comprehensive modeling of all human GPCRs when close homologous templates are absent. Based on a benchmarked confidence score, approximately 820 predicted models should have the correct folds. The majority of GPCR models share the characteristic seven-transmembrane helix topology, but 45 ORFs are predicted to have different structures. This is due to GPCR fragments that are predominantly from extracellular or intracellular domains as well as database annotation errors. Our preliminary validation includes the automated modeling of bovine rhodopsin, the only solved GPCR in the Protein Data Bank. With homologous templates excluded, the final model built by TASSER has a global C(alpha) root-mean-squared deviation from native of 4.6 angstroms, with a root-mean-squared deviation in the transmembrane helix region of 2.1 angstroms. Models of several representative GPCRs are compared with mutagenesis and affinity labeling data, and consistent agreement is demonstrated. Structure clustering of the predicted models shows that GPCRs with similar structures tend to belong to a similar functional class even when their sequences are diverse. These results demonstrate the usefulness and robustness of the in silico models for GPCR functional analysis. Sponsors: GPCR is funded by the University at Buffalo, Buffalo, New York.
Proper citation: Structure modeling of 907 G protein coupled receptors in the human genome (RRID:SCR_008351) Copy
http://genome.jgi.doe.gov/programs/metagenomes/index.jsf
Portal providing access to metagenomics projects, data and tools supported by the DOE Joint Genome Institute (JGI). A primary motivation for metagenomics is that most microbes found in nature exist in complex, interdependent communities and cannot readily be grown in isolation in the laboratory. One can, however, isolate DNA or RNA from the community as a whole, and studies of such communities have revealed a diversity of microbes far beyond those found in culture collections. It is suspected that these uncultivated organisms must harbor considerable as-yet undiscovered genomic, functional, and metabolic features and capabilities. Thus to fully explore microbial genomics, it is imperative that we access the genomes of these elusive players.
Proper citation: Metagenomics Program at JGI (RRID:SCR_008804) Copy
http://fungi.ensembl.org/index.html
The Ensembl Genomes project produces genome databases for important species from across the taxonomic range, using the Ensembl software system. Five sites are now available, one of which is Ensembl Fungi, which houses fungal species. Sponsors: EnsembFungi is a project run by EMBL - EBI to maintain annotation on selected genomes, based on the software developed in the Ensembl project developed jointly by the EBI and the Wellcome Trust Sanger Institute.
Proper citation: Ensembl Fungi (RRID:SCR_008681) Copy
http://plants.ensembl.org/index.html
Ensembl Genomes project produces genome databases for important species from across the taxonomic range, using the Ensembl software system. Five sites are now available, one of which is Ensembl Plants, which houses plant species. Sponsors: EnsembPlants is a project run by EMBL - EBI to maintain annotation on selected genomes, based on the software developed in the Ensembl project developed jointly by the EBI and the Wellcome Trust Sanger Institute.
Proper citation: Ensembl Plants (RRID:SCR_008680) Copy
http://bacteria.ensembl.org/index.html
The Ensembl Genomes project produces genome databases for important species from across the taxonomic range, using the Ensembl software system. Five sites are now available, one of which is Ensembl Bacteria, which houses bacterial species. All bacterial collections in Ensembl Bacteria have been updated with the latest data from ENA and UniProtKB. New genomes have been added to Escherichia/Shigella (3 additional genomes) and Staphylococcus (3 additional genomes). The mapping of array probes has been expanded to all genomes in the Escherichia/Shigella and Staphylococcus collections. Ensembl Bacteria also now features improved interfaces for selecting regions of circular molecules a new visualisation allowing the large scale comparison of multiple genomes. In multi-synteny view, users can select multiple genomes and observe the syntenic relationships between them. Sponsors: EnsembBacteria is a project run by EMBL - EBI to maintain annotation on selected genomes, based on the software developed in the Ensembl project developed jointly by the EBI and the Wellcome Trust Sanger Institute.
Proper citation: Ensembl Bacteria (RRID:SCR_008679) Copy
https://bbgre.brc.iop.kcl.ac.uk
A database and associated tools for investigating the genetic basis of neurodisability. It combines phenotype information from patients with neurodevelopmental and behavioral problems with clinical genetic data, and displays this information on the human genome map. Basic access to genetic information (deletions, duplications) relating to participants with neurodevelopmental disorders is provided without an account; access to the full dataset requires an account. The genetic information that is available to view comprises potentially pathogenic copy number variation across the genome, detected by array comparative genome hybridization (aCGH) using a customized 44K oligonucleotide array.
Proper citation: Brain and Body Genetic Resource Exchange (RRID:SCR_008959) Copy
http://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000674.v1.p1
Human genetics data from an immense (78,000) and ethnically diverse population available for secondary analysis to qualified researchers through the database of Genotypes and Phenotypes (dbGaP). It offers the opportunity to identify potential genetic risks and influences on a broad range of health conditions, particularly those related to aging. The GERA cohort is part of the Research Program on Genes, Environment, and Health (RPGEH), which includes more than 430,000 adult members of the Kaiser Permanente Northern California system. Data from this larger cohort include electronic medical records, behavioral and demographic information from surveys, and saliva samples from 200,000 participants obtained with informed consent for genomic and other analyses. The RPGEH database was made possible largely through early support from the Robert Wood Johnson Foundation to accelerate such health research. The genetic information in the GERA cohort translates into more than 55 billion bits of genetic data. Using newly developed techniques, the researchers conducted genome-wide scans to rapidly identify single nucleotide polymorphisms (SNPs) in the genomes of the people in the GERA cohort. These data will form the basis of genome-wide association studies (GWAS) that can look at hundreds of thousands to millions of SNPs at the same time. The RPGEH then combined the genetic data with information derived from Kaiser Permanente''s comprehensive longitudinal electronic medical records, as well as extensive survey data on participants'' health habits and backgrounds, providing researchers with an unparalleled research resource. As information is added to the Kaiser-UCSF database, the dbGaP database will also be updated.
Proper citation: Resource for Genetic Epidemiology Research on Adult Health and Aging (RRID:SCR_010472) Copy
http://www.genomicus.biologie.ens.fr/genomicus-72.01/cgi-bin/search.pl
A genome browser that enables users to navigate in genomes in several dimensions: linearly along chromosome axes, transversaly across different species, and chronologicaly along evolutionary time.
Proper citation: Genomicus (RRID:SCR_011791) Copy
http://www.informatics.jax.org/
Community model organism database for laboratory mouse and authoritative source for phenotype and functional annotations of mouse genes. MGD includes complete catalog of mouse genes and genome features with integrated access to genetic, genomic and phenotypic information, all serving to further the use of the mouse as a model system for studying human biology and disease. MGD is a major component of the Mouse Genome Informatics.Contains standardized descriptions of mouse phenotypes, associations between mouse models and human genetic diseases, extensive integration of DNA and protein sequence data, normalized representation of genome and genome variant information. Data are obtained and integrated via manual curation of the biomedical literature, direct contributions from individual investigators and downloads from major informatics resource centers. MGD collaborates with the bioinformatics community on the development and use of biomedical ontologies such as the Gene Ontology (GO) and the Mammalian Phenotype (MP) Ontology.
Proper citation: Mouse Genome Database (RRID:SCR_012953) Copy
THIS RESOURCE IS NO LONGER IN SERVICE.Documented on April 14,2022. Database of comprehensive information on the approximately 600 prokaryote species that are present in the human oral cavity. The majority of these species are uncultivated and unnamed, recognized primarily by their 16S rRNA sequences. The HOMD presents a provisional naming scheme for the currently unnamed species so that strain, clone, and probe data from any laboratory can be directly linked to a stably named reference entity. The HOMD links sequence data with phenotypic, phylogenetic, clinical, and bibliographic information. Full and partial oral bacterial genome sequences determined as part of this project and the Human Microbiome Project, are being added to the HOMD as they become available. HOMD offers easy to use tools for viewing all publicly available oral bacterial genomes. Data is also downloadable.
Proper citation: HOMD (RRID:SCR_012770) Copy
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the RRID Resources search. From here you can search through a compilation of resources used by RRID and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that RRID has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on RRID then you can log in from here to get additional features in RRID such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into RRID you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the sources that were queried against in your search that you can investigate further.
Here are the categories present within RRID that you can filter your data on
Here are the subcategories present within this category that you can filter your data on
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.