Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
http://www.broadinstitute.org/annotation/tetraodon/
This database have been funded by the National Human Genome Research Institute (NHGRI) to produce shotgun sequence of the Tetraodon nigriviridis genome. The strategy involves Whole Genome Shotgun (WGS) sequencing, in which sequence from the entire genome is generated. Whole genome shotgun libraries were prepared from Tetraodon genomic DNA obtained from the laboratory of Jean Weissenbach at Genoscope. Additional sequence data of approximately 2.5X coverage of Tetraodon has also been generated by Genoscope in plasmid and BAC end reads. Broad and Genoscope intend to pool their data and generate whole genome assemblies. Tetraodon nigroviridis is a freshwater pufferfish of the order Tetraodontiformes and lives in the rivers and estuaries of Indonesia, Malaysia and India. This species is 20-30 million years distant from Fugu rubripes, a marine pufferfish from the same family. The gene repertoire of T. nigroviridis is very similar to that of other vertebrates. However, its relatively small genome of 385 Mb is eight times more compact than that of human, mostly because intergenic and intronic sequences are reduced in size compared to other vertebrate genomes. These genome characteristics along with the large evolutionary distance between bony fish and mammals make Tetraodon a compact vertebrate reference genome - a powerful tool for comparative genetics and for quick and reliable identification of human genes.
Proper citation: Tetraodon nigroviridis Database (RRID:SCR_007123) Copy
Alternative splicing essentially increases the diversity of the transcriptome and has important implications for physiology, development and the genesis of diseases. This resource uses a different approach to investigate alternative splicing (instead of the conventional case-by case fashion) and integrates all transcripts derived from a gene into a single splicing graph. ASG is a database of splicing graphs for human genes, using transcript information from various major sources (Ensembl, RefSeq, STACK, TIGR and UniGene). Each transcript corresponds to a path in the graph, and alternative splicing is displayed by bifurcations. This representation preserves the relationships between different splicing variants and allows us to investigate systematically all possible putative transcripts. Web interface allows users to display the splicing graphs, to interactively assemble transcripts and to access their sequences as well as neighboring genomic regions. ASG also provide for each gene, an exhaustive pre-computed catalog of putative transcriptsin total more than 1.2 million sequences. It has found that ~65 of the investigated genes show evidence for alternative splicing, and in 5 of the cases, a single gene might produce over 100 transcripts.
Proper citation: Alternate splicing gallery (RRID:SCR_008129) Copy
http://kinasedb.ontology.ims.u-tokyo.ac.jp
THIS RESOURCE IS NO LONGER IN SERVICE, documented August 23, 2016. KinasePathwayDatabase is an integrated database concerning completed sequenced major eukaryotes, which contains the classification of protein kinases and their functional conservation and orthologous tables among species, protein-protein interaction data, domain information, structural information, and automatic pathway graph image interface. The protein-protein interactions are extracted by natural language processing (NLP) from abstracts using basic word pattern and protein name dictionary GENA: developed by our group. In this system, pathways are easily compared among species using protein interactions data more than 47,000 and orthologous tables.
Proper citation: Kinase Pathway Database (RRID:SCR_008199) Copy
http://www.roselab.jhu.edu/coil/
The Protein Coil Library is a library of protein structure fragments derived from the Protein Data Bank (PDB). The fragments in this library are those fragments in the PDB that cannot be classified as either alpha-helix or beta-strand. Three-dimensional structures as well as side-chain and backbone torsion angles are stored in the database. The Protein Coil Library allows rapid and comprehensive access to non-alpha-helix and non-beta-strand fragments contained in the Protein Data Bank (PDB). The library contains both sequence and structure information together with calculated torsion angles for both the backbone and side chains. Several search options are implemented, including a query function that uses output from popular PDB-culling servers directly. Additionally, several popular searches are stored and updated for immediate access. The library is a useful tool for exploring conformational propensities, turn motifs, and a recent model of the unfolded state. The library stores the complete torsion angle descriptions for the fragments as well as the three dimensional structures of the fragments themselves. The goal of extracting and pre-calculating this data is to allow for more straightforward investigation of peptide structure without the background of secondary structure elements. In addition to searching by PDB ID, it is possible to download a particular size class, perform a batch search of PDB/chain ID''s, or download precompiled lists of PDB ID''s of interest (PDB Select, etc.). For users interested in browsing the entire database at once or maintaining their own locally-updated copy of the library, FTP access instructions are also provided. The files stored in the coil library FTP site or returned after a batch search are organized heirarchically by PDB ID. This is done to reduce filesystem access times and fascilitate searches using the UNIX find utility. At the lowest directory level in the heirarchy, files are further sorted by fragment length. As a result, the number of files in a particular directory is generally less then 50, yielding relatively fast access on UNIX/Linux filesystems. The heirarchical organization is based on the middle two letters of the PDB ID. For example, hen egg lysozyme, which has a PDB ID of 1HEL, will be located in the directory h/he/. At the final level, fragments of varying sizes are stored in directories that correspond to their fragment length. Again, using lysozyme as an example, any seven-residue fragments, if they exist, will reside in the directory h/he/7/. Similarly, seven-residue fragments from 2HEX and 1HE0 will also be in this location. Sponsors: The Protein Coil Library is funded by Johns Hopkins University.
Proper citation: The Protein Coil Library (RRID:SCR_008233) Copy
http://pbil.univ-lyon1.fr/acuts/ACUTS.html
THIS RESOURCE IS NO LONGER IN SERVICE, Documented on August 12, 2014. Database that identifies new regulatory elements in untranslated regions of protein-coding genes (5 prime flanks, 5 prime UTRs, introns, 3 prime UTRs and 3 prime flanks). The analyses is focused on genes from metazoan species (essentially vertebrates, insects and nematodes). Information on highly conserved regions (sequences, alignments, annotations, bibliographic references) are compiled. Currently 176 out of 326 detected highly conserved regions (HCRs) have been analyzed and incorporated in the database. You can also access the list of annotated conserved elements and the list of conserved elements that remain to be processed. Their approach is based on comparative sequence analysis, for the identification of phylogenetic footprints.
Proper citation: Ancient conserved untranslated sequences (RRID:SCR_008130) Copy
http://animal.dna.affrc.go.jp/agp/index.html
Database of comparative gene mapping between species to assist the mapping of the genes related to phenotypic traits in livestock. The linkage maps, cytogenetic maps, polymerase chain reaction primers of pig, cattle, mouse and human, and their references have been included in the database, and the correspondence among species have been stipulated in the database. AGP is an animal genome database developed on a Unix workstation and maintained by a relational database management system. It is a joint project of National Institute of Agrobiological Sciences (NIAS) and Institute of the Society for Techno-innovation of Agriculture, Forestry and Fisheries (STAFF-Institute), under cooperation with other related research institutes. AGP also contains the Pig Expression Data Explorer (PEDE), a database of porcine EST collections derived from full-length cDNA libraries and full-length sequences of the cDNA clones picked from the EST collection. The EST sequences have been clustered and assembled, and their similarity to sequences in RefSeq, and UniGene determined. The PEDE database system was constructed to store sequences and similarity data of swine full-length cDNA libraries and to make them available to users. It provides interfaces for keyword and ID searches of BLAST results and enables users to obtain sequence data and names of clones of interest. Putative SNPs in EST assemblies have been classified according to breed specificity and their effect on coding amino acids, and the assemblies are equipped with an SNP search interface. The database contains porcine nucleotide sequences and cDNA clones that are ready for analyses such as expression in mammalian cells, because of their high likelihood of containing full-length CDS. PEDE will be useful for researchers who want to explore genes that may be responsible for traits such as disease susceptibility. The database also offers information regarding major and minor porcine-specific antigens, which might be investigated in regard to the use of pigs as models in various medical research applications.
Proper citation: Animal Genome Database (RRID:SCR_008165) Copy
THIS RESOURCE IS NO LONGER IN SERVICE, documented on August 20,2019.The COG-database has become a powerful tool in the field of comparative genomics. The construction of this data-base is based on sequence homologies of proteins from different completely sequenced genomes. Highly homologous proteins are assigned to clusters of orthologous groups. The updated collection of orthologous protein sets for prokaryotes and eukaryotes is expected to be a useful platform for functional annotation of newly sequenced genomes, including those of complex eukaryotes, and genome-wide evolutionary studies. The availability of multiple, essentially complete genome sequences of prokaryotes and eukaryotes spurred both the demand and the opportunity for the construction of an evolutionary classification of genes from these genomes. Such a classification system based on orthologous relationships between genes appears to be a natural framework for comparative genomics and should facilitate both functional annotation of genomes and large-scale evolutionary studies. Here is a major update of the previously developed system for delineation of Clusters of Orthologous Groups of proteins (COGs) from the sequenced genomes of prokaryotes and unicellular eukaryotes and the construction of clusters of predicted orthologs for 7 eukaryotic genomes, which we named KOGs after eukaryotic orthologous groups. The COG collection currently consists of 138,458 proteins, which form 4873 COGs and comprise 75% of the 185,505 (predicted) proteins encoded in 66 genomes of unicellular organisms. The eukaryotic orthologous groups (KOGs) include proteins from 7 eukaryotic genomes: three animals (the nematode Caenorhabditis elegans, the fruit fly Drosophila melanogaster and Homo sapiens), one plant, Arabidopsis thaliana, two fungi (Saccharomyces cerevisiae and Schizosaccharomyces pombe), and the intracellular microsporidian parasite Encephalitozoon cuniculi. The current KOG set consists of 4852 clusters of orthologs, which include 59,838 proteins, or approximately 54% of the analyzed eukaryotic 110,655 gene products. Compared to the coverage of the prokaryotic genomes with COGs, a considerably smaller fraction of eukaryotic genes could be included into the KOGs; addition of new eukaryotic genomes is expected to result in substantial increase in the coverage of eukaryotic genomes with KOGs. Examination of the phyletic patterns of KOGs reveals a conserved core represented in all analyzed species and consisting of approximately 20% of the KOG set. This conserved portion of the KOG set is much greater than the ubiquitous portion of the COG set (approximately 1% of the COGs). In part, this difference is probably due to the small number of included eukaryotic genomes, but it could also reflect the relative compactness of eukaryotes as a clade and the greater evolutionary stability of eukaryotic genomes.
Proper citation: Phylogenetic Clusters of Orthologous Groups Ranking (RRID:SCR_008223) Copy
http://www.nisc.nih.gov/projects/comp_seq.html
Generates data for use in developing and refining computational tools for comparing genomic sequence from multiple species. The NISC Comparative Sequencing Program's goal is to establish a data resource consisting of sequences for the same set of targeted genomic regions derived from multiple animal species. The broader program includes plans for a diverse set of analytical studies using the generated sequence and the publication of a series of papers describing the results of those analysis in peer-reviewed journals in a timely fashion. Experimentally, this project involves the shotgun sequencing of mapped BAC clones. For each BAC, an assembly is first performed when a sufficient number of sequence reads have been generated to provide full shotgun coverage of the clone. At that time, the assembled sequence is submitted to the HTGS division of GenBank. Subsequent refinements of the sequence, including the generation of higher-accuracy finished sequence, results in the updating of the sequence record in GenBank. By immediately submitting our BAC-derived sequences to GenBank, it makes their data available as a public service to allow colleagues to speed up their research, consistent with the now well-established routine of sequencing centers participating in the Human Genome Project. However, at the same time, it has made considerable investment in acquiring these mapping and sequence data, including sizable efforts of graduate students, postdoctoral fellows, and other trainees. Furthermore, in most cases, large data sets involving multiple BAC sequences from multiple species must first be generated, often taking many months to accumulate, before the planned analysis can be performed and the resulting papers written and submitted for publication.
Proper citation: Comparative Vertebrate Sequencing (RRID:SCR_008213) Copy
http://genome.jgi.doe.gov/programs/metagenomes/index.jsf
Portal providing access to metagenomics projects, data and tools supported by the DOE Joint Genome Institute (JGI). A primary motivation for metagenomics is that most microbes found in nature exist in complex, interdependent communities and cannot readily be grown in isolation in the laboratory. One can, however, isolate DNA or RNA from the community as a whole, and studies of such communities have revealed a diversity of microbes far beyond those found in culture collections. It is suspected that these uncultivated organisms must harbor considerable as-yet undiscovered genomic, functional, and metabolic features and capabilities. Thus to fully explore microbial genomics, it is imperative that we access the genomes of these elusive players.
Proper citation: Metagenomics Program at JGI (RRID:SCR_008804) Copy
http://lemur.amu.edu.pl/share/php/mirnest/home.php
A database of animal, plant and virus microRNA data maintained at the University of Poznan. The database provides: * 9980 miRNA candiates from 420 animal and plant species predicted in Expressed Sequence Tags * predicted targets for plant candidates * RNA-seq reads mapped to candidates from 29 species * external data from 12 databases that includes sequences, polymorphism, expression and regulation. miRNEST 1.0, it contains miRNA from 563 animals, plants and viruses plant species.
Proper citation: miRNEST (RRID:SCR_008907) Copy
http://cgap.nci.nih.gov/Chromosomes/Mitelman
The web site includes genomic data for humans and mice, including transcript sequence, gene expression patterns, single-nucleotide polymorphisms, clone resources, and cytogenetic information. Descriptions of the methods and reagents used in deriving the CGAP datasets are also provided. An extensive suite of informatics tools facilitates queries and analysis of the CGAP data by the community. One of the newest features of the CGAP web site is an electronic version of the Mitelman Database of Chromosome Aberrations in Cancer. The data in the Mitelman Database is manually culled from the literature and subsequently organized into three distinct sub-databases, as follows: -The sub-database of cases contains the data that relates chromosomal aberrations to specific tumor characteristics in individual patient cases. It can be searched using either the Cases Quick Searcher or the Cases Full Searcher. -The sub-database of molecular biology and clinical associations contains no data from individual patient cases. Instead, the data is pulled from studies with distinct information about: -Molecular biology associations that relate chromosomal aberrations and tumor histologies to genomic sequence data, typically genes rearranged as a consequence of structural chromosome changes. -Clinical associations that relate chromosomal aberrations and/or gene rearrangements and tumor histologies to clinical variables, such as prognosis, tumor grade, and patient characteristics. It can be searched using the Molecular Biology and Clinical (MBC) Associations Searcher -The reference sub-database contains all the references culled from the literature i.e., the sum of the references from the cases and the molecular biology and clinical associations. It can be searched using the Reference Searcher. CGAP has developed six web search tools to help you analyze the information within the Mitelman Database: -The Cases Quick Searcher allows you to query the individual patient cases using the four major fields: aberration, breakpoint, morphology, and topography. -The Cases Full Searcher permits a more detailed search of the same individual patient cases as above, by including more cytogenetic field choices and adding search fields for patient characteristics and references. -The Molecular Biology Associations Searcher does not search any of the individual patient cases. It searches studies pertaining to gene rearrangements as a consequence of cytogenetic aberrations. -The Clinical Associations Searcher does not search any of the individual patient cases. It searches studies pertaining to clinical associations of cytogenetic aberrations and/or gene rearrangements. -The Recurrent Chromosome Aberrations Searcher provides a way to search for structural and numerical abnormalities that are recurrent, i.e., present in two or more cases with the same morphology and topography. -The Reference Searcher queries only the references themselves, i.e., the references from the individual cases and the molecular biology and clinical associations. Sponsors: This database is sponsored by the University of Lund, Sweden and have support from the Swedish Cancer Society and the Swedish Children''s Cancer Foundation
Proper citation: Mitelman Database of Chromosome Aberrations in Cancer (RRID:SCR_012877) Copy
http://www.informatics.jax.org/
Community model organism database for laboratory mouse and authoritative source for phenotype and functional annotations of mouse genes. MGD includes complete catalog of mouse genes and genome features with integrated access to genetic, genomic and phenotypic information, all serving to further the use of the mouse as a model system for studying human biology and disease. MGD is a major component of the Mouse Genome Informatics.Contains standardized descriptions of mouse phenotypes, associations between mouse models and human genetic diseases, extensive integration of DNA and protein sequence data, normalized representation of genome and genome variant information. Data are obtained and integrated via manual curation of the biomedical literature, direct contributions from individual investigators and downloads from major informatics resource centers. MGD collaborates with the bioinformatics community on the development and use of biomedical ontologies such as the Gene Ontology (GO) and the Mammalian Phenotype (MP) Ontology.
Proper citation: Mouse Genome Database (RRID:SCR_012953) Copy
THIS RESOURCE IS NO LONGER IN SERVICE.Documented on April 14,2022. Database of comprehensive information on the approximately 600 prokaryote species that are present in the human oral cavity. The majority of these species are uncultivated and unnamed, recognized primarily by their 16S rRNA sequences. The HOMD presents a provisional naming scheme for the currently unnamed species so that strain, clone, and probe data from any laboratory can be directly linked to a stably named reference entity. The HOMD links sequence data with phenotypic, phylogenetic, clinical, and bibliographic information. Full and partial oral bacterial genome sequences determined as part of this project and the Human Microbiome Project, are being added to the HOMD as they become available. HOMD offers easy to use tools for viewing all publicly available oral bacterial genomes. Data is also downloadable.
Proper citation: HOMD (RRID:SCR_012770) Copy
SYFPEITHI is a database comprising more than 7000 peptide sequences known to bind class I and class II MHC molecules. The entries are compiled from published reports only. It contains a collection of MHC class I and class II ligands and peptide motifs of humans and other species, such as apes, cattle, chicken, and mouse, for example, and is continuously updated. Searches for MHC alleles, MHC motifs, natural ligands, T-cell epitopes, source proteins/organisms and references are possible. Hyperlinks to the EMBL and PubMed databases are included. In addition, ligand predictions are available for a number of MHC allelic products. The database is based on previous publications on T-cell epitopes and MHC ligands. It contains information on: -Peptide sequences -anchor positions -MHC specificity -source proteins, source organisms -publication references Since the number of motifs continuously increases, it was necessary to set up a database which facilitates the search for peptides and allows the prediction of T-cell epitopes. The prediction is based on published motifs (pool sequencing, natural ligands) and takes into consideration the amino acids in the anchor and auxiliary anchor positions, as well as other frequent amino acids. The score is calculated according to the following rules: The amino acids of a certain peptide are given a specific value depending on whether they are anchor, auxiliary anchor or preferred residue. Ideal anchors will be given 10 points, unusual anchors 6-8 points, auxiliary anchors 4-6 and preferred residues 1-4 points. Amino acids that are regarded as having a negative effect on the binding ability are given values between -1 and -3. Sponsors: SYFPEITHI is supported by DFG-Sonderforschungsbereich 685 and theEuropean Union: EU BIOMED CT95-1627, BIOTECH CT95-0263, and EU QLQ-CT-1999-00713.
Proper citation: SYFPEITHI: A Database for MHC Ligands and Peptide Motifs (RRID:SCR_013182) Copy
http://xin.cz3.nus.edu.sg/group/trmp/trmp.asp
The Therapeutically Relevant Multiple Pathways Database is designed to provide information about such multiple pathways and related therapeutic targets described in the literatures, the targeted disease conditions, and the corresponding drugs/ligands directed at each of these targets. This database currently contains 11 entries of multiple pathways, 97 entries of individual pathways, 120 targets covering 72 disease conditions along with 120 sets of drugs directed at each of these targets. Each entry can be retrieved through multiple methods including multiple pathway name, individual pathway name and disease name. Additional information provided include protein name, synonyms, Swissprot AC number, species, gene name and location, protein sequence (AASEQ) and gene sequence (NTSEQ) as well as potential therapeutic implications while applicable. Cross-links to other databases are provided which include Genecard, GDB, Locuslink, NCBI, KEGG, OMIM, SwissProt to facilitate the access of more detailed information about various aspects of the particular target or non-target protein. Queries can be submitted by entering or selecting the required information in any one or combination of the fields in the form. User can specify full name or any part of the name in a text field, or choose one item from an selection field. Sponsors: TRMP is supported by the National University of Singapore.
Proper citation: Therapeutically Relevant Multiple Pathways Database (RRID:SCR_013471) Copy
https://hive.biochemistry.gwu.edu/dna.cgi?cmd=tissue_codon_usage&id=586358&mode=cocoputs
Database includes genomic codon-pair and dinucleotide statistics of all organisms with sequenced genome. Facilitates genetic variation analyses and recombinant gene design. Derived from all available GenBank and RefSeq data.
Proper citation: Codon and Codon-Pair Usage Tables (RRID:SCR_018504) Copy
Collection of manually curated data regarding structure and antimicrobial activity of natural and synthetic peptides. Provides the information and analytical resources to develop antimicrobial compounds with the high therapeutic index.
Proper citation: Database of Antimicrobial Activity and Structure of Peptides (RRID:SCR_016600) Copy
The ArkDB database system aims to provide a comprehensive public repository for genome mapping data from farmed and other animal Species. The system also aims to provide a route in to genomic and other sequence from the initial viewpoint of linkage mapping, RH mapping, physical mapping or - possibly more importantly - QTL mapping data. Sponsors: ArkDB is funded by Biotechnology and Biological Sciences Research Council (BBSRC), UK. Cat, Chicken, Cow, Deer, Duck, Horse, Pig, Quail, Salmon, Sea Bass, Sheep, Turkey, QLT map, Linkage map, RH map, Farm animal, Genome map, Sequence, Mapping
Proper citation: ArkDB - Genomes For The Rest of Us (RRID:SCR_001838) Copy
https://cell-innovation.nig.ac.jp/GNP/index_e.html
THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 23,2022. Integrated database of experiment data generated by participating research institutes and public databases relating to: 1) transcription starting position of human genes in the human genome, 2) conjunction to control region on transcriptional factors and the human genome 3) protein-protein interaction with a central focus on transcription factors organized for use in genome level research. Gene Search is the function to search the integrated database by using keywords and public IDs. The search results can be visualized by: * Genome Explorer : provides annotation of landmarks (genes, transcription start sites, etc.) aligned in accordance with their genome locations. * PPI Network : provides a graphical view of protein-protein interaction (PPI) network from the experimental data generated under the project and the public datasets. * Expression Profile : clusters genes by expression pattern and display the result with heatmap. The function provides genes which have relation of coregulation and anti-coregulation. * Comparison Viewer : This function gives the view to compare the genomic regions between human and mouse homologous genes. The viewer shows the distribution of transcription start sites (TSS) as the way of separable by tissues or time points with other landmarks on genome region. * Gene Stock : This is the function to save the gene list that you are interested until the session is closed.
Proper citation: Genome Network Platform (RRID:SCR_001737) Copy
http://services.bio.ifi.lmu.de:1046/AutoPSIDB/
Searchable database for predicted protein sequences and structures. It has the ability to search through PDB ID, UniProt ID, and descriptive classifiers.
Proper citation: AutoPSI database of predicted SCOP classifications (RRID:SCR_001923) Copy
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the RRID Resources search. From here you can search through a compilation of resources used by RRID and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that RRID has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on RRID then you can log in from here to get additional features in RRID such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into RRID you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the sources that were queried against in your search that you can investigate further.
Here are the categories present within RRID that you can filter your data on
Here are the subcategories present within this category that you can filter your data on
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.