Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
http://chromium.lovd.nl/LOVD2/home.php?select_db=CDKN2A
THIS RESOURCE IS NO LONGER IN SERVICE, documented August 23, 2016. The CDKN2A Database presents the germline and somatic variants of the CDKN2A tumor suppressor gene recorded in human disease through June 2003, annotated with evolutionary, structural, and functional information, in a format that allows the user to either download it or manipulate it for their purposes online. The goal is to provide a database that can be used as a resource by researchers and geneticists and that aids in the interpretation of CDKN2A missense variants. Most online mutation databases present flat files that cannot be manipulated, are often incomplete, and have varying degrees of annotation that may or may not help to interpret the data. They hope to use CDKN2A as a prototype for integrating computational and laboratory data to help interpret variants in other cancer-related genes and other single nucleotide polymorphisms (SNPs) found throughout the genome. Another goal of the lab is to interpret the functional and disease significance of missense variants in cancer susceptibility genes. Eventually, these results will be relevant to the interpretation of single nucleotide polymorphisms (SNPs) in general. The CDKN2A locus is a valuable model for assessing relationships among variation, structure, function, and disease because: Variants of this gene are associated with hereditary cancer: Familial Melanoma (and related syndromes); somatic alterations play a role in carcinogenesis; allelic variants occur whose functional consequences are unknown; reliable functional assays exist; and crystal structure is known. All variants in the database are recorded according to the nomenclature guidelines as outlined by the Human Genome Variation Society. This database is currently designed for research purposes only and is not yet recommended as a clinical resource. Many of the mutations reported here have not been tested for disease association and may represent normal, non-disease causing polymorphisms.
Proper citation: CDKN2A Database (RRID:SCR_008179) Copy
http://jbirc.jbic.or.jp/hinv/ppi/
The PPI view displays H-InvDB human protein-protein interaction (PPI) information. It is constructed by assigning interaction data to H-InvDB proteins which were originally predicted from transcriptional products generated by the H-Invitational project. The PPI view is now providing 32,198 human PPIs comprised of 9,268 H-InvDB proteins. H-Invitational Database (H-InvDB) is an integrated database of human genes and transcripts. By extensive analyses of all human transcripts, we provide curated annotations of human genes and transcripts that include gene structures, alternative splicing isoforms, non-coding functional RNAs, protein functions, functional domains, sub-cellular localizations, metabolic pathways, protein 3D structure, genetic polymorphisms (SNPs, indels and microsatellite repeats) , relation with diseases, gene expression profiling, molecular evolutionary features, protein-protein interactions (PPIs) and gene families/groups. Sponsors: This research is financially supported by the Ministry of Economy, Trade and Industry of Japan (METI), the Ministry of Education, Culture, Sports, Science and Technology of Japan (MEXT) and the Japan Biological Informatics Consortium (JBIC). Also, this work is partly supported by the Research Grant for the RIKEN Genome Exploration Research Project from MEXT to Y.H. and the Grant for the RIKEN Frontier Research System, Functional RNA research program.
Proper citation: H-Invitational Database: Protein-Protein Interaction Viewer (RRID:SCR_008054) Copy
http://amazonia.montp.inserm.fr/
A web interface and associated tools for easy query of public human transcriptome data by keyword, through thematic pages with list annotations. Amazonia provides a thematic entry to public transcriptomes: users may for instance query a gene on a Stem Cells page, where they will see the expression of their favorite gene across selected microarray experiments related to stem cell biology. This selection of samples can be customized at will among the 6331 samples currently present in the database. Every transcriptome study results in the identification of lists of genes relevant to a given biological condition. In order to include this valuable information in any new query in the Amazonia database, they indicate for each gene in which lists it is included. This is a straightforward and efficient way to synthesize hundreds of microarray publications., THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025.
Proper citation: AmaZonia: Explore the Jungle of Microarrays Results (RRID:SCR_008405) Copy
http://www.ebi.ac.uk/asd/altextron/indexhtml
THIS RESOURCE IS NO LONGER IN SERVICE. A computer generated high quality dataset of human transcript-confirmed constitutive and alternative exons and introns. The alternative events have been delineated and annotated with various characterizations. AltExtron is the prototype database for the production version AltSplice. AltExtron is more geared towards investigating various aspects of the methodologies used, and focuses in general on the biology behind alternative splicing. The complete data used in this work is available for downloading in several flat files, containing human genes, introns, exons, isoform events, human-mouse comparisons, and additional information on GC-AG introns. Two versions of AltExtron data are available - one as prototype (for human) and another as latest build (for human, drosophila, mouse, and others) based on EMBL/GenBank (Feb 2003).
Proper citation: AltExtron Database (RRID:SCR_008404) Copy
THIS RESOURCE IS NO LONGER IN SERVICE, it has been replaced by Monarch Initiative. LAMHDI, the initiative to Link Animal Models to Human DIsease, is designed to accelerate the research process by providing biomedical researchers with a simple, comprehensive Web-based resource to find the best animal model for their research. LAMDHI is a free, Web-based, resource to help researchers bridge the gap between bench testing and human trials. It provides a free, unbiased resource that enables scientists to quickly find the best animal models for their research studies. LAMHDI includes mouse data from MGI, the Mouse Genome Informatics website; zebrafish data from ZFIN, the Zebrafish Model Organism Database; rat data from RGD, the Rat Genome Database; yeast data from SGD, the Saccharomyces Genome Database; and fly data from FlyBase. LAMHDI.org is operational today, and data is added regularly. Enhancements are planned to let researchers contribute their knowledge of the animal models available through LAMHDI. The LAMHDI goal is to allow researchers to share information about and access to animal models so they can refine research and testing, and reduce or replace the use of animal models where possible. LAMHDI Database Search: LAMHDI brings together scientifically validated information from various sources to create a composite multi-species database of animal models of human disease. To do this, the LAMHDI database is prepared from a variety of sources. The LAMHDI team takes publicly available data from OMIM, NCBI''s Entrez Gene database, Homologene, and WikiPathways, and builds a mathematical graph (think of it as a map or a web) that links these data together. OMIM is used to link human diseases with specific human genes, and Entrez provides universal identifiers for each of those genes. Human genes are linked to their counterpart genes in other species with Homologene, and those genes are linked to other genes tentatively or authoritatively using the data in WikiPathways. This preparatory work gives LAMHDI a web of human diseases linked to specific human genes, orthologous human genes, homologous genes in other species, and both human and non-human genes involved in specific metabolic pathways associated with those diseases. LAMHDI includes model data that partners provide directly from their data structures. For instance, MGI provides information about mouse models, including a disease for each model, as well as some genetic information (the ID of the model, in fact, identifies one or more genes). ZFIN provides genetic information for each zebrafish model, but no diseases, so zebrafish models are integrated by using the genes as the glue. For instance, a zebrafish model built to feature the zebrafish PKD2 gene would plug into the larger disease-gene map at the node representing the zebrafish PKD2 gene, which is connected to the node representing the human PKD2 gene, which in turn is connected to the node representing the human disease known as polycystic kidney disease. (Some of the partner data LAMHDI receives can even extend the base map. MGI provides a disease for every model, and in some cases this allows the creation of a disease-to-gene relationship in the LAMHDI database that might not already be documented in the OMIM dataset.) With curatorial and model information in hand, LAMHDI runs a lengthy automated process that exhaustively searches for every possible path between each model and each disease in the data, up to a set number of hops, producing for each disease-to-model pair a set of links from the disease to the model. The algorithm avoids circular paths and paths that include more than one disease anywhere in the middle of the path. At the end of this phase, LAMHDI has a comprehensive set of paths representing all the disease-to-model relationships in the data, varying in length from one hop to many hops. Each disease-to-model path is essentially a string of nodes in the data, where each node represents a disease, a gene, a linkage between genes (an orthologue, a homologue, or a pathway connection, referred to as a gene cluster or association), or a model. Each node has a human-friendly label, a set of terms and keywords, and - in most cases - a URL linking the node to the data source where it originated. When a researcher submits a search on the LAMHDI website, LAMHDI searches for the user''s search terms in its precomputed list of all known disease-to-model paths. It looks for the terms not only in the disease and model nodes, but also in every node along each path. The complete set of hits may include multiple paths between any given disease-to-model pair of endpoints. Each of these disease-to-model pair sets is ordered by the number of hops it involves, and the one involving the fewest hops is chosen to represent its respective disease-to-model pair in the search results presented to the user. Results are sorted by scores that represent their matches. The number of hops is one barometer of the strength of the evidence linking the model and the disease; fewer hops indicates the relationship is stronger, more hops indicates it may be weaker. This indicator works best for comparing models from a single partner dataset: MGI explicitly identifies a disease for each mouse model, so there can be disease-to-model hits for mice that involve just one hop. Because ZFIN does not explicitly identify a disease for each model, no zebrafish model will involve fewer than four hops to the nearest disease, from the zebrafish model to a zebrafish gene to a gene cluster to a human gene to a human disease.
Proper citation: LAMHDI: The Initiative to Link Animal Models to Human DIsease (RRID:SCR_008643) Copy
http://alizadehlab.stanford.edu/
This is an open-source Mouse Exonic Evidence-Based Oligonucleotide Chip (MEEBOChip), and are in the process of building the human counterpart, HEEBOChip. The set of 70mers for MEEBOChip is already available from Illumina, Inc., with synthesis of HEEBOChip 70mers in progress. Both arrays are based on a novel selection of exonic long-oligonucleotides (70-mers) from a genomic annotation of the corresponding complete genome sequences, using a transcriptome-based annotation of exon structure for each genomic locus. Using a combination of existing and custom-tailored tools and datasets (including millions of mRNA and EST sequences), we built and performed a systematic examination of transcript-supported exon structure for each genomic locus at the base-pair level (i.e., exonic evidence). This strategy allowed them to select both constitutive and in many cases alternative exons for nearly every gene in the corresponding genome (e.g., protocadherin locus), allowing an unprecedented exploration of human and mouse biology. Furthermore, they used experimentally derived data to hone the selection of these 70mers, helping maximize their performance under typical fluorescent labeling and hybridization conditions. Specifically, they applied and refined the ArrayOligoSelector algorithm from Joe DeRisis laboratory to select 70mers, considering not only their uniqueness (i.e., hybridization specificity) within the content of the entire genome, but also to overcome the known biases of labeling and hybridization methods (e.g., 3-biased reverse transcription and in vitro transcription reactions).
Proper citation: Alizadehlab: MeeboChip and HeeboChip Open Source Project (RRID:SCR_008384) Copy
http://www.molecularbrain.org/
MolecularBrain is an attempt to collect, collates, analyze and present the microarray derived gene expression data from various brain regions side by side. Transcription Profile of any gene in Mouse (online) and Human Brain (not yet) can be accessed as a histogram along with links to access various aspects of that gene. The expression levels were calculated from microarray data deposited at GEO (Gene expression omnibus). The molecular brain database could be searched using the built in search tool with the terms Entrez GeneID, gene symbol, synonym or description. Gene information along with their expression values can be also accessed from the alphabetical list of gene symbols on the footer. The protocol and GEO sample information is available.
Proper citation: Molecular Brain: Transcription Profiles of Mouse and Human Brains (RRID:SCR_008689) Copy
http://www.cmbi.ru.nl/GeneSeeker/
The GeneSeeker allows you to search across different databases simultaneously, given a known human genetic location and expression/phenotypic pattern. The GeneSeeker returns any found gene names which are located on the specified location and expressed in the specified tissue. To search for more expression location in one search, just enter them in the textbox for the expression location and separate them with logical operators (and, or, not). You can specify as many tissues as you want, the program starts 20 queries simultaneously, and then waits for a query to finish before starting another query, to keep server loads to a minimum. You can also search only for expression, just leave the cytogenetic location fields blank, and do the query. If you only want to look for one cytogenetic location, only fill in the first location field, and the GeneSeeker will search with only this one. Housekeeping genes , found in Swissprot can be excluded, or genes that are to be excluded can be specified. Human chromosome localizations are translated with an oxford-grid to mouse chromosome localizations, and then submitted to the Mgd. Sponsors: GeneSeeker is a service provided by the Centre for Molecular and Biomolecular Informatics (CMBI).
Proper citation: GeneSeeker (RRID:SCR_008347) Copy
http://mips.gsf.de/services/genomes/uwe25/
THIS RESOURCE IS NO LONGER IN SERVICE, documented on July 15, 2013. This is the official database of the environmental chlamydia genome project. This resource provides access to finished sequence for Parachlamydia-related symbiont UWE25 and to a wide range of manual annotations, automatical analyses and derived datasets. Functional classification and description has been manually annotated according to the Annotation guidelines. Chlamydiae are the major cause of preventable blindness and sexually transmitted disease. Genome analysis of a chlamydia-related symbiont of free-living amoebae revealed that it is twice as large as any of the pathogenic chlamydiae and had few signs of recent lateral gene acquisition. We showed that about 700 million years ago the last common ancestor of pathogenic and symbiotic chlamydiae was already adapted to intracellular survival in early eukaryotes and contained many virulence factors found in modern pathogenic chlamydiae, including a type III secretion system. Ancient chlamydiae appear to be the originators of mechanisms for the exploitation of eukaryotic cells. Environmental chlamydiae have recently been recognized as obligate endosymbionts of free-living amoebae and have been implicated as potential human pathogens. Environmental chlamydiae form a deep branching evolutionary lineage within the medically important order Chlamydiales. Despite their high diversity and ubiquitous distribution in clinical and environmental samples only limited information about genetics and ecology of these microorganisms is available. The Parachlamydia-related Acanthamoeba symbiont UWE25 was therefore selected as representative environmental chlamydia strain for whole genome sequencing. Comparative genome analysis was performed using PEDANT and simap. Sponsors: The environmental chlamydia genome project was funded by the bmb+f (German Federal Ministry of Education and Research) and is part of the Competence Network PathoGenoMiK.
Proper citation: Protochlamydia amoebophila UWE25 (RRID:SCR_008222) Copy
http://compbio.uthsc.edu/miRSNP/
Database of naturally occurring DNA variations in microRNA (miRNA) seed regions and miRNA target sites. MicroRNAs pair to the transcripts of protein-coding genes and cause translational repression or mRNA destabilization. SNPs and INDELs in miRNAs and their target sites may affect miRNA-mRNA interaction, and hence affect miRNA-mediated gene repression. The PolymiRTS database was created by scanning 3'UTRs of mRNAs in human and mouse for SNPs and INDELs in miRNA target sites. Then, the potential downstream effects of these polymorphisms on gene expression and higher-order phenotypes are identified. Specifically, genes containing PolymiRTSs, cis-acting expression QTLs, and physiological QTLs in mouse and the results of genome-wide association studies (GWAS) of human traits and diseases are linked in the database. The PolymiRTS database also includes polymorphisms in target sites that have been supported by a variety of experimental methods and polymorphisms in miRNA seed regions.
Proper citation: PolymiRTS (RRID:SCR_003389) Copy
THIS RESOURCE IS NO LONGER IN SERVICE, documented May 10, 2017. A pilot effort that has developed a centralized, web-based biospecimen locator that presents biospecimens collected and stored at participating Arizona hospitals and biospecimen banks, which are available for acquisition and use by researchers. Researchers may use this site to browse, search and request biospecimens to use in qualified studies. The development of the ABL was guided by the Arizona Biospecimen Consortium (ABC), a consortium of hospitals and medical centers in the Phoenix area, and is now being piloted by this Consortium under the direction of ABRC. You may browse by type (cells, fluid, molecular, tissue) or disease. Common data elements decided by the ABC Standards Committee, based on data elements on the National Cancer Institute''s (NCI''s) Common Biorepository Model (CBM), are displayed. These describe the minimum set of data elements that the NCI determined were most important for a researcher to see about a biospecimen. The ABL currently does not display information on whether or not clinical data is available to accompany the biospecimens. However, a requester has the ability to solicit clinical data in the request. Once a request is approved, the biospecimen provider will contact the requester to discuss the request (and the requester''s questions) before finalizing the invoice and shipment. The ABL is available to the public to browse. In order to request biospecimens from the ABL, the researcher will be required to submit the requested required information. Upon submission of the information, shipment of the requested biospecimen(s) will be dependent on the scientific and institutional review approval. Account required. Registration is open to everyone., documented September 2, 2016. Database for defining official rat gene symbols. It includes rat gene symbols from three major sources: the Rat Genome Database (RGD), Ensembl, and NCBI-Gene. All rat symbols are compared with official symbols from orthologous human genes as specified by the Human Gene Nomenclature Committee (HGNC). Based on the outcome of the comparisons, a rat gene symbol may be selected. Rat symbols that do not match a human ortholog undergo a strict procedure of comparisons between the different rat gene sources as well as with the Mouse Genome Database (MGD). For each rat gene this procedure results in an unambiguous gene designation. The designation is presented as a status level that accompanies every rat gene symbol suggested in the database. The status level describes both how a rat symbol was selected, and its validity. Rat Gene Symbol Tracker approves rat gene symbols by an automatic procedure. The rat genes are presented with links to RGD, Ensembl, NCBI Gene, MGI and HGNC. RGST ensures that each acclaimed rat gene symbol is unique and follows the guidelines given by the RGNC. To each symbol a status level associated, describing the gene naming process.
Proper citation: Rat Gene Symbol Tracker (RRID:SCR_003261) Copy
http://bioinfo.mbi.ucla.edu/ASAP/
THIS RESOURCE IS NO LONGER IN SERVICE, documented on 8/12/13. Database to access and mine alternative splicing information coming from genomics and proteomics based on genome-wide analyses of alternative splicing in human (30 793 alternative splice relationships found) from detailed alignment of expressed sequences onto the genomic sequence. ASAP provides precise gene exon-intron structure, alternative splicing, tissue specificity of alternative splice forms, and protein isoform sequences resulting from alternative splicing. They developed an automated method for discovering human tissue-specific regulation of alternative splicing through a genome-wide analysis of expressed sequence tags (ESTs), which involves classifying human EST libraries according to tissue categories and Bayesian statistical analysis. They use the UniGene clusters of human Expressed Sequence Tags (ESTs) to identify splices. The UniGene EST's are clustered so that a single cluster roughly corresponds to a gene (or at least a part of a gene). A single EST represents a portion of a processed (already spliced) mRNA. A given cluster contains many ESTs, each representing an outcome of a series of splicing events. The ESTs in UniGene contain the different mRNA isoforms transcribed from an alternatively spliced gene. They are not predicting alternative splicing, but locating it based on EST analysis. The discovered splices are further analyzed to determine alternative splicing events. They have identified 6201 alternative splice relationships in human genes, through a genome-wide analysis of expressed sequence tags (ESTs). Starting with 2.1 million human mRNA and EST sequences, they mapped expressed sequences onto the draft human genome sequence and only accepted splices that obeyed the standard splice site consensus. After constructing a tissue list of 46 human tissues with 2 million human ESTs, they generated a database of novel human alternative splices that is four times larger than our previous report, and used Bayesian statistics to compare the relative abundance of every pair of alternative splices in these tissues. Using several statistical criteria for tissue specificity, they have identified 667 tissue-specific alternative splicing relationships and analyzed their distribution in human tissues. They have validated our results by comparison with independent studies. This genome-wide analysis of tissue specificity of alternative splicing will provide a useful resource to study the tissue-specific functions of transcripts and the association of tissue-specific variants with human diseases.
Proper citation: ASAP: the Alternative Splicing Annotation Project (RRID:SCR_003415) Copy
http://www.hgsc.bcm.tmc.edu/content/hapmap-3-and-encode-3
Draft release 3 for genome-wide SNP genotyping and targeted sequencing in DNA samples from a variety of human populations (sometimes referred to as the HapMap 3 samples). This release contains the following data: * SNP genotype data generated from 1184 samples, collected using two platforms: the Illumina Human1M (by the Wellcome Trust Sanger Institute) and the Affymetrix SNP 6.0 (by the Broad Institute). Data from the two platforms have been merged for this release. * PCR-based resequencing data (by Baylor College of Medicine Human Genome Sequencing Center) across ten 100-kb regions (collectively referred to as ENCODE 3) in 712 samples. Since this is a draft release, please check this site regularly for updates and new releases. The HapMap 3 sample collection comprises 1,301 samples (including the original 270 samples used in Phase I and II of the International HapMap Project) from 11 populations, listed below alphabetically by their 3-letter labels. Five of the ten ENCODE 3 regions overlap with the HapMap-ENCODE regions; the other five are regions selected at random from the ENCODE target regions (excluding the 10 HapMap-ENCODE regions). All ENCODE 3 regions are 100-kb in size, and are centered within each respective ENCODE region. The HapMap 3 and ENCORE 3 data are downloadable from the ftp site.
Proper citation: HapMap 3 and ENCODE 3 (RRID:SCR_004563) Copy
Database for identifying orthologous phenotypes (phenologs). Mapping between genotype and phenotype is often non-obvious, complicating prediction of genes underlying specific phenotypes. This problem can be addressed through comparative analyses of phenotypes. We define phenologs based upon overlapping sets of orthologous genes associated with each phenotype. Comparisons of >189,000 human, mouse, yeast, and worm gene-phenotype associations reveal many significant phenologs, including novel non-obvious human disease models. For example, phenologs suggest a yeast model for mammalian angiogenesis defects and an invertebrate model for vertebrate neural tube birth defects. Phenologs thus create a rich framework for comparing mutational phenotypes, identify adaptive reuse of gene systems, and suggest new disease genes. To search for phenologs, go to the basic search page and enter a list of genes in the box provided, using Entrez gene identifiers for mouse/human genes, locus ids for yeast (e.g., YHR200W), or sequence names for worm (e.g., B0205.3). It is expected that this list of genes will all be associated with a particular system, trait, mutational phenotype, or disease. The search will return all identified model organism/human mutational phenotypes that show any overlap with the input set of the genes, ranked according to their hypergeometric probability scores. Clicking on a particular phenolog will result in a list of genes associated with the phenotype, from which potential new candidate genes can identified. Currently known phenotypes in the database are available from the link labeled ''Find phenotypes'', where the associated gene can be submitted as queries, or alternately, can be searched directly from the link provided.
Proper citation: Phenologs (RRID:SCR_005529) Copy
A publicly available database of Transposed elements (TEs) which are located within protein-coding genes of 7 organisms: human, mouse, chicken, zebrafish, fruilt fly, nematode and sea squirt. Using TranspoGene the user can learn about the many aspects of the effect these TEs have on their hosting genes, such as: exonization events (including alternative splicing-related data), insertion of TEs into introns, exons, and promoters, specific location of the TE over the gene, evolutionary divergence of the TE from its consensus sequence and involvement in diseases. TranspoGene database is quickly searchable through its website, enables many kinds of searches and is available for download. TranspoGene contains information regarding specific type and family of the TEs, genomic and mRNA location, sequence, supporting transcript accession and alignment to the TE consensus sequence. The database also contains host gene specific data: gene name, genomic location, Swiss-Prot and RefSeq accessions, diseases associated with the gene and splicing pattern. The TranspoGene and microTranspoGene databases can be used by researchers interested in the effect of TE insertion on the eukaryotic transcriptome.
Proper citation: TranspoGene (RRID:SCR_005634) Copy
http://www.hpppi.iicb.res.in/btox/
Database of Bacterial ExoToxins for Human is a database of sequences, structures, interaction networks and analytical results for 229 exotoxins, from 26 different human pathogenic bacterial genus. All toxins are classified into 24 different Toxin classes. The aim of DBETH is to provide a comprehensive database for human pathogenic bacterial exotoxins. DBETH also provides a platform to its users to identify potential exotoxin like sequences through Homology based as well as Non-homology based methods. In homology based approach the users can identify potential exotoxin like sequences either running BLASTp against the toxin sequences or by running HMMER against toxin domains identified by DBETH from human pathogenic bacterial exotoxins. In Non-homology based part DBETH uses a machine learning approach to identify potential exotoxins (Toxin Prediction by Support Vector Machine based approach).
Proper citation: DBETH - Database for Bacterial ExoToxins for Humans (RRID:SCR_005908) Copy
The Kabat Database determines the combining site of antibodies based on the available amino acid sequences. The precise delineation of complementarity determining regions (CDR) of both light and heavy chains provides the first example of how properly aligned sequences can be used to derive structural and functional information of biological macromolecules. The Kabat database now includes nucleotide sequences, sequences of T cell receptors for antigens (TCR), major histocompatibility complex (MHC) class I and II molecules, and other proteins of immunological interest. The Kabat Database searching and analysis tools package is an ASP.NET web-based portal containing lookup tools, sequence matching tools, alignment tools, length distribution tools, positional correlation tools and much more. The searching and analysis tools are custom made for the aligned data sets contained in both the SQL Server and ASCII text flat file formats. The searching and analysis tools may be run on a single PC workstation or in a distributed environment. The analysis tools are written in ASP.NET and C# and are available in Visual Studio .NET 2003/2005/2008 formats. The Kabat Database was initially started in 1970 to determine the combining site of antibodies based on the available amino acid sequences at that time. Bence Jones proteins, mostly from human, were aligned, using the now-known Kabat numbering system, and a quantitative measure, variability, was calculated for every position. Three peaks, at positions 24-34, 50-56 and 89-97, were identified and proposed to form the complementarity determining regions (CDR) of light chains. Subsequently, antibody heavy chain amino acid sequences were also aligned using a different numbering system, since the locations of their CDRs (31-35B, 50-65 and 95-102) are different from those of the light chains. CDRL1 starts right after the first invariant Cys 23 of light chains, while CDRH1 is eight amino acid residues away from the first invariant Cys 22 of heavy chains. During the past 30 years, the Kabat database has grown to include nucleotide sequences, sequences of T cell receptors for antigens (TCR), major histocompatibility complex (MHC) class I and II molecules and other proteins of immunological interest. It has been used extensively by immunologists to derive useful structural and functional information from the primary sequences of these proteins.
Proper citation: Kabat Database of Sequences of Proteins of Immunological Interest (RRID:SCR_006465) Copy
This database presents the entire DNA sequence of the first diploid genome sequence of a Han Chinese, a representative of Asian population. The genome, named as YH, represents the start of YanHuang Project, which aims to sequence 100 Chinese individuals in 3 years. It was assembled based on 3.3 billion reads (117.7Gbp raw data) generated by Illumina Genome Analyzer. In total of 102.9Gbp nucleotides were mapped onto the NCBI human reference genome (Build 36) by self-developed software SOAP (Short Oligonucleotide Alignment Program), and 3.07 million SNPs were identified. The personal genome data is illustrated in a MapView, which is powered by GBrowse. A new module was developed to browse large-scale short reads alignment. This module enabled users track detailed divergences between consensus and sequencing reads. In total of 53,643 HGMD recorders were used to screen YH SNPs to retrieve phenotype related information, to superficially explain the donor's genome. Blast service to align query sequences against YH genome consensus was also provided.
Proper citation: YanHuang Project (RRID:SCR_006077) Copy
http://bond.unleashedinformatics.com/
THIS RESOURCE IS NO LONGER IN SERVICE, documented May 10, 2017. A pilot effort that has developed a centralized, web-based biospecimen locator that presents biospecimens collected and stored at participating Arizona hospitals and biospecimen banks, which are available for acquisition and use by researchers. Researchers may use this site to browse, search and request biospecimens to use in qualified studies. The development of the ABL was guided by the Arizona Biospecimen Consortium (ABC), a consortium of hospitals and medical centers in the Phoenix area, and is now being piloted by this Consortium under the direction of ABRC. You may browse by type (cells, fluid, molecular, tissue) or disease. Common data elements decided by the ABC Standards Committee, based on data elements on the National Cancer Institute''s (NCI''s) Common Biorepository Model (CBM), are displayed. These describe the minimum set of data elements that the NCI determined were most important for a researcher to see about a biospecimen. The ABL currently does not display information on whether or not clinical data is available to accompany the biospecimens. However, a requester has the ability to solicit clinical data in the request. Once a request is approved, the biospecimen provider will contact the requester to discuss the request (and the requester''s questions) before finalizing the invoice and shipment. The ABL is available to the public to browse. In order to request biospecimens from the ABL, the researcher will be required to submit the requested required information. Upon submission of the information, shipment of the requested biospecimen(s) will be dependent on the scientific and institutional review approval. Account required. Registration is open to everyone.. Documented on August 19,2019.BOND, which requires registration of a free account, is a resource used to perform cross-database searches of available sequence, interaction, complex and pathway information. BOND integrates a range of component databases including GenBank and BIND, the Biomolecular Interaction Network Database. BOND contains 70+ million biological sequences, 33,000 structures, 38,000 GO terms, and over 200,000 human curated interactions contained in BIND, and is open access. BOND serves the interests of the developing global interactome effort encompassing the genomic, proteomic and metabolomic research communities. BOND is the first open access search resource to integrate sequence and interaction information. BOND integrates BLAST functionality, and contains a well-documented API. BOND also stores annotation links for sequences, including links to Genome Ontology descriptions, MedLine abstracts, taxon identifiers, associated structures, redundant sequences, sequence neighbors, conserved domains, data base cross-references, Online Mendalian Inheritance in Man identifiers, LocusLink identifiers and complete genomes. BIND on BOND The Biomolecular Interaction Network Database (BIND), a component database of BOND, is a collection of records documenting molecular interactions. The contents of BIND include high-throughput data submissions and hand-curated information gathered from the scientific literature. BIND is an interaction database with three classifications for molecular associations: molecules that associate with each other to form interactions, molecular complexes that are formed from one or more interaction(s) and pathways that are defined by a specific sequence of two or more interactions.Interactions A BIND record represents an interaction between two or more objects that is believed to occur in a living organism. A biological object can be a protein, DNA, RNA, ligand, molecular complex, gene, photon or an unclassified biological entity. BIND records are created for interactions which have been shown experimentally and published in at least one peer-reviewed journal. A record also references any papers with experimental evidence that support or dispute the associated interaction. Interactions are the basic units of BIND and can be linked together to form molecular complexes or pathways. The BIND interaction viewer is a tool to visualize and analyze molecular interactions, complexes and pathways. The BIND interaction viewer uses Ontoglyphs to display information about a protein via attributes such as molecular function, biological process and sub-cellular localization. Ontoglyphs allow to graphically and interactively explore interaction networks, by visualizing interactions in the context of 34 functional, 25 binding specificity and 24 sub-cellular localization Ontoglyphs categories. We will continue to provide an open access version of BOND, providing its subscribers with free, unlimited access to a core content set. But we are confident you will soon want to upgrade to BONDplus.
Proper citation: Biomolecular Object Network Databank (RRID:SCR_007433) Copy
http://mips.gsf.de/genre/proj/ustilago/
The MIPS Ustilago maydis Genome Database aims to present information on the molecular structure and functional network of the entirely sequenced, filamentous fungus Ustilago maydis. The underlying sequence is the initial release of the high quality draft sequence of the Broad Institute. The goal of the MIPS database is to provide a comprehensive genome database in the Genome Research Environment in parallel with other fungal genomes to enable in depth fungal comparative analysis. The specific aims are to: 1. Generate and assemble Whole Genome Shotgun sequence reads yielding 10X coverage of the U. maydis genome 2. Integrate the genomic sequence assembly with physical maps generated by Bayer CropScience 3. Perform automated annotation of the sequence assembly 4. Align the strain 521 assembly with the FB1 assembly provided by Exelixis 5. Release the sequence assembly and results of our annotation and analysis to public Ustilago maydis is a basidiomycete fungal pathogen of maize and teosinte. The genome size is approximately 20 Mb. The fungus induces tumors on host plants and forms masses of diploid teliospores. These spores germinate and form haploid meiotic products that can be propagated in culture as yeast-like cells. Haploid strains of opposite mating type fuse and form a filamentous, dikaryotic cell type that invades plant tissue to reinitiate infection. Ustilago maydis is an important model system for studying pathogen-host interactions and has been studied for more than 100 years by plant pathologists. Molecular genetic research with U. maydis focuses on recombination, the role of mating in pathogenesis, and signaling pathways that influence virulence. Recently, the fungus has emerged as an excellent experimental model for the molecular genetic analysis of phytopathogenesis, particularly in the characterization of infection-specific morphogenesis in response to signals from host plants. Ustilago maydis also serves as an important model for other basidiomycete plant pathogens that are more difficult to work with in the laboratory, such as the rust and bunt fungi. Genomic sequence of U. maydis will also be valuable for comparative analysis of other fungal genomes, especially with respect to understanding the host range of fungal phytopathogens. The analysis of U. maydis would provide a framework for studying the hundreds of other Ustilago species that attack important crops, such as barley, wheat, sorghum, and sugarcane. Comparisons would also be possible with other basidiomycete fungi, such as the important human pathogen C. neoformans. Commercially, U. maydis is an excellent model for the discovery of antifungal drugs. In addition, maize tumors caused by U. maydis are prized in Hispanic cuisine and there is interest in improving commercial production. The complete putative gene set of the Broad Institute''s second release is loaded into the database and in addition all deviating putative genes from a putative gene set produced by MIPS with different gene prediction parameters are also loaded. The complete dataset will then be analysed, gene predictions will be manually corrected due to combined information derived from different gene prediction algorithms and, more important, protein and EST comparisons. Gene prediction will be restricted to ORFs larger than 50 codons; smaller ORFs will be included only if similarities to other proteins or EST matches confirm their existence or if a coding region was postulated by all prediction programs used. The resulting proteins will be annotated. They will be classified according to the MIPS classification catalogue receiving appropriate descriptions. All proteins with a known, characterized homolog will be automatically assigned to functional categories using the MIPS functional catalog. All extracted proteins are in addition automatically analysed and annotated by the PEDANT suite.
Proper citation: MIPS Ustilago maydis Database (RRID:SCR_007563) Copy
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the RRID Resources search. From here you can search through a compilation of resources used by RRID and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that RRID has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on RRID then you can log in from here to get additional features in RRID such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into RRID you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the sources that were queried against in your search that you can investigate further.
Here are the categories present within RRID that you can filter your data on
Here are the subcategories present within this category that you can filter your data on
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.