Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
A comprehensive biochemical knowledge-base on human metabolism, this community-driven, consensus metabolic reconstruction integrates metabolic information from five different resources: * Recon 1, a global human metabolic reconstruction (Duarte et al, PNAS, 104(6), 1777-1782, 2007) * EHMN, Edinburgh Human Metabolic Network (Hao et al., BMC Bioinformatics 11, 393, 2010) * HepatoNet1, a liver metabolic reconstruction (Gille et al., Molecular Systems Biology 6, 411, 2010), * Ac/FAO module, an acylcarnitine/fatty acid oxidation module (Sahoo et al., Molecular bioSystems 8, 2545-2558, 2012), * a human small intestinal enterocytes reconstruction (Sahoo and Thiele, submitted). Additionally, more than 370 transport and exchange reactions were added, based on a literature review. Recon 2 is fully semantically annotated (Le Nov��re, N. et al. Nat Biotechnol 23, 1509-1515, 2005) with references to persistent and publicly available chemical and gene databases, unambiguously identifying its components and increasing its applicability for third-party users. Here you can explore the content of the reconstruction by searching/browsing metabolites and reactions. Recon 2 predictive model is available in the Systems Biology Markup Language format.
Proper citation: Recon x (RRID:SCR_006345) Copy
http://www.informatics.jax.org
International database for laboratory mouse. Data offered by The Jackson Laboratory includes information on integrated genetic, genomic, and biological data. MGI creates and maintains integrated representation of mouse genetic, genomic, expression, and phenotype data and develops reference data set and consensus data views, synthesizes comparative genomic data between mouse and other mammals, maintains set of links and collaborations with other bioinformatics resources, develops and supports analysis and data submission tools, and provides technical support for database users. Projects contributing to this resource are: Mouse Genome Database (MGD) Project, Gene Expression Database (GXD) Project, Mouse Tumor Biology (MTB) Database Project, Gene Ontology (GO) Project at MGI, and MouseCyc Project at MGI.
Proper citation: Mouse Genome Informatics (MGI) (RRID:SCR_006460) Copy
http://www.snpedia.com/index.php/SNPedia
Wiki investigating human genetics including information about the effects of variations in DNA, citing peer-reviewed scientific publications. It is used by Promethease to analyze and help explain your DNA. It is based on a wiki model in order to foster communication about genetic variation and to allow interested community members to help it evolve to become ever more relevant. As the cost of genotyping (and especially of fully determining your own genomic sequence) continues to drop, we''''ll all want to know more - a lot more - about the meaning of these DNA variations and SNPedia will be here to help. SNPedia has been launched to help realize the potential of the Human Genome Project to connect to our daily lives and well-being. For more information see the Wikipedia page, http://en.wikipedia.org/wiki/SNPedia * Download URL: http://www.SNPedia.com/index.php/Bulk * Web Service URL: http://bots.SNPedia.com/api.php
Proper citation: SNPedia (RRID:SCR_006125) Copy
ProPortal is a database containing genomic, metagenomic, transcriptomic and field data for the marine cyanobacterium Prochlorococcus. Our goal is to provide a source of cross-referenced data across multiple scales of biological organization--from the genome to the ecosystem--embracing the full diversity of ecotypic variation within this microbial taxon, its sister group, Synechococcus and phage that infect them. The site currently contains the genomes of 13 Prochlorococcus strains, 11 Synechococcus strains and 28 cyanophage strains that infect one or both groups. Cyanobacterial and cyanophage genes are clustered into orthologous groups that can be accessed by keyword search or through a genome browser. Users can also identify orthologous gene clusters shared by cyanobacterial and cyanophage genomes. Gene expression data for Prochlorococcus ecotypes MED4 and MIT9313 allow users to identify genes that are up or downregulated in response to environmental stressors. In addition, the transcriptome in synchronized cells grown on a 24-h light-dark cycle reveals the choreography of gene expression in cells in a ''natural'' state. Metagenomic sequences from the Global Ocean Survey from Prochlorococcus, Synechococcus and phage genomes are archived so users can examine the differences between populations from diverse habitats. Finally, an example of cyanobacterial population data from the field is included.
Proper citation: ProPortal (RRID:SCR_006112) Copy
This database presents the entire DNA sequence of the first diploid genome sequence of a Han Chinese, a representative of Asian population. The genome, named as YH, represents the start of YanHuang Project, which aims to sequence 100 Chinese individuals in 3 years. It was assembled based on 3.3 billion reads (117.7Gbp raw data) generated by Illumina Genome Analyzer. In total of 102.9Gbp nucleotides were mapped onto the NCBI human reference genome (Build 36) by self-developed software SOAP (Short Oligonucleotide Alignment Program), and 3.07 million SNPs were identified. The personal genome data is illustrated in a MapView, which is powered by GBrowse. A new module was developed to browse large-scale short reads alignment. This module enabled users track detailed divergences between consensus and sequencing reads. In total of 53,643 HGMD recorders were used to screen YH SNPs to retrieve phenotype related information, to superficially explain the donor's genome. Blast service to align query sequences against YH genome consensus was also provided.
Proper citation: YanHuang Project (RRID:SCR_006077) Copy
http://research.nhgri.nih.gov/CGD/
Manually curated database of all conditions with known genetic causes, focusing on medically significant genetic data with available interventions. Includes gene symbol, conditions, allelic conditions, inheritance, age in which interventions are indicated, clinical categorization, and general description of interventions/rationale. Contents are intended to describe types of interventions that might be considered. Includes only single gene alterations and does not include genetic associations or susceptibility factors related to more complex diseases.
Proper citation: Clinical Genomic Database (RRID:SCR_006427) Copy
http://igdb.nsclc.ibms.sinica.edu.tw/
IGDB.NSCLC database is aiming to facilitate and prioritize identified lung cancer genes and microRNAs for pathological and mechanistic studies of lung tumorigenesis and for developing new strategies for clinical interventions. We integrated and curated various lung cancer genomic datasets to present # lung cancer genes with somatic mutations, experimental supports and statistic significance in association with clinicopathological features; # genomic alterations with copy number alterations (CNA) detected by high density SNP arrays, gain or loss regions detected by arrayed comparative genome hybridization (aCGH), and loss of heterozygosity (LOH) detected by microsatellite markers; # aberrant expression of genes and microRNAs detected by various microarrays. IGDB.NSCLC database provides user friendly interfaces and searching functions to display multiple layers of evidence for detecting lung cancer target genes and microRNAs, especially emphasizing on concordant alterations: # genes with altered expression located in the CNA regions; # microRNAs with altered expression located in the CNA regions; # somatic mutation genes located in the CNA regions; and # genes associated with clinicopathological features located in the CNA regions. These concordant altered genes and miRNAs should be prioritized for further basic and clinical studies.
Proper citation: IGDB.NSCLC (RRID:SCR_006048) Copy
https://www.ncbi.nlm.nih.gov/geo/
THIS RESOURCE IS NO LONGER IN SERVICE, documented on January 19, 2022.
Proper citation: NCBI Epigenomics (RRID:SCR_006151) Copy
http://www.ncbi.nlm.nih.gov/CCDS/
Database (anonymous FTP) resulting from a collaborative effort to identify a core set of human and mouse protein coding regions that are consistently annotated and of high quality. The long term goal is to support convergence towards a standard set of gene annotations. Collaborators are EBI, NCBI, UCSC, WTSI and the initial results are also available from the participants'''' genome browser Web sites. In addition, CCDS identifiers are indicated on the relevant NCBI RefSeq and Entrez Gene records and in Map Viewer displays of RNA (RefSeq) and Gene annotations on the reference assembly.
Proper citation: Consensus CDS (RRID:SCR_006729) Copy
http://bioinformatics.biol.uoa.gr/cuticleDB
A relational database containing all structural proteins of Arthropod cuticle identified to date. Many come from direct sequencing of proteins isolated from cuticle and from sequences from cDNAs that share common features with these authentic cuticular proteins. It also includes proteins from the five sequenced genomes where manual annotation has been applied to cuticular proteins: Anopheles gambiae, Apis mellifera, Bombyx mori, Drosophila melanogaster, and Nasonia vitripennis. Some sequences were confirmed as authentic cuticular proteins because protein sequencing revealed that they were present in cuticle; others were identified by sequence homology and other criteria. Entries provides information about whether sequences are putative or authentic cuticular proteins. CuticleDB was primarily designed to contain correct and full annotation of cuticular protein data. The database will be of help to future genome annotators. Users will be able to test hypotheses for the existence of known and also of yet unknown motifs in cuticular proteins. An analysis of motifs may contribute to understanding how proteins contribute to the physical properties of cuticle as well as to the precise nature of their interaction with chitin.
Proper citation: CuticleDB (RRID:SCR_007045) Copy
A database and interactive web site for manipulating and displaying annotations on genomes. Features include: detailed views of the genome; use of a variety of premade or personally made glyphs ; customizable order and appearance of tracks by administrators and end-users; search by annotation ID, name, or comment; support of third party annotation using GFF formats; DNA and GFF dumps; connectivity to different databases, including BioSQL and Chado; and a customizable plug-in architecture (e.g. run BLAST, find oligonucleotides, design primers, etc.). GBrowse is distributed as source code for Macintosh OS X, UNIX and Linux platforms, and as pre-packaged binaries for Windows machines. It can be installed using the standard Perl module build procedure, or automated using a network-based install script. In order to use the net installer, you will need to have Perl 5.8.6 or higher and the Apache web server installed. The wiki portion accepts data submissions.
Proper citation: GBrowse (RRID:SCR_006829) Copy
http://bond.unleashedinformatics.com/
THIS RESOURCE IS NO LONGER IN SERVICE, documented May 10, 2017. A pilot effort that has developed a centralized, web-based biospecimen locator that presents biospecimens collected and stored at participating Arizona hospitals and biospecimen banks, which are available for acquisition and use by researchers. Researchers may use this site to browse, search and request biospecimens to use in qualified studies. The development of the ABL was guided by the Arizona Biospecimen Consortium (ABC), a consortium of hospitals and medical centers in the Phoenix area, and is now being piloted by this Consortium under the direction of ABRC. You may browse by type (cells, fluid, molecular, tissue) or disease. Common data elements decided by the ABC Standards Committee, based on data elements on the National Cancer Institute''s (NCI''s) Common Biorepository Model (CBM), are displayed. These describe the minimum set of data elements that the NCI determined were most important for a researcher to see about a biospecimen. The ABL currently does not display information on whether or not clinical data is available to accompany the biospecimens. However, a requester has the ability to solicit clinical data in the request. Once a request is approved, the biospecimen provider will contact the requester to discuss the request (and the requester''s questions) before finalizing the invoice and shipment. The ABL is available to the public to browse. In order to request biospecimens from the ABL, the researcher will be required to submit the requested required information. Upon submission of the information, shipment of the requested biospecimen(s) will be dependent on the scientific and institutional review approval. Account required. Registration is open to everyone.. Documented on August 19,2019.BOND, which requires registration of a free account, is a resource used to perform cross-database searches of available sequence, interaction, complex and pathway information. BOND integrates a range of component databases including GenBank and BIND, the Biomolecular Interaction Network Database. BOND contains 70+ million biological sequences, 33,000 structures, 38,000 GO terms, and over 200,000 human curated interactions contained in BIND, and is open access. BOND serves the interests of the developing global interactome effort encompassing the genomic, proteomic and metabolomic research communities. BOND is the first open access search resource to integrate sequence and interaction information. BOND integrates BLAST functionality, and contains a well-documented API. BOND also stores annotation links for sequences, including links to Genome Ontology descriptions, MedLine abstracts, taxon identifiers, associated structures, redundant sequences, sequence neighbors, conserved domains, data base cross-references, Online Mendalian Inheritance in Man identifiers, LocusLink identifiers and complete genomes. BIND on BOND The Biomolecular Interaction Network Database (BIND), a component database of BOND, is a collection of records documenting molecular interactions. The contents of BIND include high-throughput data submissions and hand-curated information gathered from the scientific literature. BIND is an interaction database with three classifications for molecular associations: molecules that associate with each other to form interactions, molecular complexes that are formed from one or more interaction(s) and pathways that are defined by a specific sequence of two or more interactions.Interactions A BIND record represents an interaction between two or more objects that is believed to occur in a living organism. A biological object can be a protein, DNA, RNA, ligand, molecular complex, gene, photon or an unclassified biological entity. BIND records are created for interactions which have been shown experimentally and published in at least one peer-reviewed journal. A record also references any papers with experimental evidence that support or dispute the associated interaction. Interactions are the basic units of BIND and can be linked together to form molecular complexes or pathways. The BIND interaction viewer is a tool to visualize and analyze molecular interactions, complexes and pathways. The BIND interaction viewer uses Ontoglyphs to display information about a protein via attributes such as molecular function, biological process and sub-cellular localization. Ontoglyphs allow to graphically and interactively explore interaction networks, by visualizing interactions in the context of 34 functional, 25 binding specificity and 24 sub-cellular localization Ontoglyphs categories. We will continue to provide an open access version of BOND, providing its subscribers with free, unlimited access to a core content set. But we are confident you will soon want to upgrade to BONDplus.
Proper citation: Biomolecular Object Network Databank (RRID:SCR_007433) Copy
http://mips.gsf.de/genre/proj/ustilago/
The MIPS Ustilago maydis Genome Database aims to present information on the molecular structure and functional network of the entirely sequenced, filamentous fungus Ustilago maydis. The underlying sequence is the initial release of the high quality draft sequence of the Broad Institute. The goal of the MIPS database is to provide a comprehensive genome database in the Genome Research Environment in parallel with other fungal genomes to enable in depth fungal comparative analysis. The specific aims are to: 1. Generate and assemble Whole Genome Shotgun sequence reads yielding 10X coverage of the U. maydis genome 2. Integrate the genomic sequence assembly with physical maps generated by Bayer CropScience 3. Perform automated annotation of the sequence assembly 4. Align the strain 521 assembly with the FB1 assembly provided by Exelixis 5. Release the sequence assembly and results of our annotation and analysis to public Ustilago maydis is a basidiomycete fungal pathogen of maize and teosinte. The genome size is approximately 20 Mb. The fungus induces tumors on host plants and forms masses of diploid teliospores. These spores germinate and form haploid meiotic products that can be propagated in culture as yeast-like cells. Haploid strains of opposite mating type fuse and form a filamentous, dikaryotic cell type that invades plant tissue to reinitiate infection. Ustilago maydis is an important model system for studying pathogen-host interactions and has been studied for more than 100 years by plant pathologists. Molecular genetic research with U. maydis focuses on recombination, the role of mating in pathogenesis, and signaling pathways that influence virulence. Recently, the fungus has emerged as an excellent experimental model for the molecular genetic analysis of phytopathogenesis, particularly in the characterization of infection-specific morphogenesis in response to signals from host plants. Ustilago maydis also serves as an important model for other basidiomycete plant pathogens that are more difficult to work with in the laboratory, such as the rust and bunt fungi. Genomic sequence of U. maydis will also be valuable for comparative analysis of other fungal genomes, especially with respect to understanding the host range of fungal phytopathogens. The analysis of U. maydis would provide a framework for studying the hundreds of other Ustilago species that attack important crops, such as barley, wheat, sorghum, and sugarcane. Comparisons would also be possible with other basidiomycete fungi, such as the important human pathogen C. neoformans. Commercially, U. maydis is an excellent model for the discovery of antifungal drugs. In addition, maize tumors caused by U. maydis are prized in Hispanic cuisine and there is interest in improving commercial production. The complete putative gene set of the Broad Institute''s second release is loaded into the database and in addition all deviating putative genes from a putative gene set produced by MIPS with different gene prediction parameters are also loaded. The complete dataset will then be analysed, gene predictions will be manually corrected due to combined information derived from different gene prediction algorithms and, more important, protein and EST comparisons. Gene prediction will be restricted to ORFs larger than 50 codons; smaller ORFs will be included only if similarities to other proteins or EST matches confirm their existence or if a coding region was postulated by all prediction programs used. The resulting proteins will be annotated. They will be classified according to the MIPS classification catalogue receiving appropriate descriptions. All proteins with a known, characterized homolog will be automatically assigned to functional categories using the MIPS functional catalog. All extracted proteins are in addition automatically analysed and annotated by the PEDANT suite.
Proper citation: MIPS Ustilago maydis Database (RRID:SCR_007563) Copy
Database about gene regulation and gene expression in prokaryotes. It includes a manually curated and unique collection of transcription factor binding sites. A variety of bioinformatics tools for the prediction, analysis and visualization of regulons and gene reglulatory networks is included. The integrated approach provides information about molecular networks in prokaryotes with focus on pathogenic organisms. In detail this concerns: * transcriptional regulation (transcription factors and their DNA binding sites * signal transduction (two-component systems, phosphylation cascades) * protein interactions (complex formation, oligomerization) * biochemical pathways (chemical reactions) * other regulation events (e.g. codon usage, etc. ...) It aims to be a resource to model protein-host interactions and to be a suitable platform to analyze high-throughput data from proteomis and transcriptomics experiments (systems biology). Currently it mainly contains detailed information about operon and promoter structures including huge collections of transcription factor binding sites. If an appropriate number of regulatory binding sites is available, a position weight matrix (PWM) and a sequence logo is provided, which can be used to predict new binding sites. This data is collected manually by screening the original scientific literature. PRODORIC also handles protein-protein interactions and signal-transduction cascades that commonly occur in form of two-component systems in prokaryotes. Furthermore it contains metabolic network data imported from the KEGG database., THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025.
Proper citation: PRODORIC (RRID:SCR_007074) Copy
http://www.ncbi.nlm.nih.gov/COG
A database for phylogenetic classification for proteins encoded in complete genomes. Clusters of Orthologous Groups of proteins (COGs) were delineated by comparing protein sequences encoded in complete genomes, representing major phylogenetic lineages. Each COG consists of individual proteins or groups of paralogs from at least 3 lineages and thus corresponds to an ancient conserved domain. Please be aware that COGs hasn't been updated in many years and will not be.
Proper citation: COG (RRID:SCR_007139) Copy
http://genolist.pasteur.fr/Colibri/
Database dedicated to the analysis of the genome of Escherichia coli. Its purpose is to collate and integrate various aspects of the genomic information from E. coli, the paradigm of Gram-negative bacteria. Colibri provides a complete dataset of DNA and protein sequences derived from the paradigm strain E. coli K-12, linked to the relevant annotations and functional assignments. It allows one to easily browse through these data and retrieve information, using various criteria (gene names, location, keywords, etc.). The data contained in Colibri originates from two major sources of information, the reference genomic DNA sequence from the E. coli Genome Project and the feature annotations from the EcoGene data collection., THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025.
Proper citation: Colibri (RRID:SCR_007606) Copy
Collection of male germ cell transcriptiome information derived from Serial Analysis of Gene Expression (SAGE). It includes the three key germ cell stages in spermatogenesis, including mouse type A spermatogonia (Spga), pachytene spermatocytes (Spcy), and round spermatids (Sptd). A total of 452,095 SAGE tags are represented in all the libraries and is by far the most comprehensive resource available. Users can choose a global view of germ cell transcriptome data in the UCSC Genome browser. They can also search genes or specify searching criteria based on tag sequence, chromosomal location or tag counts.
Proper citation: GermSAGE (RRID:SCR_007689) Copy
http://projects.tcag.ca/xenodup/
THIS RESOURCE IS NO LONGER IN SERVICE, documented on July 16, 2013. It contains information about segmental duplications in the genomes of chimpanzee, mouse, and rat. The criteria used to identify regions of segmental duplication are: * Sequence identity of at least 90% * Sequence length of at least 5 kb * Not be entirely composed of repetitive elements. BACKGROUND: The high quality of the mouse genome draft sequence and its associated annotations are an invaluable biological resource. Identifying recent duplications in the mouse genome, especially in regions containing genes, may highlight important events in recent murine evolution. In addition, detecting recent sequence duplications can reveal potentially problematic regions of the genome assembly. We use BLAST-based computational heuristics to identify large (>/= 5 kb) and recent (>/= 90% sequence identity) segmental duplications in the mouse genome sequence. Here we present a database of recently duplicated regions of the mouse genome found in the mouse genome sequencing consortium (MGSC) February 2002 and February 2003 assemblies. RESULTS: We determined that 33.6 Mb of 2,695 Mb (1.2%) of sequence from the February 2003 mouse genome sequence assembly is involved in recent segmental duplications, which is less than that observed in the human genome (around 3.5-5%). From this dataset, 8.9 Mb (26%) of the duplication content consisted of "unmapped" chromosome sequence. Moreover, we suspect that an additional 18.5 Mb of sequence is involved in duplication artifacts arising from sequence misassignment errors in this genome assembly. By searching for genes that are located within these regions, we identified 675 genes that mapped to duplicated regions of the mouse genome. Sixteen of these genes appear to have been duplicated independently in the human genome. From our dataset we further characterized a 42 kb recent segmental duplication of Mater, a maternal-effect gene essential for embryogenesis in mice. CONCLUSION: Our results provide an initial analysis of the recently duplicated sequence and gene content of the mouse genome. Many of these duplicated loci, as well as regions identified to be involved in potential sequence misassignment errors, will require further mapping and sequencing to achieve accuracy. A Genome Browser database was set up to display the identified duplication content presented in this work. This data will also be relevant to the growing number of investigators who use the draft genome sequence for experimental design and analysis. The segmental duplication data and summary statistics are available for download and can also be visualized in a genome browser in the GBrowse section. Selected annotation tracks (except the segmental duplication track) have also been obtained from UCSC and loaded into the genome browser. Detailed information (e.g. overlapping genes, overlapping clones, detailed alignment) can be obtained by clicking on a duplication cluster in GBrowse. Both keyword search and BLAT search are available. Analyses based on previous genome assemblies can be found in the Previous Analyses section. Recent Developments The Non-Human Genome Segmental Duplication Database is continually updated including the archived copies of the analysis of all previous genome assemblies and will include all new species as they become available. Acknowledgments We thank The Centre for Applied Genomics at the Hospital for Sick Children (HSC) as well as collaborators worldwide. Supported by Genome Canada the Howard Hughes Medical Institute International Scholar Program (to S.W.S.) and the HSC Foundation.
Proper citation: Non-Human Genome Segmental Duplication Database (RRID:SCR_000470) Copy
Database and integrated tools to improve annotation of the bovine genome and to integrate the genome sequence with other genomics data.
Proper citation: Bovine Genome Database (RRID:SCR_000148) Copy
Database and browser that provides a central resource to archive and display association between genetic variation and high-throughput molecular-level phenotypes. This effort originated with the NIH GTEx roadmap project: however the scope of this resource will be extended to include any available genotype/molecular phenotype datasets.
Proper citation: GTEx eQTL Browser (RRID:SCR_001618) Copy
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the dkNET Resources search. From here you can search through a compilation of resources used by dkNET and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that dkNET has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on dkNET then you can log in from here to get additional features in dkNET such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into dkNET you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the sources that were queried against in your search that you can investigate further.
Here are the categories present within dkNET that you can filter your data on
Here are the subcategories present within this category that you can filter your data on
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.