Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
http://www.ncbi.nlm.nih.gov/ieb/research/acembly/
THIS RESOURCE IS NO LONGER IN SERVICE, documented May 10, 2017. A pilot effort that has developed a centralized, web-based biospecimen locator that presents biospecimens collected and stored at participating Arizona hospitals and biospecimen banks, which are available for acquisition and use by researchers. Researchers may use this site to browse, search and request biospecimens to use in qualified studies. The development of the ABL was guided by the Arizona Biospecimen Consortium (ABC), a consortium of hospitals and medical centers in the Phoenix area, and is now being piloted by this Consortium under the direction of ABRC. You may browse by type (cells, fluid, molecular, tissue) or disease. Common data elements decided by the ABC Standards Committee, based on data elements on the National Cancer Institute''s (NCI''s) Common Biorepository Model (CBM), are displayed. These describe the minimum set of data elements that the NCI determined were most important for a researcher to see about a biospecimen. The ABL currently does not display information on whether or not clinical data is available to accompany the biospecimens. However, a requester has the ability to solicit clinical data in the request. Once a request is approved, the biospecimen provider will contact the requester to discuss the request (and the requester''s questions) before finalizing the invoice and shipment. The ABL is available to the public to browse. In order to request biospecimens from the ABL, the researcher will be required to submit the requested required information. Upon submission of the information, shipment of the requested biospecimen(s) will be dependent on the scientific and institutional review approval. Account required. Registration is open to everyone., documented August 29, 2016. AceView offers an integrated view of the human, nematode and Arabidopsis genes reconstructed by co-alignment of all publicly available mRNAs and ESTs on the genome sequence. Our goals are to offer a reliable up-to-date resource on the genes and their functions and to stimulate further validating experiments at the bench. AceView provides a curated, comprehensive and non-redundant sequence representation of all public mRNA sequences (mRNAs from GenBank or RefSeq, and single pass cDNA sequences from dbEST and Trace). These experimental cDNA sequences are first co-aligned on the genome then clustered into a minimal number of alternative transcript variants and grouped into genes. Using exhaustively and with high quality standards the available cDNA sequences evidences the beauty and complexity of mammals' transcriptome, and the relative simplicity of the nematode and plant transcriptomes. Genes are classified according to their inferred coding potential; many presumably non-coding genes are discovered. Genes are named by Entrez Gene names when available, else by AceView gene names, stable from release to release. Alternative features (promoters, introns and exons, polyadenylation signals) and coding potential, including motifs, domains, and homologies are annotated in depth; tissues where expression has been observed are listed in order of representation; diseases, phenotypes, pathways, functions, localization or interactions are annotated by mining selected sources, in particular PubMed, GAD and Entrez Gene, and also by performing manual annotation, especially in the worm. In this way, both the anatomy and physiology of the experimentally cDNA supported human, mouse and nematode genes are thoroughly annotated. Our goals are to offer an up-to-date resource on the genes, in the hope to stimulate further experiments at the bench, or to help medical research. AceView can be queried by meaningful words or groups of words as well as by most standard identifiers, such as gene names, Entrez Gene ID, UniGene ID, GenBank accessions.
Proper citation: AceView (RRID:SCR_002277) Copy
http://fullmal.hgc.jp/index_ajax.html
FULL-malaria is a database for a full-length-enriched cDNA library from the human malaria parasite Plasmodium falciparum. Because of its medical importance, this organism is the first target for genome sequencing of a eukaryotic pathogen; the sequences of two of its 14 chromosomes have already been determined. However, for the full exploitation of this rapidly accumulating information, correct identification of the genes and study of their expression are essential. Using the oligo-capping method, this database has produced a full-length-enriched cDNA library from erythrocytic stage parasites and performed one-pass reading. The database consists of nucleotide sequences of 2490 random clones that include 390 (16%) known malaria genes according to BLASTN analysis of the nr-nt database in GenBank; these represent 98 genes, and the clones for 48 of these genes contain the complete protein-coding sequence (49%). On the other hand, comparisons with the complete chromosome 2 sequence revealed that 35 of 210 predicted genes are expressed, and in addition led to detection of three new gene candidates that were not previously known. In total, 19 of these 38 clones (50%) were full-length. From these observations, it is expected that the database contains approximately 1000 genes, including 500 full-length clones. It should be an invaluable resource for the development of vaccines and novel drugs. Full-malaria has been updated in at least three points. (i) 8934 sequences generated from the addition of new libraries added so that the database collection of 11,424 full-length cDNAs covers 1375 (25%) of the estimated number of the entire 5409 parasite genes. (ii) All of its full-length cDNAs and GenBank EST sequences were mapped to genomic sequences together with publicly available annotated genes and other predictions. This precisely determined the gene structures and positions of the transcriptional start sites, which are indispensable for the identification of the promoter regions. (iii) A total of 4257 cDNA sequences were newly generated from murine malaria parasites, Plasmodium yoelii yoelii. The genome/cDNA sequences were compared at both nucleotide and amino acid levels, with those of P.falciparum, and the sequence alignment for each gene is presented graphically. This part of the database serves as a versatile platform to elucidate the function(s) of malaria genes by a comparative genomic approach. It should also be noted that all of the cDNAs represented in this database are supported by physical cDNA clones, which are publicly and freely available, and should serve as indispensable resources to explore functional analyses of malaria genomes. Sponsors: This database has been constructed and maintained by a Grant-in-Aid for Publication of Scientific Research Results from the Japan Society for the Promotion of Science (JSPS). This work was also supported by a Special Coordination Funds for Promoting Science and Technology from the Science and Technology Agency of Japan (STA) and a Grant-in-Aid for Scientific Research on Priority Areas from the Ministry of Education, Science, Sports and Culture of Japan.
Proper citation: Full-Malaria: Malaria Full-Length cDNA Database (RRID:SCR_002348) Copy
http://people.biochem.umass.edu/fournierlab/3dmodmap/
Database of maps showing the sites of modified rRNA nucleotides. Access to the rRNA sequences, secondary structures both with modification sites indicated, 3D modification maps and the supporting tables of equivalent nucleotides for rRNA from model organisms including yeast, arabidopsis, e. coli and human is provided. This database complements the Yeast snoRNA Database at UMass-Amherst and relies on linking to some content from that database, as well as to others by colleagues in related fields. Therefore, please be very cognizant as to the source when citing information obtained herein. Locations of modified rRNA nucleotides within the 3D structure of the ribosome.
Proper citation: 3D Ribosomal Modification Maps Database (RRID:SCR_003097) Copy
http://compbio.uthsc.edu/miRSNP/
Database of naturally occurring DNA variations in microRNA (miRNA) seed regions and miRNA target sites. MicroRNAs pair to the transcripts of protein-coding genes and cause translational repression or mRNA destabilization. SNPs and INDELs in miRNAs and their target sites may affect miRNA-mRNA interaction, and hence affect miRNA-mediated gene repression. The PolymiRTS database was created by scanning 3'UTRs of mRNAs in human and mouse for SNPs and INDELs in miRNA target sites. Then, the potential downstream effects of these polymorphisms on gene expression and higher-order phenotypes are identified. Specifically, genes containing PolymiRTSs, cis-acting expression QTLs, and physiological QTLs in mouse and the results of genome-wide association studies (GWAS) of human traits and diseases are linked in the database. The PolymiRTS database also includes polymorphisms in target sites that have been supported by a variety of experimental methods and polymorphisms in miRNA seed regions.
Proper citation: PolymiRTS (RRID:SCR_003389) Copy
THIS RESOURCE IS NO LONGER IN SERVICE, documented May 10, 2017. A pilot effort that has developed a centralized, web-based biospecimen locator that presents biospecimens collected and stored at participating Arizona hospitals and biospecimen banks, which are available for acquisition and use by researchers. Researchers may use this site to browse, search and request biospecimens to use in qualified studies. The development of the ABL was guided by the Arizona Biospecimen Consortium (ABC), a consortium of hospitals and medical centers in the Phoenix area, and is now being piloted by this Consortium under the direction of ABRC. You may browse by type (cells, fluid, molecular, tissue) or disease. Common data elements decided by the ABC Standards Committee, based on data elements on the National Cancer Institute''s (NCI''s) Common Biorepository Model (CBM), are displayed. These describe the minimum set of data elements that the NCI determined were most important for a researcher to see about a biospecimen. The ABL currently does not display information on whether or not clinical data is available to accompany the biospecimens. However, a requester has the ability to solicit clinical data in the request. Once a request is approved, the biospecimen provider will contact the requester to discuss the request (and the requester''s questions) before finalizing the invoice and shipment. The ABL is available to the public to browse. In order to request biospecimens from the ABL, the researcher will be required to submit the requested required information. Upon submission of the information, shipment of the requested biospecimen(s) will be dependent on the scientific and institutional review approval. Account required. Registration is open to everyone., documented September 2, 2016. Database for defining official rat gene symbols. It includes rat gene symbols from three major sources: the Rat Genome Database (RGD), Ensembl, and NCBI-Gene. All rat symbols are compared with official symbols from orthologous human genes as specified by the Human Gene Nomenclature Committee (HGNC). Based on the outcome of the comparisons, a rat gene symbol may be selected. Rat symbols that do not match a human ortholog undergo a strict procedure of comparisons between the different rat gene sources as well as with the Mouse Genome Database (MGD). For each rat gene this procedure results in an unambiguous gene designation. The designation is presented as a status level that accompanies every rat gene symbol suggested in the database. The status level describes both how a rat symbol was selected, and its validity. Rat Gene Symbol Tracker approves rat gene symbols by an automatic procedure. The rat genes are presented with links to RGD, Ensembl, NCBI Gene, MGI and HGNC. RGST ensures that each acclaimed rat gene symbol is unique and follows the guidelines given by the RGNC. To each symbol a status level associated, describing the gene naming process.
Proper citation: Rat Gene Symbol Tracker (RRID:SCR_003261) Copy
http://bioinfo.mbi.ucla.edu/ASAP/
THIS RESOURCE IS NO LONGER IN SERVICE, documented on 8/12/13. Database to access and mine alternative splicing information coming from genomics and proteomics based on genome-wide analyses of alternative splicing in human (30 793 alternative splice relationships found) from detailed alignment of expressed sequences onto the genomic sequence. ASAP provides precise gene exon-intron structure, alternative splicing, tissue specificity of alternative splice forms, and protein isoform sequences resulting from alternative splicing. They developed an automated method for discovering human tissue-specific regulation of alternative splicing through a genome-wide analysis of expressed sequence tags (ESTs), which involves classifying human EST libraries according to tissue categories and Bayesian statistical analysis. They use the UniGene clusters of human Expressed Sequence Tags (ESTs) to identify splices. The UniGene EST's are clustered so that a single cluster roughly corresponds to a gene (or at least a part of a gene). A single EST represents a portion of a processed (already spliced) mRNA. A given cluster contains many ESTs, each representing an outcome of a series of splicing events. The ESTs in UniGene contain the different mRNA isoforms transcribed from an alternatively spliced gene. They are not predicting alternative splicing, but locating it based on EST analysis. The discovered splices are further analyzed to determine alternative splicing events. They have identified 6201 alternative splice relationships in human genes, through a genome-wide analysis of expressed sequence tags (ESTs). Starting with 2.1 million human mRNA and EST sequences, they mapped expressed sequences onto the draft human genome sequence and only accepted splices that obeyed the standard splice site consensus. After constructing a tissue list of 46 human tissues with 2 million human ESTs, they generated a database of novel human alternative splices that is four times larger than our previous report, and used Bayesian statistics to compare the relative abundance of every pair of alternative splices in these tissues. Using several statistical criteria for tissue specificity, they have identified 667 tissue-specific alternative splicing relationships and analyzed their distribution in human tissues. They have validated our results by comparison with independent studies. This genome-wide analysis of tissue specificity of alternative splicing will provide a useful resource to study the tissue-specific functions of transcripts and the association of tissue-specific variants with human diseases.
Proper citation: ASAP: the Alternative Splicing Annotation Project (RRID:SCR_003415) Copy
Database of polymorphisms and mutations of the human mitochondrial DNA. It reports published and unpublished data on human mitochondrial DNA variation. All data is curated by hand. If you would like to submit published articles to be included in mitomap, please send them the citation and a pdf.
Proper citation: MITOMAP - A human mitochondrial genome database (RRID:SCR_002996) Copy
http://www.hgsc.bcm.tmc.edu/content/hapmap-3-and-encode-3
Draft release 3 for genome-wide SNP genotyping and targeted sequencing in DNA samples from a variety of human populations (sometimes referred to as the HapMap 3 samples). This release contains the following data: * SNP genotype data generated from 1184 samples, collected using two platforms: the Illumina Human1M (by the Wellcome Trust Sanger Institute) and the Affymetrix SNP 6.0 (by the Broad Institute). Data from the two platforms have been merged for this release. * PCR-based resequencing data (by Baylor College of Medicine Human Genome Sequencing Center) across ten 100-kb regions (collectively referred to as ENCODE 3) in 712 samples. Since this is a draft release, please check this site regularly for updates and new releases. The HapMap 3 sample collection comprises 1,301 samples (including the original 270 samples used in Phase I and II of the International HapMap Project) from 11 populations, listed below alphabetically by their 3-letter labels. Five of the ten ENCODE 3 regions overlap with the HapMap-ENCODE regions; the other five are regions selected at random from the ENCODE target regions (excluding the 10 HapMap-ENCODE regions). All ENCODE 3 regions are 100-kb in size, and are centered within each respective ENCODE region. The HapMap 3 and ENCORE 3 data are downloadable from the ftp site.
Proper citation: HapMap 3 and ENCODE 3 (RRID:SCR_004563) Copy
http://aws.amazon.com/1000genomes/
A dataset containing the full genomic sequence of 1,700 individuals, freely available for research use. The 1000 Genomes Project is an international research effort coordinated by a consortium of 75 companies and organizations to establish the most detailed catalogue of human genetic variation. The project has grown to 200 terabytes of genomic data including DNA sequenced from more than 1,700 individuals that researchers can now access on AWS for use in disease research free of charge. The dataset containing the full genomic sequence of 1,700 individuals is now available to all via Amazon S3. The data can be found at: http://s3.amazonaws.com/1000genomes The 1000 Genomes Project aims to include the genomes of more than 2,662 individuals from 26 populations around the world, and the NIH will continue to add the remaining genome samples to the data collection this year. Public Data Sets on AWS provide a centralized repository of public data hosted on Amazon Simple Storage Service (Amazon S3). The data can be seamlessly accessed from AWS services such Amazon Elastic Compute Cloud (Amazon EC2) and Amazon Elastic MapReduce (Amazon EMR), which provide organizations with the highly scalable compute resources needed to take advantage of these large data collections. AWS is storing the public data sets at no charge to the community. Researchers pay only for the additional AWS resources they need for further processing or analysis of the data. All 200 TB of the latest 1000 Genomes Project data is available in a publicly available Amazon S3 bucket. You can access the data via simple HTTP requests, or take advantage of the AWS SDKs in languages such as Ruby, Java, Python, .NET and PHP. Researchers can use the Amazon EC2 utility computing service to dive into this data without the usual capital investment required to work with data at this scale. AWS also provides a number of orchestration and automation services to help teams make their research available to others to remix and reuse. Making the data available via a bucket in Amazon S3 also means that customers can crunch the information using Hadoop via Amazon Elastic MapReduce, and take advantage of the growing collection of tools for running bioinformatics job flows, such as CloudBurst and Crossbow.
Proper citation: 1000 Genomes Project and AWS (RRID:SCR_008801) Copy
https://confluence.crbs.ucsd.edu/display/NIF/StemCellInfo
Data tables providing an overview of information about stem cells that have been derived from mice and humans. The tables summarize published research that characterizes cells that are capable of developing into cells of multiple germ layers (i.e., multipotent or pluripotent) or that can generate the differentiated cell types of another tissue (i.e., plasticity) such as a bone marrow cell becoming a neuronal cell. The tables do not include information about cells considered progenitor or precursor cells or those that can proliferate without the demonstrated ability to generate cell types of other tissues. The tables list the tissue from which the cells were derived, the types of cells that developed, the conditions under which differentiation occurred, the methods by which the cells were characterized, and the primary references for the information.
Proper citation: National Institutes of Health Stem Cell Tables (RRID:SCR_008359) Copy
http://www.cs.tau.ac.il/~shlomito/tissue-net/
THIS RESOURCE IS NO LONGER IN SERVICE, documented August 23, 2016. Network visualizations in which the expression and predicted flux data are projected over the global human network. These network visualizations are accessible through the supplemental website using the publicly available Cytoscape software (Cline, Smoot et al. 2007). Since many high degree nodes exist in the network, special layouts are required to produce network visualizations that are readily interpretable. To this end we produced network visualizations in which hub nodes are repeated multiple times and hence layouts with a small number of edge crossings can be generated. Contains entries for brain compartments and brain pathways.
Proper citation: Network-based Prediction of Human Tissue-specific Metabolism (RRID:SCR_007392) Copy
http://www.emcdda.europa.eu/eib
The EIB provides assessment tests for substance disorder related clinical instruments that are freely available. Details regarding copyright and/or possible use restrictions are specified for each instrument. Instruments are generally classed according to the intervention field they are designed to be used in (treatment, prevention, or harm reduction), though some instruments may be usable in more than one field.
Proper citation: Evaluation Instruments Bank (RRID:SCR_013246) Copy
http://dunham.gs.washington.edu/protocols.shtml
A portal for Maitreya Dunham's lab, which works on the genomic analysis of experimental evolution in yeast using microarrays and the chemostat. Research interests of the lab include experimental evolution of genetic networks in yeast, aneuploidy and copy number variation, comparative genomics, technology development and human genetics in yeast.
Proper citation: Maitreya Dunham's Lab (RRID:SCR_000784) Copy
The mission of ILAR is to evaluate and disseminate information on issues related to the scientific, technological, and ethical use of animals and related biological resources in research, testing, and education. Using the principles of refinement, reduction, and replacement (3Rs) as a foundation, ILAR promotes high-quality science through the humane care and use of animals and the implementation of alternatives. Through the reports of expert committees, the ILAR Journal, web-based resources, and other means of communication, ILAR functions as a component of the National Academies to provide independent, objective advice to the federal government, the international biomedical research community, and the public. ILAR supports the responsible use of animals in research, testing, and education as a key component to advancing the health and quality of life of humans and animals. It promotes high-quality science and humane care and use of research animals based upon the principles of refinement, replacement, and reduction (the 3Rs) and high ethical standards. It fosters best practices that enhance human and animal welfare by organizing and disseminating information and by facilitating dialogue among interested parties. It has developed a unique Search Engine to search for animal models and strains. This search engine surveys all the websites of vendors and repositories of laboratory animals and biological material on our Links page. The ILAR develops guidelines on laboratory animal care and use and conducts conferences, symposia, and workshops on important laboratory animal problems. ILAR publishes the ILAR Journal on a quarterly basis, as well as conference proceedings and special reports prepared by committees of experts. A list of ILAR publications on issues related to laboratory animal research is available on the Web site. As part of the Animal Models and Genetic Stocks Information Exchange Program, ILAR staff members answer direct telephone and mail inquiries and maintain a Web page containing a database on animal models and genetic stock. The Web site also offers a comprehensive search engine that enables users to find information on the existence and location of special animal models, correct nomenclature to identify animals, and related topics such as diseases of animals and relevant publications. Sponsors: ILAR receives funding from the following sponsors: -Abbott Laboratories -Abbott Fund -American College of Laboratory Animal Medicine (ACLAM) -American Society of Laboratory Animal Practitioners (ASLAP) -Association for Assessment and Accreditation of Laboratory Animal Care (AAALAC) -Bristol-Myers Squibb Co. -Charles River -Charles River Laboratories Foundation -Covance -Federation of American Societies for Experimental Biology (FASEB) -GlaxoSmithKline -Merck & Co., Inc. -National Science Foundation (NSF) -Pfizer -Scientists Center for Animal Welfare (SCAW) -U.S. Department of Agriculture (USDA) -U.S. Department of the Army -U.S. Department of Health and Human Services (DHHS) :*National Institutes of Health (NIH) :*Office of Research Integrity (ORI) -U.S. Department of the Navy -U.S. Department of Veterans Affairs -Wellcome Trust -Wyeth Pharmaceuticals
Proper citation: Institute for Laboratory Animal Research (RRID:SCR_006872) Copy
http://bond.unleashedinformatics.com/
THIS RESOURCE IS NO LONGER IN SERVICE, documented May 10, 2017. A pilot effort that has developed a centralized, web-based biospecimen locator that presents biospecimens collected and stored at participating Arizona hospitals and biospecimen banks, which are available for acquisition and use by researchers. Researchers may use this site to browse, search and request biospecimens to use in qualified studies. The development of the ABL was guided by the Arizona Biospecimen Consortium (ABC), a consortium of hospitals and medical centers in the Phoenix area, and is now being piloted by this Consortium under the direction of ABRC. You may browse by type (cells, fluid, molecular, tissue) or disease. Common data elements decided by the ABC Standards Committee, based on data elements on the National Cancer Institute''s (NCI''s) Common Biorepository Model (CBM), are displayed. These describe the minimum set of data elements that the NCI determined were most important for a researcher to see about a biospecimen. The ABL currently does not display information on whether or not clinical data is available to accompany the biospecimens. However, a requester has the ability to solicit clinical data in the request. Once a request is approved, the biospecimen provider will contact the requester to discuss the request (and the requester''s questions) before finalizing the invoice and shipment. The ABL is available to the public to browse. In order to request biospecimens from the ABL, the researcher will be required to submit the requested required information. Upon submission of the information, shipment of the requested biospecimen(s) will be dependent on the scientific and institutional review approval. Account required. Registration is open to everyone.. Documented on August 19,2019.BOND, which requires registration of a free account, is a resource used to perform cross-database searches of available sequence, interaction, complex and pathway information. BOND integrates a range of component databases including GenBank and BIND, the Biomolecular Interaction Network Database. BOND contains 70+ million biological sequences, 33,000 structures, 38,000 GO terms, and over 200,000 human curated interactions contained in BIND, and is open access. BOND serves the interests of the developing global interactome effort encompassing the genomic, proteomic and metabolomic research communities. BOND is the first open access search resource to integrate sequence and interaction information. BOND integrates BLAST functionality, and contains a well-documented API. BOND also stores annotation links for sequences, including links to Genome Ontology descriptions, MedLine abstracts, taxon identifiers, associated structures, redundant sequences, sequence neighbors, conserved domains, data base cross-references, Online Mendalian Inheritance in Man identifiers, LocusLink identifiers and complete genomes. BIND on BOND The Biomolecular Interaction Network Database (BIND), a component database of BOND, is a collection of records documenting molecular interactions. The contents of BIND include high-throughput data submissions and hand-curated information gathered from the scientific literature. BIND is an interaction database with three classifications for molecular associations: molecules that associate with each other to form interactions, molecular complexes that are formed from one or more interaction(s) and pathways that are defined by a specific sequence of two or more interactions.Interactions A BIND record represents an interaction between two or more objects that is believed to occur in a living organism. A biological object can be a protein, DNA, RNA, ligand, molecular complex, gene, photon or an unclassified biological entity. BIND records are created for interactions which have been shown experimentally and published in at least one peer-reviewed journal. A record also references any papers with experimental evidence that support or dispute the associated interaction. Interactions are the basic units of BIND and can be linked together to form molecular complexes or pathways. The BIND interaction viewer is a tool to visualize and analyze molecular interactions, complexes and pathways. The BIND interaction viewer uses Ontoglyphs to display information about a protein via attributes such as molecular function, biological process and sub-cellular localization. Ontoglyphs allow to graphically and interactively explore interaction networks, by visualizing interactions in the context of 34 functional, 25 binding specificity and 24 sub-cellular localization Ontoglyphs categories. We will continue to provide an open access version of BOND, providing its subscribers with free, unlimited access to a core content set. But we are confident you will soon want to upgrade to BONDplus.
Proper citation: Biomolecular Object Network Databank (RRID:SCR_007433) Copy
http://mips.gsf.de/genre/proj/ustilago/
The MIPS Ustilago maydis Genome Database aims to present information on the molecular structure and functional network of the entirely sequenced, filamentous fungus Ustilago maydis. The underlying sequence is the initial release of the high quality draft sequence of the Broad Institute. The goal of the MIPS database is to provide a comprehensive genome database in the Genome Research Environment in parallel with other fungal genomes to enable in depth fungal comparative analysis. The specific aims are to: 1. Generate and assemble Whole Genome Shotgun sequence reads yielding 10X coverage of the U. maydis genome 2. Integrate the genomic sequence assembly with physical maps generated by Bayer CropScience 3. Perform automated annotation of the sequence assembly 4. Align the strain 521 assembly with the FB1 assembly provided by Exelixis 5. Release the sequence assembly and results of our annotation and analysis to public Ustilago maydis is a basidiomycete fungal pathogen of maize and teosinte. The genome size is approximately 20 Mb. The fungus induces tumors on host plants and forms masses of diploid teliospores. These spores germinate and form haploid meiotic products that can be propagated in culture as yeast-like cells. Haploid strains of opposite mating type fuse and form a filamentous, dikaryotic cell type that invades plant tissue to reinitiate infection. Ustilago maydis is an important model system for studying pathogen-host interactions and has been studied for more than 100 years by plant pathologists. Molecular genetic research with U. maydis focuses on recombination, the role of mating in pathogenesis, and signaling pathways that influence virulence. Recently, the fungus has emerged as an excellent experimental model for the molecular genetic analysis of phytopathogenesis, particularly in the characterization of infection-specific morphogenesis in response to signals from host plants. Ustilago maydis also serves as an important model for other basidiomycete plant pathogens that are more difficult to work with in the laboratory, such as the rust and bunt fungi. Genomic sequence of U. maydis will also be valuable for comparative analysis of other fungal genomes, especially with respect to understanding the host range of fungal phytopathogens. The analysis of U. maydis would provide a framework for studying the hundreds of other Ustilago species that attack important crops, such as barley, wheat, sorghum, and sugarcane. Comparisons would also be possible with other basidiomycete fungi, such as the important human pathogen C. neoformans. Commercially, U. maydis is an excellent model for the discovery of antifungal drugs. In addition, maize tumors caused by U. maydis are prized in Hispanic cuisine and there is interest in improving commercial production. The complete putative gene set of the Broad Institute''s second release is loaded into the database and in addition all deviating putative genes from a putative gene set produced by MIPS with different gene prediction parameters are also loaded. The complete dataset will then be analysed, gene predictions will be manually corrected due to combined information derived from different gene prediction algorithms and, more important, protein and EST comparisons. Gene prediction will be restricted to ORFs larger than 50 codons; smaller ORFs will be included only if similarities to other proteins or EST matches confirm their existence or if a coding region was postulated by all prediction programs used. The resulting proteins will be annotated. They will be classified according to the MIPS classification catalogue receiving appropriate descriptions. All proteins with a known, characterized homolog will be automatically assigned to functional categories using the MIPS functional catalog. All extracted proteins are in addition automatically analysed and annotated by the PEDANT suite.
Proper citation: MIPS Ustilago maydis Database (RRID:SCR_007563) Copy
http://projects.tcag.ca/humandup/
THIS RESOURCE IS NO LONGER IN SERVICE, documented on July 17, 2013. It contains information about segmental duplications in the human genome. The criteria used to identify regions of segmental duplication are: Sequence identity of at least 90, Sequence length of at least 5 kb, Not be entirely composed of repetitive elements. Background Previous studies have suggested that recent segmental duplications, which are often involved in chromosome rearrangements underlying genomic disease, account for some 5 of the human genome. We have developed rapid computational heuristics based on BLAST analysis to detect segmental duplications, as well as regions containing potential sequence misassignments in the human genome assemblies. Results Our analysis of the June 2002 public human genome assembly revealed that 107.4 of 3,043.1 megabases (Mb) (3.53) of sequence contained segmental duplications, each with size equal or more than 5 kb and 90 identity. We have also detected that 38.9 Mb (1.28) of sequence within this assembly is likely to be involved in sequence misassignment errors. Furthermore, we have identified a significant subset (199,965 of 2,327,473 or 8.6) of single-nucleotide polymorphisms (SNPs) in the public databases that are not true SNPs but are potential paralogous sequence variants. Conclusion Using two distinct computational approaches, we have identified most of the sequences in the human genome that have undergone recent segmental duplications. Near-identical segmental duplications present a major challenge to the completion of the human genome sequence. Potential sequence misassignments detected in this study would require additional efforts to resolve. The segmental duplication data and summary statistics are available for download. Data for Human Genome (based on the May 2004 Human Genome Assembly (hg17)) Visualize duplication relationships in GBrowse (GBrowse) Duplicon Pair relationships (GFF) Genes within duplication regions (HTML) Genome duplication content (MS Excel) The segmental duplication data can be visualized in a genome browser in the GBrowse section. Selected human genome annotation tracks (except the segmental duplication track) have also been obtained from UCSC and loaded into the genome browser. Detailed information (e.g. overlapping genes, overlapping clones, detailed alignment) can be obtained by clicking on a duplication cluster in GBrowse. Both keyword search and BLAT search are available. Analyses based on previous human genome assemblies can be found in the Previous Analyses section. Acknowledgments We thank The Centre for Applied Genomics at the Hospital for Sick Children (HSC) as well as collaborators worldwide. Supported by Genome Canada the Howard Hughes Medical Institute International Scholar Program (to S.W.S.) and the HSC Foundation.
Proper citation: Human Genome Segmental Duplication Database (RRID:SCR_007728) Copy
SYSTERS is a database of protein sequences grouped into homologous families and superfamilies. The SYSTERS project aims to provide a meaningful partitioning of the whole protein sequence space by a fully automatic procedure. A refined two-step algorithm assigns each protein to a family and a superfamily. The sequence data underlying SYSTERS release 4 now comprise several protein sequence databases derived from completely sequenced genomes (ENSEMBL, TAIR, SGD and GeneDB), in addition to the comprehensive Swiss-Prot/TrEMBL databases. To augment the automatically derived results, information from external databases like Pfam and Gene Ontology are added to the web server. Furthermore, users can retrieve pre-processed analyses of families like multiple alignments and phylogenetic trees. New query options comprise a batch retrieval tool for functional inference about families based on automatic keyword extraction from sequence annotations. A new access point, PhyloMatrix, allows the retrieval of phylogenetic profiles of SYSTERS families across organisms with completely sequenced genomes. Gene, Human, Vertebrate, Genome, Human ORFs
Proper citation: SYSTERS (RRID:SCR_007955) Copy
The official compendium for the Anatomical Therapeutic Chemical Classification System (ATC)-code descriptions. The Centre's main tasks are development and maintenance of the ATC/DDD system, including: * To classify drugs according to the ATC system. * Priority will be given to the classification of single substances, while combination products available internationally (i.e. important fixed combinations) will be dealt with as far as possible. * To establish DDDs for drugs which have been assigned an ATC code. * To review and revise as necessary the ATC classification system and DDDs. * To stimulate and influence the practical use of the ATC system by co-operating with researchers in the drug utilization field. Support: The WHO Collaborating Centre for Drug Statistics Methodology was established in 1982. The Centre is situated in Oslo at the Norwegian Institute of Public Health. The Centre is funded by the Norwegian government.
Proper citation: WHO Collaborating Centre for Drug Statistics Methodology (RRID:SCR_000677) Copy
http://cddb.nhlbi.nih.gov/cddb/
THIS RESOURCE IS NO LONGER IN SERVICE, documented on July 16, 2013. This database is intended to serve as a learning tool to obtain curated information for the design of microarray targets to scan collecting duct tissues (human, rat, mouse). The database focuses on regulatory and transporter proteins expressed in the collecting duct, but when collecting duct proteins are a member of a larger family of proteins, common additional members of the family are included even if they have not been demonstrated to be expressed in the collecting duct. An Internet-accessible database has been devised for major collecting duct proteins involved in transport and regulation of cellular processes. The individual proteins included in this database are those culled from literature searches and from previously published studies involving cDNA arrays and serial analysis of gene expression (SAGE). Design of microarray targets for the study of kidney collecting duct tissues is facilitated by the database, which includes links to curated base pair and amino acid sequence data, relevant literature, and related databases. Use of the database is illustrated by a search for water channel proteins, aquaporins, and by a subsequent search for vasopressin receptors. Links are shown to the literature and to sequence data for human, rat, and mouse, as well as to relevant web-based resources. Extension of the database is dynamic and is done through a maintenance interface. This permits creation of new categories, updating of existing entries, and addition of new ones. CDDB is a database that organizes lists of genes found in collecting duct tissues from three mammalian species: human, rat, and mouse. Proteins are divided into categories by family relationships and functional classification, and each category is assigned a section in the database. Each section includes links to the literature and to sequence information for genes, proteins, expressed sequence tags, and related information. The user can peruse a section or use a search engine at the bottom of the web page to search the database for a name or abbreviation or for a link to a sequence. Each entry in the database includes links to relevant papers in the kidney and collecting duct literature. It uses links to PubMed to generate MEDLINE searches for retrieval of references. In addition, each entry includes links to curated sequence data available in LocusLink. Individual links are made to sequence and protein data for human, rat, and mouse. Links are then added as curated sequences become available for proteins identified in the renal collecting duct and for proteins identified in kidney and similar in function or homologous to proteins identified in the collecting duct.
Proper citation: Collecting Duct Database (RRID:SCR_000759) Copy
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the NIF Resources search. From here you can search through a compilation of resources used by NIF and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that NIF has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on NIF then you can log in from here to get additional features in NIF such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into NIF you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the sources that were queried against in your search that you can investigate further.
Here are the categories present within NIF that you can filter your data on
Here are the subcategories present within this category that you can filter your data on
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.