Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
http://swissregulon.unibas.ch/fcgi/sr/swissregulon
A database of genome-wide annotations of regulatory sites. The predictions are based on Bayesian probabilistic analysis of a combination of input information including: * Experimentally determined binding sites reported in the literature. * Known sequence-specificities of transcription factors. * ChIP-chip and ChIP-seq data. * Alignments of orthologous non-coding regions. Predictions were made using the PhyloGibbs, MotEvo, IRUS and ISMARA algorithms developed in their group, depending on the data available for each organism. Annotations can be viewed in a Gbrowse genome browser and can also be downloaded in flat file format.
Proper citation: SwissRegulon (RRID:SCR_005333) Copy
A publicly available database of Transposed elements (TEs) which are located within protein-coding genes of 7 organisms: human, mouse, chicken, zebrafish, fruilt fly, nematode and sea squirt. Using TranspoGene the user can learn about the many aspects of the effect these TEs have on their hosting genes, such as: exonization events (including alternative splicing-related data), insertion of TEs into introns, exons, and promoters, specific location of the TE over the gene, evolutionary divergence of the TE from its consensus sequence and involvement in diseases. TranspoGene database is quickly searchable through its website, enables many kinds of searches and is available for download. TranspoGene contains information regarding specific type and family of the TEs, genomic and mRNA location, sequence, supporting transcript accession and alignment to the TE consensus sequence. The database also contains host gene specific data: gene name, genomic location, Swiss-Prot and RefSeq accessions, diseases associated with the gene and splicing pattern. The TranspoGene and microTranspoGene databases can be used by researchers interested in the effect of TE insertion on the eukaryotic transcriptome.
Proper citation: TranspoGene (RRID:SCR_005634) Copy
A knowledgebase of Biochemically, Genetically and Genomically structured genome-scale metabolic network reconstructions. BiGG integrates several published genome-scale metabolic networks into one resource with standard nomenclature which allows components to be compared across different organisms. BiGG can be used to browse model content, visualize metabolic pathway maps, and export SBML files of the models for further analysis by external software packages. Users may follow links from BiGG to several external databases to obtain additional information on genes, proteins, reactions, metabolites and citations of interest.
Proper citation: BiGG Database (RRID:SCR_005809) Copy
http://h-invitational.jp/varygene/
It consists of a Genome Browser, an LD Search System, and the VaryGene 2 system. The Generic Genome Browser is a combination of database and interactive Web page for manipulating and displaying annotations on genomes, while LDSearchSystem is a search system for linkage disequilibrium (LD) bins. VaryGene 2 is a system to search, display, and download our research results on human polymorphism based on publicly available data and annotations of transcripts presented by H-InvDB. VaryGene 2 provides information about single nucleotide polymorphisms (SNPs), deletion-insertion polymorphisms (DIPs), short tandem repeats (STRs), single amino acid repeats (SARs), structural variation (or copy number variations: CNVs), and their relations to the genome, transcripts, and functional domains. Users can search by polymorphisms, transcripts, STRs/SARs, and CNVs.
Proper citation: VarySysDB (RRID:SCR_005880) Copy
This database presents the entire DNA sequence of the first diploid genome sequence of a Han Chinese, a representative of Asian population. The genome, named as YH, represents the start of YanHuang Project, which aims to sequence 100 Chinese individuals in 3 years. It was assembled based on 3.3 billion reads (117.7Gbp raw data) generated by Illumina Genome Analyzer. In total of 102.9Gbp nucleotides were mapped onto the NCBI human reference genome (Build 36) by self-developed software SOAP (Short Oligonucleotide Alignment Program), and 3.07 million SNPs were identified. The personal genome data is illustrated in a MapView, which is powered by GBrowse. A new module was developed to browse large-scale short reads alignment. This module enabled users track detailed divergences between consensus and sequencing reads. In total of 53,643 HGMD recorders were used to screen YH SNPs to retrieve phenotype related information, to superficially explain the donor's genome. Blast service to align query sequences against YH genome consensus was also provided.
Proper citation: YanHuang Project (RRID:SCR_006077) Copy
THIS RESOURCE IS NO LONGER IN SERVICE, documented on July 15, 2013. TRIPLES provides full public access to the data and reagents generated from ongoing functional analysis of the yeast genome. Using a novel transposon-tagging approach, we have analyzed disruption phenotypes, gene expression, and protein localization on a genome-wide scale in Saccharomyces. The data generated from this study may be accessed through our database, TRIPLES ; additionally, all reagents generated in this study are freely available from on-line order forms (linked to TRIPLES as well). multipurpose, mini-transposon, mutant alleles, phenotypes, protein localization, gene expression, Saccharomyces cerevisiae, Web-accessible database, transposon-mutagenized yeast strains, downloaded, tab-delimited, text file, protein localization data, fluorescent micrographs, staining patterns, indirect immunofluorescence analysis of indicated epitope-tagged proteins, subcellular localization of the yeast proteome, visual library, Nucleic Acid Sequence Data Library (GenBank), clone report, graphic map, transposon insertions (represented as flags)
Proper citation: TRIPLES- a database of TRansposon-Insertion Phenotypes Localization and Expression in Saccharomyces (RRID:SCR_005714) Copy
http://igdb.nsclc.ibms.sinica.edu.tw/
IGDB.NSCLC database is aiming to facilitate and prioritize identified lung cancer genes and microRNAs for pathological and mechanistic studies of lung tumorigenesis and for developing new strategies for clinical interventions. We integrated and curated various lung cancer genomic datasets to present # lung cancer genes with somatic mutations, experimental supports and statistic significance in association with clinicopathological features; # genomic alterations with copy number alterations (CNA) detected by high density SNP arrays, gain or loss regions detected by arrayed comparative genome hybridization (aCGH), and loss of heterozygosity (LOH) detected by microsatellite markers; # aberrant expression of genes and microRNAs detected by various microarrays. IGDB.NSCLC database provides user friendly interfaces and searching functions to display multiple layers of evidence for detecting lung cancer target genes and microRNAs, especially emphasizing on concordant alterations: # genes with altered expression located in the CNA regions; # microRNAs with altered expression located in the CNA regions; # somatic mutation genes located in the CNA regions; and # genes associated with clinicopathological features located in the CNA regions. These concordant altered genes and miRNAs should be prioritized for further basic and clinical studies.
Proper citation: IGDB.NSCLC (RRID:SCR_006048) Copy
Society that develop standards for biological research data quality, annotation and exchange. They facilitate the creation and use of software tools that build on these standards and allow researchers to annotate and share their data easily. They promote scientific discovery that is driven by genome wide and other biological research data integration and meta-analysis. Historically, FGED began with a focus on microarrays and gene expression data. However, the scope of FGED now includes data generated using any technology when applied to genome-scale studies of gene expression, binding, modification and other related applications.
Proper citation: FGED (RRID:SCR_001897) Copy
http://img.jgi.doe.gov/cgi-bin/m/main.cgi
Resource for analysis and annotation of genome and metagenome datasets in comprehensive comparative context. IMG provides users with tools for analyzing publicly available genome datasets and metagenome datasets.
Proper citation: IMG System (RRID:SCR_002965) Copy
http://projects.tcag.ca/xenodup/
THIS RESOURCE IS NO LONGER IN SERVICE, documented on July 16, 2013. It contains information about segmental duplications in the genomes of chimpanzee, mouse, and rat. The criteria used to identify regions of segmental duplication are: * Sequence identity of at least 90% * Sequence length of at least 5 kb * Not be entirely composed of repetitive elements. BACKGROUND: The high quality of the mouse genome draft sequence and its associated annotations are an invaluable biological resource. Identifying recent duplications in the mouse genome, especially in regions containing genes, may highlight important events in recent murine evolution. In addition, detecting recent sequence duplications can reveal potentially problematic regions of the genome assembly. We use BLAST-based computational heuristics to identify large (>/= 5 kb) and recent (>/= 90% sequence identity) segmental duplications in the mouse genome sequence. Here we present a database of recently duplicated regions of the mouse genome found in the mouse genome sequencing consortium (MGSC) February 2002 and February 2003 assemblies. RESULTS: We determined that 33.6 Mb of 2,695 Mb (1.2%) of sequence from the February 2003 mouse genome sequence assembly is involved in recent segmental duplications, which is less than that observed in the human genome (around 3.5-5%). From this dataset, 8.9 Mb (26%) of the duplication content consisted of "unmapped" chromosome sequence. Moreover, we suspect that an additional 18.5 Mb of sequence is involved in duplication artifacts arising from sequence misassignment errors in this genome assembly. By searching for genes that are located within these regions, we identified 675 genes that mapped to duplicated regions of the mouse genome. Sixteen of these genes appear to have been duplicated independently in the human genome. From our dataset we further characterized a 42 kb recent segmental duplication of Mater, a maternal-effect gene essential for embryogenesis in mice. CONCLUSION: Our results provide an initial analysis of the recently duplicated sequence and gene content of the mouse genome. Many of these duplicated loci, as well as regions identified to be involved in potential sequence misassignment errors, will require further mapping and sequencing to achieve accuracy. A Genome Browser database was set up to display the identified duplication content presented in this work. This data will also be relevant to the growing number of investigators who use the draft genome sequence for experimental design and analysis. The segmental duplication data and summary statistics are available for download and can also be visualized in a genome browser in the GBrowse section. Selected annotation tracks (except the segmental duplication track) have also been obtained from UCSC and loaded into the genome browser. Detailed information (e.g. overlapping genes, overlapping clones, detailed alignment) can be obtained by clicking on a duplication cluster in GBrowse. Both keyword search and BLAT search are available. Analyses based on previous genome assemblies can be found in the Previous Analyses section. Recent Developments The Non-Human Genome Segmental Duplication Database is continually updated including the archived copies of the analysis of all previous genome assemblies and will include all new species as they become available. Acknowledgments We thank The Centre for Applied Genomics at the Hospital for Sick Children (HSC) as well as collaborators worldwide. Supported by Genome Canada the Howard Hughes Medical Institute International Scholar Program (to S.W.S.) and the HSC Foundation.
Proper citation: Non-Human Genome Segmental Duplication Database (RRID:SCR_000470) Copy
Database and integrated tools to improve annotation of the bovine genome and to integrate the genome sequence with other genomics data.
Proper citation: Bovine Genome Database (RRID:SCR_000148) Copy
Database and browser that provides a central resource to archive and display association between genetic variation and high-throughput molecular-level phenotypes. This effort originated with the NIH GTEx roadmap project: however the scope of this resource will be extended to include any available genotype/molecular phenotype datasets.
Proper citation: GTEx eQTL Browser (RRID:SCR_001618) Copy
https://fungi.ensembl.org/Neurospora_crassa/Info/Index
It's strategy involves Whole Genome Shotgun (WGS) sequencing, in which sequence from the entire genome is generated and reassembled. This method is standard for microbial genome sequencing, and has been successfully applied to Drosophila. Neurospora is an ideal candidate for this approach because of the low repeat content of the genome. Neurospora crassa Database has expanded the scope of its database by including a mitochondrial annotation, incorporating information from the Neurospora compendium, and assigning NCU numbers to tRNA and rRNAs. They have improved the annotation process to predict untranslated regions and to reduce the number of spurious predictions. As a result, version 3 contains 9,826 genes, 794 fewer than version 2. During the initial phase of a WGS project they sequence both ends of the 4 kb inserts from a plasmid library prepared using randomly sheared and sized-selected DNA. The shotgun reads are assembled by recognizing overlapping regions of sequence and making use of the knowledge of the orientation and distance of the paired reads from each plasmid. Obtaining deep sequence coverage though high levels of sequence redundancy assures that the majority of the genome is represented in the initial assembly and that the consensus sequence is of high quality. Their approach toward the initial assembly was conservative, meaning they would rather fail to join sequence contigs that might overlap each other than risk making false joins between two closely related but non-overlapping genomic regions. Hence, the initial assembly contains many sequence contigs and over time these contigs will increase in size and decrease in number as they are joined together. After shotgun sequencing and assembly there was a second phase of sequencing in which additional sequence was obtained from specific regions that were missing from the original assembly or are recognized to be of low quality in the consensus. The Neurospora crassa sequencing project reflects a close collaboration between the Broad Institute and the Neurospora research community. Principal investigators include Bruce Birren and Chad Nusbaum from the Broad Institute, Matt Sachs at the Oregon Graduate Institute of Science and Technology, Chuck Staben at the University of Kentucky and Jak Kinsey at the Fungal Genetics Stock Center at the University of Kansas Medical Center. In addition, we have a larger Advisory Board made up of a number of Neurospora researchers. Sponsors: They have been funded by the National Science Foundation to sequence the N. crassa genome and make the information publicly available.
Proper citation: Neurospora crassa Database (RRID:SCR_001372) Copy
http://mech.ctb.pku.edu.cn/protisa/
Database of confirmed translation initiation sites (TISs) for prokaryotic genomes. The confirmed data has supporting evidence from different sources, including experiments records in the public protein database Swiss-Prot, literature, conserved domain search and sequence alignment among orthologous genes. Combing with predictions from the-state-of-the-art TIS predictor MED-Start/MED-StartPlus (in release 1.0 & 1.2) and TriTISA (since release 1.4) and annotations on potential regulatory signals, the database can serve as a refined annotation resource for the public database RefSeq.
Proper citation: ProTISA (RRID:SCR_002138) Copy
http://megasun.bch.umontreal.ca/ogmpproj.html
THIS RESOURCE IS NO LONGER IN SERVICE. Documented on October 28,2025. It investigates mitochondrial genome diversity and evolution by systematically determining the complete mitochondrial DNA (mtDNA) sequences of a phylogenetically broad selection of protists. The mtDNAs of lower fungi and choanoflagellates are being analyzed by the Fungal Mitochondrial Genome Project (FMGP), a sister project to the OGMP.
Proper citation: Organelle Genomics (RRID:SCR_002137) Copy
http://pallab.serc.iisc.ernet.in/gester/
Database of intrinsic terminators of transcription that is comprized of >2,200,000 bacterial terminators identified from a total of 2036 chromosomes and 1508 plasmids. Information about structural parameters of individual terminators such as sequence, length of stem and loop, mismatches and gaps, U-trail, genomic coordinates and gene name and accession number is available in both tabular form and as a composite figure. Summary statistics for terminator profiles of whole genome can be also obtained. Raw data files for individual genomes can be downloaded (.zip files) for detailed investigations. Data is organized into different tiers such that users can fine-tune their search by entering name of the species, or taxon ID or genomes with a certain number of terminators. To visualize the occurrence of the terminators, an interactive map, with the resolution to single gene level, has been developed.
Proper citation: WebGeSTer DB (RRID:SCR_002165) Copy
Database to retrieve and compare gene expression patterns between animal species. Bgee first maps heterogeneous expression data (currently bulk RNA-Seq, scRNA-Seq, Affymetrix, in situ hybridization, and EST data) to anatomy and development of different species. Bgee is based exclusively on curated healthy wild-type expression data (e.g., no gene knock-out, no treatment, no disease), to provide a comparable reference of gene expression.
Proper citation: Bgee: dataBase for Gene Expression Evolution (RRID:SCR_002028) Copy
https://enigma.lbl.gov/regprecise/
Collection of manually curated inferences of regulons in prokaryotic genomes. Database for capturing, visualization and analysis of transcription factor regulons that were reconstructed by comparative genomic approach in wide variety of prokaryotic genomes., THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025.
Proper citation: RegPrecise (RRID:SCR_002149) Copy
http://giladlab.uchicago.edu/orthoExon/
THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 23,2022. Database of orthologous exon regions in the genomes of human, chimpanzee, and rhesus macaque. It can be used in analysis of multi-species RNA-seq expression data, allowing for comparisons of exon-level expression across primates, as well as comparative examination of alternative splicing and transcript isoforms.
Proper citation: Primate Orthologous Exon Database (RRID:SCR_002065) Copy
http://www.nih.gov/science/models/rat/
The Rat Genome Program was launched after the National Institutes of Health (NIH) realized the potential of rat models in understanding basic biology and human health and disease. The purpose of this NIH Rat Genomics and Genetics web site is to serve as a central point for information on NIH sponsored and related rat genetic and genomic activities and resources. It will provide information on: the follow up to recommendations made to the NIH; funding opportunities for rat genomic and genetic tools and resources; major rat genomic resources available and/or produced in response to the NIH Rat Program; courses and meetings related to rat genomics and genetics; and selected reports and publications. These programs have produced a wide variety of resources and a way to link and capitalize upon the data and resources of other model organisms and the human. In conjunction with and in addition to these programs, the NIH, through the RGWG, has convened advisory groups and workshops to discuss the opportunities that rat models offer and provide recommendations on the investments that are needed to capitalize on these opportunities.
Proper citation: NIH Rat Genomics and Genetics (RRID:SCR_002267) Copy
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the dkNET Resources search. From here you can search through a compilation of resources used by dkNET and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that dkNET has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on dkNET then you can log in from here to get additional features in dkNET such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into dkNET you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the sources that were queried against in your search that you can investigate further.
Here are the categories present within dkNET that you can filter your data on
Here are the subcategories present within this category that you can filter your data on
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.