SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
http://www.sanger.ac.uk/Projects/Fungi/
Fungal genomes available from the Sanger Institute. Data are accessible in a number of ways; for each organism there is a BLAST server, allowing search of the sequences. Sequences can also be down-loaded directly by FTP. In addition, for those organisms being sequenced using a cosmid approach, finished and annotated cosmids are submitted to EMBL and other public databases.
Proper citation: Fungi Sequencing Projects (RRID:SCR_008524) Copy
http://www.informatics.jax.org/genes.shtml
Searchable database of mouse genes, DNA segments, cytogenetic markers and QTLs. MGI provides access to integrated data on mouse genes and genome features, from sequences and genomic maps to gene expression and disease models.
Proper citation: Genes, Genome Features and Maps (RRID:SCR_017524) Copy
Data collection of large scale genome wide DNA methylation analysis of 1,000 mother-child pairs at serial time points across life course (ARIES).
Proper citation: mqtldb (RRID:SCR_018002) Copy
http://www.unil.ch/comparativegenometrics/
The Comparative Genometrics website displays for sequenced genomes, three different genometric analyses: the DNA walk and the GC and TA skews during the initial phase. Although primarily focused on prokaryotic chromosomes, the CG website posts genometric information on paradigm plasmids, phages, viruses, and organelles. The genometric analyses are available via phylogenetic tree or alphabetical list. It also offers small genome information, for mitochondria, chloroplasts, viruses, bacteriophages, and plasmids.
Proper citation: Comparative Genometrics (RRID:SCR_012920) Copy
http://www.sanger.ac.uk/Projects/D_rerio/
Database of zebrafish genome.
Proper citation: Zebrafish Genome Project (RRID:SCR_013157) Copy
A database of phylogenetic trees of animal genes. It aims at developing a curated resource that gives reliable information about ortholog and paralog assignments, and evolutionary history of various gene families. TreeFam defines a gene family as a group of genes that evolved after the speciation of single-metazoan animals. It also tries to include outgroup genes like yeast (S. cerevisiae and S. pombe) and plant (A. thaliana) to reveal these distant members.TreeFam is also an ortholog database. Unlike other pairwise alignment based ones, TreeFam infers orthologs by means of gene trees. It fits a gene tree into the universal species tree and finds historical duplications, speciations and losses events. TreeFam uses this information to evaluate tree building, guide manual curation, and infer complex ortholog and paralog relations.The basic elements of TreeFam are gene families that can be divided into two parts: TreeFam-A and TreeFam-B families. TreeFam-B families are automatically created. They might contain errors given complex phylogenies. TreeFam-A families are manually curated from TreeFam-B ones. Family names and node names are assigned at the same time. The ultimate goal of TreeFam is to present a curated resource for all the families. phylogenetic tree, animal, vertebrate, invertebrate, gene, ortholog, paralog, evolutionary history, gene families, single-metazoan animals, outgroup genes like yeast (S. cerevisiae and S. pombe), plant (A. thaliana), historical duplications, speciations, losses, Human, Genome, comparative genomics
Proper citation: Tree families database (RRID:SCR_013401) Copy
THIS RESOURCE IS NO LONGER IN SERVICE,documented on August 16, 2019. Fugu genome is among the smallest vertebrate genomes and has proved to be a valuable reference genome for identifying genes and other functional elements such as regulatory elements in the human and other vertebrate genomes, and for understanding the structure and evolution of vertebrate genomes. This site presents version 4 of the Fugu genome, released in October 2004 by the International Fugu Genome Consortium. Fugu rubripes has a very compact genome, with less than 15 consisting of dispersed repetitive sequence, which makes it ideal for gene discovery. A draft sequence of the fugu genome was determined by the International Fugu Genome Consortium in 2002 using the ''whole-genome shotgun'' sequencing strategy. Fugu is the second vertebrate genome to be sequenced, the first being the human genome. This webpage presents the annotation made on the fourth assembly by the IMCB team using the Ensembl annotation pipeline. We are continuing with the gap filling work and linking of the scaffolds to obtain super-contigs.
Proper citation: Fugu Genome Project (RRID:SCR_013014) Copy
http://dorina.mdc-berlin.de/rbp_browser/dorina.html
In animals, RNA binding proteins (RBPs) and microRNAs (miRNAs) post-transcriptionally regulate the expression of virtually all genes by binding to RNA. Recent advances in experimental and computational methods facilitate transcriptome-wide mapping of these interactions. It is thought that the combinatorial action of RBPs and miRNAs on target mRNAs form a post-transcriptional regulatory code. We provide a database that supports the quest for deciphering this regulatory code. Within doRiNA, we are systematically curating, storing and integrating binding site data for RBPs and miRNAs. Users are free to take a target (mRNA) or regulator (RBP and/or miRNA) centric view on the data. We have implemented a database framework with short query response times for complex searches (e.g. asking for all targets of a particular combination of regulators). All search results can be browsed, inspected and analyzed in conjunction with a huge selection of other genome-wide data, because our database is directly linked to a local copy of the UCSC genome browser. At the time of writing, doRiNA encompasses RBP data for the human, mouse and worm genomes. For computational miRNA target site predictions, we provide an update of PicTar predictions.
Proper citation: doRiNA (RRID:SCR_013222) Copy
http://proline.bic.nus.edu.sg/dedb/
Database on Drosophila melanogaster exons presented in a splicing graph form. Data is based on release 3.2 of the Drosophila melanogaster genome annotations available at FlyBase. The gene structure information extracted from the annotations were checked, clustered and transformed into splicing graph. The splicing graph form of the gene constructs were then used for classification of the various types of alternative splicing events. In addition, Pfam domains were mapped onto the gene structure. Users can query the database using the query page using BLAST, FlyBase Gene Name, FlyBase Gene Symbol, Pfam Accession Number and Pfam Identifier. This allows users to determine the Drosophila melanogaster homology of their gene using a BLAST search and to visualize the alternative splicing variants if any. Users can also determine genes containing a particular domain using the Pfam Accession Numbers and Identifiers.
Proper citation: Drosophila melanogaster Exon Database (RRID:SCR_013441) Copy
A genome and functional genomic database for the protozoan parasite Toxoplasma gondii. It incorporates the sequence and annotation of the T. gondii ME49 strain, as well as genome sequences for the GT1, VEG and RH (Chr Ia, Chr Ib) strains. Sequence information is integrated with various other genomic-scale data, including community annotation, ESTs, gene expression and proteomics data. Organisms * Toxoplasma gondii (ME49, RH, GT1, Veg strains) * Neospora caninum * environmental isolate sequences from numerous species Tools * BLAST: Identify Sequence Similarities * Sequence Retrieval: Retrieve Specific Sequences using IDs and coordinates * PubMed and Entrez: View the Latest Toxoplasma, Neospora Pubmed and Entrez Results * Genome Browser: View Sequences and Features in the genome browser * Ancillary Genome Browse: Access Additional info like Probeset data and Toxoplasma Array info
Proper citation: ApiDB ToxoDB (RRID:SCR_013453) Copy
Database that provide a genomic information and comparative genomics platform on sea urchins and related echinoderms. It provide collection of information to directly support experimental work on these useful research models in cell and developmental biology.
Proper citation: EchinoBase (RRID:SCR_013732) Copy
This site has been developed by Kazusa DNA Research Institute for the purpose of offering the science community the analyzed sequence data produced by a multi-national Arabidopsis genome sequencing project coordinated by the Arabidopsis Genome Initiatives (AGI). The aim of this service is to enable users to browse the annotated sequence data produced by all the sequencing teams of AGI through an user-friendly graphic display system and search engines. Gene structures proposed on the annotated sequences as well as those predicted by computer programs are presented and each graphic item has a hyperlink to detailed information of the corresponding area. The nucleotide sequence data deposited in GenBank by AGI was downloaded, re-computer-analyzed at Kazusa and parsed results are displayed graphically.
Proper citation: Kazusa Arabidopsis data opening site (RRID:SCR_013511) Copy
Database of compiled, public, deep sequencing miRNA data and several novel tools to facilitate exploration of massive data. The miR-seq browser supports users to examine short read alignment with the secondary structure and read count information available in concurrent windows. Features such as sequence editing, sorting, ordering, import and export of user data are of great utility for studying iso-miRs, miRNA editing and modifications. miRNA����??target relation is essential for understanding miRNA function. Coexpression analysis of miRNA and target mRNAs, based on miRNA-seq and RNA-seq data from the same sample, is visualized in the heat-map and network views where users can investigate the inverse correlation of gene expression and target relations, compiled from various databases of predicted and validated targets.
Proper citation: miRGator (RRID:SCR_007793) Copy
http://projects.tcag.ca/humandup/
THIS RESOURCE IS NO LONGER IN SERVICE, documented on July 17, 2013. It contains information about segmental duplications in the human genome. The criteria used to identify regions of segmental duplication are: Sequence identity of at least 90, Sequence length of at least 5 kb, Not be entirely composed of repetitive elements. Background Previous studies have suggested that recent segmental duplications, which are often involved in chromosome rearrangements underlying genomic disease, account for some 5 of the human genome. We have developed rapid computational heuristics based on BLAST analysis to detect segmental duplications, as well as regions containing potential sequence misassignments in the human genome assemblies. Results Our analysis of the June 2002 public human genome assembly revealed that 107.4 of 3,043.1 megabases (Mb) (3.53) of sequence contained segmental duplications, each with size equal or more than 5 kb and 90 identity. We have also detected that 38.9 Mb (1.28) of sequence within this assembly is likely to be involved in sequence misassignment errors. Furthermore, we have identified a significant subset (199,965 of 2,327,473 or 8.6) of single-nucleotide polymorphisms (SNPs) in the public databases that are not true SNPs but are potential paralogous sequence variants. Conclusion Using two distinct computational approaches, we have identified most of the sequences in the human genome that have undergone recent segmental duplications. Near-identical segmental duplications present a major challenge to the completion of the human genome sequence. Potential sequence misassignments detected in this study would require additional efforts to resolve. The segmental duplication data and summary statistics are available for download. Data for Human Genome (based on the May 2004 Human Genome Assembly (hg17)) Visualize duplication relationships in GBrowse (GBrowse) Duplicon Pair relationships (GFF) Genes within duplication regions (HTML) Genome duplication content (MS Excel) The segmental duplication data can be visualized in a genome browser in the GBrowse section. Selected human genome annotation tracks (except the segmental duplication track) have also been obtained from UCSC and loaded into the genome browser. Detailed information (e.g. overlapping genes, overlapping clones, detailed alignment) can be obtained by clicking on a duplication cluster in GBrowse. Both keyword search and BLAT search are available. Analyses based on previous human genome assemblies can be found in the Previous Analyses section. Acknowledgments We thank The Centre for Applied Genomics at the Hospital for Sick Children (HSC) as well as collaborators worldwide. Supported by Genome Canada the Howard Hughes Medical Institute International Scholar Program (to S.W.S.) and the HSC Foundation.
Proper citation: Human Genome Segmental Duplication Database (RRID:SCR_007728) Copy
It provides a database based on a pre-computed similarity matrix covering the similarity space formed by >4 million amino acid sequences from public databases and completely sequenced genomes. The database is capable of handling very large datasets and is updated incrementally. For sequence similarity searches and pairwise alignments, we implemented a grid-enabled software system, which is based on FASTA heuristics and the Smith Waterman algorithm. SimpleSIMAP and AdvancedSIMAP retrieve homologs for given protein sequences that need to be contained in the SIMAP database. While SimpleSIMAP provides only selected parameters and preconfigured search spaces, the AdvancedSIMAP allows the user to specify search space, filtering and sorting parameters in a flexible manner. Both types of queries result in lists of homologs that are linked in turn to their homologs. So the web interfaces allow users to explore quickly and interactively the protein world by homology. Sponsors: SIMAP is supported by the Department of Genome Oriented Bioinformatics of the Technische Universitt Mnchen and the Institute for Bioinformatics of the GSF-National Research Center for Environment and Health.
Proper citation: SIMAP (RRID:SCR_007927) Copy
SYSTERS is a database of protein sequences grouped into homologous families and superfamilies. The SYSTERS project aims to provide a meaningful partitioning of the whole protein sequence space by a fully automatic procedure. A refined two-step algorithm assigns each protein to a family and a superfamily. The sequence data underlying SYSTERS release 4 now comprise several protein sequence databases derived from completely sequenced genomes (ENSEMBL, TAIR, SGD and GeneDB), in addition to the comprehensive Swiss-Prot/TrEMBL databases. To augment the automatically derived results, information from external databases like Pfam and Gene Ontology are added to the web server. Furthermore, users can retrieve pre-processed analyses of families like multiple alignments and phylogenetic trees. New query options comprise a batch retrieval tool for functional inference about families based on automatic keyword extraction from sequence annotations. A new access point, PhyloMatrix, allows the retrieval of phylogenetic profiles of SYSTERS families across organisms with completely sequenced genomes. Gene, Human, Vertebrate, Genome, Human ORFs
Proper citation: SYSTERS (RRID:SCR_007955) Copy
http://supfam.org/SUPERFAMILY/
SUPERFAMILY is a database of structural and functional protein annotations for all completely sequenced organisms. The SUPERFAMILY annotation is based on a collection of hidden Markov models, which represent structural protein domains at the SCOP superfamily level. A superfamily groups together domains which have an evolutionary relationship. The annotation is produced by scanning protein sequences from over 1,700 completely sequenced genomes against the hidden Markov models.
Proper citation: SUPERFAMILY (RRID:SCR_007952) Copy
http://www.thearkdb.org/arkdb/
This website contains the mapping sequence of poultry. The ArkDB database system aims to provide a comprehensive public repository for genome mapping data from farmed and other animal species. In doing so, it aims to provide a route in to genomic and other sequence from the initial viewpoint of linkage mapping, RH mapping, physical mapping or - possibly more importantly - QTL mapping data. It's supported, in part, by the USDA-CSREES National Animal Genome Research Program in order to serve the poultry genome mapping community. This system represents a complete rewrite of the original version with the code migrated to java and the underlying database targeted at postgres (although any standards-compliant database engine should suffice). The initial release records details of maps and the markers that they contain. There are alternative entry points that target either a chromosome or a specific mapping analysis as the starting point. Limited relationships between markers are recorded and displayed. As with the previous version, all maps are drawn using data extracted from the database on the fly.
Proper citation: ChickBase (RRID:SCR_008147) Copy
http://microbialgenomics.energy.gov/index.shtml
Through its Microbial Genome Program (MGP) and its Genomics:GTL (GTL) program, DOEs Office of Biological and Environmental Research (BER) has sequenced more than 485 microbial genomes and 30 microbial communities having specialized biological capabilities. Identifying these genes will help investigators discern how gene activities in whole living systems are orchestrated to solve myriad life challenges. The MGP was begun in 1994 as a spinoff from the Human Genome Program. The goal of the program was to sequence the genomes of a number of nonpathogenic microbes that would be useful in solving DOE''s mission challenges in environmental-waste cleanup, energy production, carbon cycling, and biotechnology. Past projects include microbial genome program, microbial cell project, and the Laboratory Science Program at the DOE Joint Genome Institute. The two ongoing projects are Genomics: GTL program and Community Sequencing Program at the DOE Joint Genome Institute. Sponsors: Site sponsored by the U.S. Department of Energy Office of Science, Office of Biological and Environmental Research, THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025.
Proper citation: Microbial Genomics Program (RRID:SCR_008140) Copy
http://mips.helmholtz-muenchen.de/genre/proj/mpcdb/
A database of manually annotated mammalian protein complexes. To obtain a high-quality dataset, information was extracted from individual experiments described in the scientific literature. Data from high-throughput experiments was not included.
Proper citation: Mammalian Protein Complex Data Base (RRID:SCR_008209) Copy
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the FDI Lab - SciCrunch.org Resources search. From here you can search through a compilation of resources used by FDI Lab - SciCrunch.org and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that FDI Lab - SciCrunch.org has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on FDI Lab - SciCrunch.org then you can log in from here to get additional features in FDI Lab - SciCrunch.org such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into FDI Lab - SciCrunch.org you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the sources that were queried against in your search that you can investigate further.
Here are the categories present within FDI Lab - SciCrunch.org that you can filter your data on
Here are the subcategories present within this category that you can filter your data on
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.