Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
Model organism database for the social amoeba Dictyostelium discoideum that provides the biomedical research community with integrated, high quality data and tools for Dictyostelium discoideum and related species. dictyBase houses the complete genome sequence, ESTs, and the entire body of literature relevant to Dictyostelium. This information is curated to provide accurate gene models and functional annotations, with the goal of fully annotating the genome to provide a ''''reference genome'''' in the Amoebozoa clade. They highlight several new features in the present update: (i) new annotations; (ii) improved interface with web 2.0 functionality; (iii) the initial steps towards a genome portal for the Amoebozoa; (iv) ortholog display; and (v) the complete integration of the Dicty Stock Center with dictyBase. The Dicty Stock Center currently holds over 1500 strains targeting over 930 different genes. There are over 100 different distinct amoebozoan species. In addition, the collection contains nearly 600 plasmids and other materials such as antibodies and cDNA libraries. The strain collection includes: * strain catalog * natural isolates * MNNG chemical mutants * tester strains for parasexual genetics * auxotroph strains * null mutants * GFP-labeled strains for cell biology * plasmid catalog The Dicty Stock Center can accept Dictyostelium strains, plasmids, and other materials relevant for research using Dictyostelium such as antibodies and cDNA or genomic libraries.
Proper citation: Dictyostelium discoideum genome database (RRID:SCR_006643) Copy
A fungal rDNA internal transcribed spacer (ITS) sequence database (although additional genes and genetic markers are also welcome) to facilitate identification of environmental samples of fungal DNA. Additional important features include user annotation of INSD sequences to add metadata on, e.g., locality, habitat, soil, climate, and interacting taxa. The user can furthermore annotate INSD sequences with additional species identifications that will appear in the results of any analyses done. UNITE focuses on high-quality ITS sequences generated from fruiting bodies collected and identified by experts and deposited in public herbaria. In addition, it also holds all fungal ITS sequences in the International Nucleotide Sequence Databases (INSD: NCBI, EMBL, DDBJ). Both sets of sequences may be used in any analyses carried out. UNITE is accompanied by a project management system called PlutoF, where users can store field data, document the sequencing lab procedures, manage sequences, and make analyses. PlutoF intends to make it possible for taxonomists, ecologists, and biogeographers to use a common platform for data storage, handling, and analyses, with the intent of facilitating an integration of these disciplines. A user can have an unlimited number of projects but still make analyses across any project data available to him.
Proper citation: UNITE (RRID:SCR_006518) Copy
A comparative platform for green plant genomics. Families of orthologous and paralogous genes that represent the modern descendents of ancestral gene sets are constructed at key phylogenetic nodes. These families allow easy access to clade specific orthology / paralogy relationships as well as clade specific genes and gene expansions. As of release v9.1, Phytozome provides access to forty-one sequenced and annotated green plant genomes which have been clustered into gene families at 20 evolutionarily significant nodes. Where possible, each gene has been annotated with PFAM, KOG, KEGG, and PANTHER assignments, and publicly available annotations from RefSeq, UniProt, TAIR, JGI are hyper-linked and searchable., THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025.
Proper citation: Phytozome (RRID:SCR_006507) Copy
DPVweb provides a central source of information about viruses, viroids and satellites of plants, fungi and protozoa. Comprehensive taxonomic information, including brief descriptions of each family and genus, and classified lists of virus sequences are provided. The database also holds detailed, curated, information for all sequences of viruses, viroids and satellites of plants, fungi and protozoa that are complete or that contain at least one complete gene. For comparative purposes, it also contains a single representative sequence of all other fully sequenced virus species with an RNA or single-stranded DNA genome. The start and end positions of each feature (gene, non-translated region and the like) have been recorded and checked for accuracy. As far as possible, nomenclature for genes and proteins are standardized within genera and families. Sequences of features (either as DNA or amino acid sequences) can be directly downloaded from the website in FASTA format. The sequence information can also be accessed via client software for PC computers (freely downloadable from the website) that enable users to make an easy selection of sequences and features of a chosen virus for further analyses. The public sequence databases contain vast amounts of data on virus genomes but accessing and comparing the data, except for relatively small sets of related viruses can be very time consuming. The procedure is made difficult because some of the sequences on these databases are incorrectly named, poorly annotated or redundant. The NCBI Reference Sequence project (1) provides a comprehensive, integrated, non-redundant set of sequences, including genomic DNA, transcript (RNA) and protein products, for major research organisms. This now includes curated information for a single sequence of each fully sequenced virus species. While this is a welcome development, it can only deal with complete sequences. An important feature of DPV is the opportunity to access genes (and other features) of multiple sequences quickly and accurately. Thus, for example, it is easy to obtain the nucleotide or amino acid sequences of all the available accessions of the coat protein gene of a given virus species or for a group of viruses. To increase its usefulness further, DPVweb also contains a single representative sequence of all other fully sequenced virus species with an RNA or single-stranded DNA (ssDNA) genome. Sponsors: This site is supported by the Association of Applied Biologists and the Zhejiang Academy of Agricultural Sciences, Hangzhou, People''s Republic of China.
Proper citation: Descriptions of Plant Viruses (RRID:SCR_006656) Copy
http://www.ncbi.nlm.nih.gov/projects/genome/assembly/grc/
Consortium that puts sequences into a chromosome context and provides the best possible reference assembly for human, mouse, and zebrafish via FTP. Tools to facilitate the curation of genome assemblies based on the sequence overlaps of long, high quality sequences.
Proper citation: Genome Reference Consortium (RRID:SCR_006553) Copy
http://compbio.dfci.harvard.edu/tgi/
THIS RESOURCE IS NO LONGER IN SERVICE, documented May 10, 2017. A pilot effort that has developed a centralized, web-based biospecimen locator that presents biospecimens collected and stored at participating Arizona hospitals and biospecimen banks, which are available for acquisition and use by researchers. Researchers may use this site to browse, search and request biospecimens to use in qualified studies. The development of the ABL was guided by the Arizona Biospecimen Consortium (ABC), a consortium of hospitals and medical centers in the Phoenix area, and is now being piloted by this Consortium under the direction of ABRC. You may browse by type (cells, fluid, molecular, tissue) or disease. Common data elements decided by the ABC Standards Committee, based on data elements on the National Cancer Institute''s (NCI''s) Common Biorepository Model (CBM), are displayed. These describe the minimum set of data elements that the NCI determined were most important for a researcher to see about a biospecimen. The ABL currently does not display information on whether or not clinical data is available to accompany the biospecimens. However, a requester has the ability to solicit clinical data in the request. Once a request is approved, the biospecimen provider will contact the requester to discuss the request (and the requester''s questions) before finalizing the invoice and shipment. The ABL is available to the public to browse. In order to request biospecimens from the ABL, the researcher will be required to submit the requested required information. Upon submission of the information, shipment of the requested biospecimen(s) will be dependent on the scientific and institutional review approval. Account required. Registration is open to everyone.. Documented on August 19,2019.The goal of The Gene Index Project is to use the available Expressed Sequence Transcript (EST) and gene sequences, along with the reference genomes wherever available, to provide an inventory of likely genes and their variants and to annotate these with information regarding the functional roles played by these genes and their products. The promise of genome projects has been a complete catalog of genes in a wide range of organisms. While genome projects have been successful in providing reference genome sequences, the problem of finding genes and their variants in genomic sequence remains an ongoing challenge. TGI has created an inventory that contains genes and their variants together with description. In addition, this resource is attempting to use these catalogs to find links between genes and pathways in different species and to provide lists of features within completed genomes that can aid in the understanding of how gene expression is regulated. DATABASES *Eukaryotic Gene Orthologues (formerly known as TOGA - TIGR Orthologous Gene Alignment): Eukaryotic Gene Orthologues (EGO) at DFGI are generated by pair-wise comparison between the Tentative Consensus (TC) sequences that comprise the Dana Farber Gene Indices from individual organisms. The reciprocal pairs of the best match were clustered into individual groups and multiple sequence alignments were displayed for each group. *GeneChip Oncology Database (GCOD):Cancer gene expression database is a collection of publicly available microarray expression data on Affymetrix GeneChip Arrays related to human cancers. Currently only datasets with available raw data (Affymetrix .CEL files) are processed. All processed datasets were subjected to extensive manual curation, uniform processing and consistent quality control. You can browse the experiments in our collection, perform statistical analysis, and download processed data; or to search gene expression profiles using Entrez gene symbol, Unigene ID, or Affymetrix probeset ID. *Gene Indices: As of July 1, 2008, there are 111 publicly available gene indices. They are separated into 4 categories for better organization and easier access. Animal: 41, Plant: 45, Protist: 15, Fungal: 10 *Genomic Maps: Human, mouse, rat, chicken, drosophila melanogaster, zebrafish, mosquito, caenorhabditis elegans, Arabidopsis thaliana, rice, yeast, fission yeast Dana-Farber Cancer Institute (DFCI) Gene Indices Software Tools: *TGI Clustering tools (TGICL): a software system for fast clustering of large EST datasets. *GICL: this package contains the scripts and all the necessary pre-compiled binaries for 32bit Linux systems. *clview: an assembly file viewer. *SeqClean:a script for automated trimming and validation of ESTs or other DNA sequences by screening for various contaminants, low quality and low-complexity sequences. *cdbfasta/cdbyank: fast indexing/retrieval of fasta records from flat file databases. *DAS/XML Genomic Viewer The Genomic viewer borrows modules from http://www.biodas.org (lstein (at) cshl.org) & http://webreference.com.
Proper citation: Gene Index Project (RRID:SCR_002148) Copy
Database of genetic and molecular biological information about the filamentous fungi of the genus Aspergillus including information about genes and proteins of Aspergillus nidulans and Aspergillus fumigatus; descriptions and classifications of their biological roles, molecular functions, and subcellular localizations; gene, protein, and chromosome sequence information; tools for analysis and comparison of sequences; and links to literature information; as well as a multispecies comparative genomics browser tool (Sybil) for exploration of orthology and synteny across multiple sequenced Sgenus species. Also available are Gene Ontology (GO) and community resources. Based on the Candida Genome Database, the Aspergillus Genome Database is a resource for genomic sequence data and gene and protein information for Aspergilli. Among its many species, the genus contains an excellent model organism (A. nidulans, or its teleomorph Emericella nidulans), an important pathogen of the immunocompromised (A. fumigatus), an agriculturally important toxin producer (A. flavus), and two species used in industrial processes (A. niger and A. oryzae). Search options allow you to: *Search AspGD database using keywords. *Find chromosomal features that match specific properties or annotations. *Find AspGD web pages using keywords located on the page. *Find information on one gene from many databases. *Search for keywords related to a phenotype (e.g., conidiation), an allele (such as veA1), or an experimental condition (e.g., light). Analysis and Tools allow you to: *Find similarities between a sequence of interest and Aspergillus DNA or protein sequences. *Display and analyze an Aspergillus sequence (or other sequence) in many ways. *Navigate the chromosomes set. View nucleotide and protein sequence. *Find short DNA/protein sequence matches in Aspergillus. *Design sequencing and PCR primers for Aspergillus or other input sequences. *Display the restriction map for a Aspergillus or other input sequence. *Find similarities between a sequence of interest and fungal nucleotide or protein sequences. AspGD welcomes data submissions.
Proper citation: ASPGD (RRID:SCR_002047) Copy
http://www.ncbi.nlm.nih.gov/HTGS/
Database of high-throughput genome sequences from large-scale genome sequencing centers, including unfinished and finished sequences. It was created to accommodate a growing need to make unfinished genomic sequence data rapidly available to the scientific community in a coordinated effort among the International Nucleotide Sequence databases, DDBJ, EMBL, and GenBank. Sequences are prepared for submission by using NCBI's software tools Sequin or tbl2asn. Each center has an FTP directory into which new or updated sequence files are placed. Sequence data in this division are available for BLAST homology searches against either the htgs database or the month database, which includes all new submissions for the prior month. Unfinished HTG sequences containing contigs greater than 2 kb are assigned an accession number and deposited in the HTG division. A typical HTG record might consist of all the first-pass sequence data generated from a single cosmid, BAC, YAC, or P1 clone, which together make up more than 2 kb and contain one or more gaps. A single accession number is assigned to this collection of sequences, and each record includes a clear indication of the status (phase 1 or 2) plus a prominent warning that the sequence data are unfinished and may contain errors. The accession number does not change as sequence records are updated; only the most recent version of a HTG record remains in GenBank.
Proper citation: High Throughput Genomic Sequences Division (RRID:SCR_002150) Copy
Original SAMTOOLS package has been split into three separate repositories including Samtools, BCFtools and HTSlib. Samtools for manipulating next generation sequencing data used for reading, writing, editing, indexing,viewing nucleotide alignments in SAM,BAM,CRAM format. BCFtools used for reading, writing BCF2,VCF, gVCF files and calling, filtering, summarising SNP and short indel sequence variants. HTSlib used for reading, writing high throughput sequencing data.
Proper citation: SAMTOOLS (RRID:SCR_002105) Copy
https://ftp.ncbi.nlm.nih.gov/pub/mhc/mhc/Final%20Archive/
THIS RESOURCE IS NO LONGER IN SERVICE. Documented on August 23, 2019 Database was open, publicly accessible platform for DNA and clinical data related to human Major Histocompatibility Complex (MHC). Data from IHWG workshops were provided as well., THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025.
Proper citation: dbMHC (RRID:SCR_002302) Copy
http://bioafrica.mrc.ac.za/index.html
The BioAfrica HIV-1 Proteomics Resource is a website that contains detailed information about the HIV-1 proteome and protease cleavage sites, as well as data-mining tools that can be used to manipulate and query protein sequence data, a BLAST tool for initiating structural analyses of HIV-1 proteins, and a proteomics tools directory. HIV Proteomics Resource contains information about each HIV-1 gene product in regard to expression, post-transcriptional / post-translational modifications, localization, functional activities, and potential interactions with viral and host macromolecules. The Proteome section contains extensive data on each of 19 HIV-1 proteins, including their functional properties, a sample analysis of HIV-1HXB2, structural models and links to other online resources. The HIV-1 Protease Cleavage Sites section provides information on the position, subtype variation and genetic evolution of Gag, Gag-Pol and Nef cleavage sites.
Proper citation: BioAfrica HIV Informatics in Africa (RRID:SCR_002295) Copy
Maintains and provides archival, retrieval and analytical resources for biological information. Central DDBJ resource consists of public, open-access nucleotide sequence databases including raw sequence reads, assembly information and functional annotation. Database content is exchanged with EBI and NCBI within the framework of the International Nucleotide Sequence Database Collaboration (INSDC). In 2011, DDBJ launched two new resources: DDBJ Omics Archive and BioProject. DOR is archival database of functional genomics data generated by microarray and highly parallel new generation sequencers. Data are exchanged between the ArrayExpress at EBI and DOR in the common MAGE-TAB format. BioProject provides organizational framework to access metadata about research projects and data from projects that are deposited into different databases.
Proper citation: DNA DataBank of Japan (DDBJ) (RRID:SCR_002359) Copy
http://www.ncbi.nlm.nih.gov/genome
Database that organizes information on genomes including sequences, maps, chromosomes, assemblies, and annotations in six major organism groups: Archaea, Bacteria, Eukaryotes, Viruses, Viroids, and Plasmids. Genomes of over 1,200 organisms can be found in this database, representing both completely sequenced organisms and those for which sequencing is in progress. Users can browse by organism, and view genome maps and protein clusters. Links to other prokaryotic and archaeal genome projects, as well as BLAST tools and access to the rest of the NCBI online resources are available.
Proper citation: NCBI Genome (RRID:SCR_002474) Copy
https://services.healthtech.dtu.dk/
Center for Biological Sequence Analysis of the Technical University of Denmark conducts basic research in the field of bioinformatics and systems biology and directs its research primarily towards topics related to the elucidation of the functional aspects of complex biological mechanisms. A large number of computational methods have been produced, which are offered to others via WWW servers. Several data sets are also available. The center also has experimental efforts in gene expression analysis using DNA chips and data generation in relation to the physical and structural properties of DNA. The on-line prediction services at CBS are available as interactive input forms. Most of the servers are also available as stand-alone software packages with the same functionality. In addition, for some servers, programmatic access is provided in the form of SOAP-based Web Services. The center also educates engineering students in biotechnology and systems biology and offers a wide range of courses in bioinformatics, systems biology, human health, microbiology and nutrigenomics.
Proper citation: DTU Center for Biological Sequence Analysis (RRID:SCR_003590) Copy
http://iubio.bio.indiana.edu/webapps/SeWeR/
Sequence analysis using Web Resources (SeWeR) is an integrated, Dynamic HTML (DHTML) interface to commonly used bioinformatics services available on the World Wide Web. It is highly customizable, extendable, platform neutral, completely server-independent and can be hosted as a web page as well as being used as stand-alone software running within a web browser. It doesn''t require any server to host itself. The goal of SeWeR is to turn your web-browser into a powerful sequence-analysis tool. It is written entirely in JavaScript1.2. SeWeR can be downloaded and mirrored freely. The whole package is just around 300K. You can even run it from a floppy. SeWeR is not compatible with Netscape 6. SeWeR now generates graphics. Savvy is a plasmid drawing software that generates plasmid map in the revolutionary Scalable Vector Graphics format from W3C.
Proper citation: SeWeR - SEquence analysis using WEb Resources (RRID:SCR_004167) Copy
http://nematode.lab.nig.ac.jp/
Expression pattern map of the 100Mb genome of the nematode Caenorhabditis elegans through EST analysis and systematic whole mount in situ hybridization. NEXTDB is the database to integrate all information from their expression pattern project and to make the data available to the scientific community. Information available in the current version is as follows: * Map: Visual expression of the relationships among the cosmids, predicted genes and the cDNA clones. * Image: In situ hybridization images that are arranged by their developmental stages. * Sequence: Tag sequences of the cDNA clones are available. * Homology: Results of BLASTX search are available. Users of the data presented on our web pages should not publish the information without our permission and appropriate acknowledgment. Methods are available for: * In situ hybridization on whole mount embryos of C.elegans * Protocols for large scale in situ hybridization on C.elegans larvae
Proper citation: NEXTDB (RRID:SCR_004480) Copy
http://www.genedb.org/Homepage/Tbruceibrucei927
Database of the most recent sequence updates and annotations for the T. brucei genome. New annotations are constantly being added to keep up with published manuscripts and feedback from the Trypanosomatid research community. You may search by Protein Length, Molecular Mass, Gene Type, Date, Location, Protein Targeting, Transmembrane Helices, Product, GO, EC, Pfam ID, Curation and Comments, and Dbxrefs. BLAST and other tools are available. T. brucei possesses a two-unit genome, a nuclear genome and a mitochondrial (kinetoplast) genome with a total estimated size of 35Mb/haploid genome. The nuclear genome is split into three classes of chromosomes according to their size on pulsed-field gel electrophoresis, 11 pairs of megabase chromosomes (0.9-5.7 Mb), intermediate (300-900 kb) and minichromosomes (50-100 kb). The T. brucei genome contains a ~0.5Mb segmental duplication affecting chromosomes 4 and 8, which is responsible for some 75 gene duplicates unique to this species. A comparative chromosome map of the duplicons can be accessed here (PubmedID 18036214). Protozoan parasites within the species Trypanosoma brucei are the etiological agent of human sleeping sickness and Nagana in animals. Infections are limited to patches of sub-Saharan Africa where insects vectors of the Glossina genus are endemic. The most recent estimates indicate between 50,000 - 70,000 human cases currently exist, with 17 000 new cases each year (WHO Factsheet, 2006). In collaboration with GeneDB, the EuPathDB genomic sequence data and annotations are regularly deposited on TriTrypDB where they can be integrated with other datasets and queried using customized queries.
Proper citation: GeneDB Tbrucei (RRID:SCR_004786) Copy
http://www.osc.riken.jp/english/
Omics Science Center is aiming to develop a comprehensive system called Life Science Accelerator(LSA) for the advancement of omics research. The LSA is a comprehensive system consists of biological resources, human resources, technologies, know-how, and essential administrative ability. Ultimate goal of LSA is to support and accelerate the advancement in life science research. Omics is the comprehensive study of molecules in living organisms. The complete sequencing of genomes (the complete set of genes in an organism) has enabled rapid developments in the collection and analysis of various types of comprehensive molecular data such as transcriptomes (the complete set of gene expression data) and proteomes (the complete set of intracellular proteins). Fundamental omics research aims to link these omics data to molecular networks and pathways in order to advance the understanding of biological phenomena as systems at the molecular level.
Proper citation: RIKEN Omics Science Center (RRID:SCR_008241) Copy
The Distributed Annotation System (DAS) defines a communication protocol used to exchange annotations on genomic or protein sequences. It is motivated by the idea that such annotations should not be provided by single centralized databases, but should instead be spread over multiple sites. Data distribution, performed by DAS servers, is separated from visualization, which is done by DAS clients. The advantages of this system are that control over the data is retained by data providers, data is freed from the constraints of specific organisations and the normal issues of release cycles, API updates and data duplication are avoided. DAS is a client-server system in which a single client integrates information from multiple servers. It allows a single machine to gather up sequence annotation information from multiple distant web sites, collate the information, and display it to the user in a single view. Little coordination is needed among the various information providers. DAS is heavily used in the genome bioinformatics community. Over the last years we have also seen growing acceptance in the protein sequence and structure communities. A DAS-enabled website or application can aggregate complex and high-volume data from external providers in an efficient manner. For the biologist, this means the ability to plug in the latest data, possibly including a user''s own data. For the application developer, this means protection from data format changes and the ability to add new data with minimal development cost. Here are some examples of DAS-enabled applications or websites for end users: :- Dalliance Experimental Web/Javascript based Genome Viewer :- IGV Integrative Genome Viewer java based browser for many genomes :- Ensembl uses DAS to pull in genomic, gene and protein annotations. It also provides data via DAS. :- Gbrowse is a generic genome browser, and is both a consumer and provider of DAS. :- IGB is a desktop application for viewing genomic data. :- SPICE is an application for projecting protein annotations onto 3D structures. :- Dasty2 is a web-based viewer for protein annotations :- Jalview is a multiple alignment editor. :- PeppeR is a graphical viewer for 3D electron microscopy data. :- DASMI is an integration portal for protein interaction data. :- DASher is a Java-based viewer for protein annotations. :- EpiC presents structure-function summaries for antibody design. :- STRAP is a STRucture-based sequence Alignment Program. Hundreds of DAS servers are currently running worldwide, including those provided by the European Bioinformatics Institute, Ensembl, the Sanger Institute, UCSC, WormBase, FlyBase, TIGR, and UniProt. For a listing of all available DAS sources please visit the DasRegistry. Sponsors: The initial ideas for DAS were developed in conversations with LaDeana Hillier of the Washington University Genome Sequencing Center.
Proper citation: Distributed Annotation System (RRID:SCR_008427) Copy
http://bioinf.uni-greifswald.de/augustus/
Software for gene prediction in eukaryotic genomic sequences. Serves as a basis for further steps in the analysis of sequenced and assembled eukaryotic genomes.
Proper citation: Augustus (RRID:SCR_008417) Copy
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the RRID Resources search. From here you can search through a compilation of resources used by RRID and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that RRID has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on RRID then you can log in from here to get additional features in RRID such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into RRID you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the sources that were queried against in your search that you can investigate further.
Here are the categories present within RRID that you can filter your data on
Here are the subcategories present within this category that you can filter your data on
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.