NIF | Searching in Community Resources

cpnDB: A Chaperonin Database

RRID:SCR_002263

A curated collection of chaperonin sequence data collected from public databases or generated by a network of collaborators exploiting the cpn60 target in clinical, phylogenetic and microbial ecology studies. The database contains all available sequences for both group I and group II chaperonins. Users can search the database by Chaperonin type, group (I or II), BLAST, or other options, and can also enter and analyze FASTA sequences.

Proper citation: cpnDB: A Chaperonin Database (RRID:SCR_002263) Copy

Source: SciCrunch Registry

Sequence Tag Alignment and Consensus Knowledgebase Database

RRID:SCR_002156

http://ww2.sanbi.ac.za/Dbases.html

THIS RESOURCE IS NO LONGER IN SERVICE, documented August 23, 2016. The STACKdb is knowledgebase generated by processing EST and mRNA sequences obtained from GenBank through a pipeline consisting of masking, clustering, alignment and variation analysis steps. The STACK project aims to generate a comprehensive representation of the sequence of each of the expressed genes in the human genome by extensive processing of gene fragments to make accurate alignments, highlight diversity and provide a carefully joined set of consensus sequences for each gene. The STACK project is comprised of the STACKdb human gene index, a database of virtual human transcripts, as well as stackPACK, the tools used to create the database. STACKdb is organized into 15 tissue-based categories and one disease category. STACK is a tool for detection and visualization of expressed transcript variation in the context of developmental and pathological states. The data system organizes and reconstructs human transcripts from available public data in the context of expression state. The expression state of a transcript can include developmental state, pathological association, site of expression and isoform of expressed transcript. STACK consensus transcripts are reconstructed from clusters that capture and reflect the growing evidence of transcript diversity. The comprehensive capture of transcript variants is achieved by the use of a novel clustering approach that is tolerant of sub-sequence diversity and does not rely on pairwise alignment. This is in contrast with other gene indexing projects. STACK is generated at least four times a year and represents the exhaustive processing of all publicly available human EST data extracted from GenBank. This processed information can be explored through 15 tissue-specific categories, a disease-related category and a whole-body index

Proper citation: Sequence Tag Alignment and Consensus Knowledgebase Database (RRID:SCR_002156) Copy

Source: SciCrunch Registry

HCV Databases

RRID:SCR_002863

http://hcv.lanl.gov/

The Hepatitis C Virus (HCV) Database Project strives to present HCV-associated genetic and immunologic data in a user-friendly way, by providing access to the central database via web-accessible search interfaces and supplying a number of analysis tools.

Proper citation: HCV Databases (RRID:SCR_002863) Copy

Source: SciCrunch Registry

SPM

RRID:SCR_007037

Issue

https://github.com/spm

Software package for analysis of brain imaging data sequences. Sequences can be a series of images from different cohorts, or time-series from same subject. Current release is designed for analysis of fMRI, PET, SPECT, EEG and MEG.

Proper citation: SPM (RRID:SCR_007037) Copy

Source: SciCrunch Registry

SoyBase

RRID:SCR_005096

http://soybase.org

Professionally curated repository for genetics, genomics and related data resources for soybean that contains the most current genetic, physical and genomic sequence maps integrated with qualitative and quantitative traits. SoyBase includes annotated Williams 82 genomic sequence and associated data mining tools. The genetic and sequence views of the soybean chromosomes and the extensive data on traits and phenotypes are extensively interlinked. This allows entry to the database using almost any kind of available information, such as genetic map symbols, soybean gene names or phenotypic traits. The repository maintains controlled vocabularies for soybean growth, development, and traits that are linked to more general plant ontologies. Contributions to SoyBase or the Breeder''s Toolbox are welcome.

Proper citation: SoyBase (RRID:SCR_005096) Copy

Source: SciCrunch Registry

mtDB - Human Mitochondrial Genome Database

RRID:SCR_002945

http://www.genpat.uu.se/mtDB

A database of human mitochondrial genomes containing mtDNA sequences, polymorphic sites, and the ability to search for specific variants. It contains 1865 complete sequences and 839 coding region sequences.

Proper citation: mtDB - Human Mitochondrial Genome Database (RRID:SCR_002945) Copy

Source: SciCrunch Registry

MITOMAP - A human mitochondrial genome database

RRID:SCR_002996

http://www.mitomap.org/

Database of polymorphisms and mutations of the human mitochondrial DNA. It reports published and unpublished data on human mitochondrial DNA variation. All data is curated by hand. If you would like to submit published articles to be included in mitomap, please send them the citation and a pdf.

Proper citation: MITOMAP - A human mitochondrial genome database (RRID:SCR_002996) Copy

Source: SciCrunch Registry

SECISearch3 and Seblastian

RRID:SCR_003186

http://gladyshevlab.org/SelenoproteinPredictionServer/

Web server to predict eukaryotic selenoproteins and SECIS (SElenoCysteine Insertion Sequences) elements along nucleotide sequences. SECISearch3 replaces its predecessor SECISearch as a tool for prediction of eukaryotic SECIS elements. Seblastian is a method for selenoprotein gene detection that uses SECISearch3 and then predicts selenoprotein sequences encoded upstream of SECIS elements. Seblastian is able to both identify known selenoproteins and predict new selenoproteins.

Proper citation: SECISearch3 and Seblastian (RRID:SCR_003186) Copy

Source: SciCrunch Registry

Nucleic Acid Database

RRID:SCR_003255

http://ndbserver.rutgers.edu/

A database of three-dimensional structural information about nucleic acids and their complexes. In addition to primary data, it contains derived geometric data, classifications of structures and motifs, standards for describing nucleic acid features, as well as tools and software for the analysis of nucleic acids. A variety of search capabilities are available, as are many different types of reports. NDB maintains the macromolecular Crystallographic Information File (mmCIF).

Proper citation: Nucleic Acid Database (RRID:SCR_003255) Copy

Source: SciCrunch Registry

NCBI Taxonomy

RRID:SCR_003256

http://www.ncbi.nlm.nih.gov/taxonomy/

Database for a curated classification and nomenclature that contains the names of all organisms that are represented in the public sequence databases with at least one nucleotide or protein sequence. Data provided encompasses archaea, bacteria, eukaryota, viroids and viruses. The NCBI taxonomy database is not a primary source for taxonomic or phylogenetic information. Furthermore, the database does not follow a single taxonomic treatise but rather attempts to incorporate phylogenetic and taxonomic knowledge from a variety of sources, including the published literature, web-based databases, and the advice of sequence submitters and outside taxonomy experts. Consequently, the NCBI taxonomy database is not a phylogenetic or taxonomic authority and should not be cited as such.

Proper citation: NCBI Taxonomy (RRID:SCR_003256) Copy

Source: SciCrunch Registry

ASAP: the Alternative Splicing Annotation Project

RRID:SCR_003415

http://bioinfo.mbi.ucla.edu/ASAP/

THIS RESOURCE IS NO LONGER IN SERVICE, documented on 8/12/13. Database to access and mine alternative splicing information coming from genomics and proteomics based on genome-wide analyses of alternative splicing in human (30 793 alternative splice relationships found) from detailed alignment of expressed sequences onto the genomic sequence. ASAP provides precise gene exon-intron structure, alternative splicing, tissue specificity of alternative splice forms, and protein isoform sequences resulting from alternative splicing. They developed an automated method for discovering human tissue-specific regulation of alternative splicing through a genome-wide analysis of expressed sequence tags (ESTs), which involves classifying human EST libraries according to tissue categories and Bayesian statistical analysis. They use the UniGene clusters of human Expressed Sequence Tags (ESTs) to identify splices. The UniGene EST's are clustered so that a single cluster roughly corresponds to a gene (or at least a part of a gene). A single EST represents a portion of a processed (already spliced) mRNA. A given cluster contains many ESTs, each representing an outcome of a series of splicing events. The ESTs in UniGene contain the different mRNA isoforms transcribed from an alternatively spliced gene. They are not predicting alternative splicing, but locating it based on EST analysis. The discovered splices are further analyzed to determine alternative splicing events. They have identified 6201 alternative splice relationships in human genes, through a genome-wide analysis of expressed sequence tags (ESTs). Starting with 2.1 million human mRNA and EST sequences, they mapped expressed sequences onto the draft human genome sequence and only accepted splices that obeyed the standard splice site consensus. After constructing a tissue list of 46 human tissues with 2 million human ESTs, they generated a database of novel human alternative splices that is four times larger than our previous report, and used Bayesian statistics to compare the relative abundance of every pair of alternative splices in these tissues. Using several statistical criteria for tissue specificity, they have identified 667 tissue-specific alternative splicing relationships and analyzed their distribution in human tissues. They have validated our results by comparison with independent studies. This genome-wide analysis of tissue specificity of alternative splicing will provide a useful resource to study the tissue-specific functions of transcripts and the association of tissue-specific variants with human diseases.

Proper citation: ASAP: the Alternative Splicing Annotation Project (RRID:SCR_003415) Copy

Source: SciCrunch Registry

3DSwap

RRID:SCR_004133

http://caps.ncbs.res.in/3dswap/index.html

Curated knowledegbase of protein structures that are reported to be involved in 3-dimensional domain swapping. 3DSwap provides literature curated information and structure related information about 3D domain swapping in proteins. Information about swapping, hinge region, swapped region, extent of swapping, etc. are extracted from original research publications after extensive literature curation.

Proper citation: 3DSwap (RRID:SCR_004133) Copy

Source: SciCrunch Registry

HapMap 3 and ENCODE 3

RRID:SCR_004563

http://www.hgsc.bcm.tmc.edu/content/hapmap-3-and-encode-3

Draft release 3 for genome-wide SNP genotyping and targeted sequencing in DNA samples from a variety of human populations (sometimes referred to as the HapMap 3 samples). This release contains the following data: * SNP genotype data generated from 1184 samples, collected using two platforms: the Illumina Human1M (by the Wellcome Trust Sanger Institute) and the Affymetrix SNP 6.0 (by the Broad Institute). Data from the two platforms have been merged for this release. * PCR-based resequencing data (by Baylor College of Medicine Human Genome Sequencing Center) across ten 100-kb regions (collectively referred to as ENCODE 3) in 712 samples. Since this is a draft release, please check this site regularly for updates and new releases. The HapMap 3 sample collection comprises 1,301 samples (including the original 270 samples used in Phase I and II of the International HapMap Project) from 11 populations, listed below alphabetically by their 3-letter labels. Five of the ten ENCODE 3 regions overlap with the HapMap-ENCODE regions; the other five are regions selected at random from the ENCODE target regions (excluding the 10 HapMap-ENCODE regions). All ENCODE 3 regions are 100-kb in size, and are centered within each respective ENCODE region. The HapMap 3 and ENCORE 3 data are downloadable from the ftp site.

Proper citation: HapMap 3 and ENCODE 3 (RRID:SCR_004563) Copy

Source: SciCrunch Registry

Anopheles gambiae (African malaria mosquito) genome view

RRID:SCR_004402

http://www.ncbi.nlm.nih.gov/mapview/map_search.cgi?taxid=7165

THIS RESOURCE IS NO LONGER IN SERVICE. Documented on January 11, 2023. A database for the Anopheles gambiae str. PEST genome that was sequenced using a whole genome shotgun approach. The database aims to contribute to the understanding of mosquito genome structure and organization and will assist the development of malaria control strategies and improved anti-malarial drugs and vaccines. Sequences were generated and assembled into contigs for submission to GenBank.

Proper citation: Anopheles gambiae (African malaria mosquito) genome view (RRID:SCR_004402) Copy

Source: SciCrunch Registry

NEWT

RRID:SCR_004477

http://www.uniprot.org/taxonomy/

NEWT is the taxonomy database maintained by the UniProt group. It integrates taxonomy data compiled in the NCBI database and data specific to the UniProt Knowledgebase. Browse by hierarchy, List all, or Complete proteomes. Organisms are classified in a hierarchical tree structure. Our taxonomy database contains every node (taxon) of the tree. UniProtKB taxonomy data is manually curated: next to manually verified organism names, we provide a selection of external links, organism strains and viral host information. Species with protein sequences stored in the UniProt Knowledgebase are named according to UniProt nomenclature. We endeavour to maintain a list of manually curated species names for which protein sequence data is available. In particular, we have adopted a systematic convention for naming viral and bacterial strains and isolates. Links to external sites are chosen by the UniProt taxonomy team and show pictures and various scientific data of interest (taxonomy, biology, physiology,...).

Proper citation: NEWT (RRID:SCR_004477) Copy

Source: SciCrunch Registry

BioSample Database at EBI

RRID:SCR_004856

http://www.ebi.ac.uk/biosamples/

Database that aggregates sample information for reference samples (e.g. Coriell Cell lines) and samples for which data exist in one of the EBI''''s assay databases such as ArrayExpress, the European Nucleotide Archive or PRoteomics Identificates DatabasE. It provides links to assays for specific samples, and accepts direct submissions of sample information. The goals of the BioSample Database include: # recording and linking of sample information consistently within EBI databases such as ENA, ArrayExpress and PRIDE; # minimizing data entry efforts for EBI database submitters by enabling submitting sample descriptions once and referencing them later in data submissions to assay databases and # supporting cross database queries by sample characteristics. The database includes a growing set of reference samples, such as cell lines, which are repeatedly used in experiments and can be easily referenced from any database by their accession numbers. Accession numbers for the reference samples will be exchanged with a similar database at NCBI. The samples in the database can be queried by their attributes, such as sample types, disease names or sample providers. A simple tab-delimited format facilitates submissions of sample information to the database, initially via email to biosamples (at) ebi.ac.uk. Current data sources: * European Nucleotide Archive (424,811 samples) * PRIDE (17,001 samples) * ArrayExpress (1,187,884 samples) * ENCODE cell lines (119 samples) * CORIELL cell lines (27,002 samples) * Thousand Genome (2,628 samples) * HapMap (1,417 samples) * IMSR (248,660 samples)

Proper citation: BioSample Database at EBI (RRID:SCR_004856) Copy

Source: SciCrunch Registry

Pfam

RRID:SCR_004726

http://pfam.xfam.org/

A database of protein families, each represented by multiple sequence alignments and hidden Markov models (HMMs). Users can analyze protein sequences for Pfam matches, view Pfam family annotation and alignments, see groups of related families, look at the domain organization of a protein sequence, find the domains on a PDB structure, and query Pfam by keywords. There are two components to Pfam: Pfam-A and Pfam-B. Pfam-A entries are high quality, manually curated families that may automatically generate a supplement using the ADDA database. These automatically generated entries are called Pfam-B. Although of lower quality, Pfam-B families can be useful for identifying functionally conserved regions when no Pfam-A entries are found. Pfam also generates higher-level groupings of related families, known as clans (collections of Pfam-A entries which are related by similarity of sequence, structure or profile-HMM).

Proper citation: Pfam (RRID:SCR_004726) Copy

Source: SciCrunch Registry

SuperCAT

RRID:SCR_004882

http://mlstoslo.uio.no/

THIS RESOURCE IS NO LONGER IN SERVICE. Documented on January 11,2023. SuperCAT hosts typing databases for the Bacillus cereus group of bacteria. The databases contain MultiLocus Sequence Typing (MLST), MultiLocus Enzyme Electrophoresis (MLEE), and Amplified Fragment Length Polymorphism (AFLP) phylogenetic data. multilocus, sequence, Bacillus cereus, bacteria, Genomics, non-vertebrate, taxonomy, identification

Proper citation: SuperCAT (RRID:SCR_004882) Copy

Source: SciCrunch Registry

SGN

RRID:SCR_004933

http://solgenomics.net/

A clade oriented, community curated database containing genomic, genetic, phenotypic and taxonomic information for plant genomes. Genomic information is presented in a comparative format and tied to important plant model species such as Arabidopsis. SGN provides tools such as: BLAST searches, the SolCyc biochemical pathways database, a CAPS experiment designer, an intron detection tool, an advanced Alignment Analyzer, and a browser for phylogenetic trees. The SGN code and database are developed as an open source project, and is based on database schemas developed by the GMOD project and SGN-specific extensions.

Proper citation: SGN (RRID:SCR_004933) Copy

Source: SciCrunch Registry

DATFAP

RRID:SCR_005413

http://cgi-www.daimi.au.dk/cgi-chili/datfap/frontdoor.py

A database of transcription factors from 13 plant species, and PCR primers for around 90% of them.

Proper citation: DATFAP (RRID:SCR_005413) Copy

Source: SciCrunch Registry

Searching the RRID Resource Information Network

Our searching services are busy right now. Please try again later

Log in

Leaving Community

About

Community Resources

More Resources

Literature

Log in

Tools Select Another Resource Report Type

Options

Current Facets and Filters

Facets

Recent searches

RRID:SCR_002263

RRID:SCR_002156

RRID:SCR_002863

RRID:SCR_007037

RRID:SCR_005096

RRID:SCR_002945

RRID:SCR_002996

RRID:SCR_003186

RRID:SCR_003255

RRID:SCR_003256

RRID:SCR_003415

RRID:SCR_004133

RRID:SCR_004563

RRID:SCR_004402

RRID:SCR_004477

RRID:SCR_004856

RRID:SCR_004726

RRID:SCR_004882

RRID:SCR_004933

RRID:SCR_005413

Neuroscience Information Framework Resources

Navigation

Logging in and Registering

Searching

Save Your Search

Query Expansion

Collections

Sources

Categories

Subcategories

Further Questions

Category Graph

About

Recent News Entries

Contact Us

SciCrunch