Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
A curated collection of chaperonin sequence data collected from public databases or generated by a network of collaborators exploiting the cpn60 target in clinical, phylogenetic and microbial ecology studies. The database contains all available sequences for both group I and group II chaperonins. Users can search the database by Chaperonin type, group (I or II), BLAST, or other options, and can also enter and analyze FASTA sequences.
Proper citation: cpnDB: A Chaperonin Database (RRID:SCR_002263) Copy
http://ww2.sanbi.ac.za/Dbases.html
THIS RESOURCE IS NO LONGER IN SERVICE, documented August 23, 2016. The STACKdb is knowledgebase generated by processing EST and mRNA sequences obtained from GenBank through a pipeline consisting of masking, clustering, alignment and variation analysis steps. The STACK project aims to generate a comprehensive representation of the sequence of each of the expressed genes in the human genome by extensive processing of gene fragments to make accurate alignments, highlight diversity and provide a carefully joined set of consensus sequences for each gene. The STACK project is comprised of the STACKdb human gene index, a database of virtual human transcripts, as well as stackPACK, the tools used to create the database. STACKdb is organized into 15 tissue-based categories and one disease category. STACK is a tool for detection and visualization of expressed transcript variation in the context of developmental and pathological states. The data system organizes and reconstructs human transcripts from available public data in the context of expression state. The expression state of a transcript can include developmental state, pathological association, site of expression and isoform of expressed transcript. STACK consensus transcripts are reconstructed from clusters that capture and reflect the growing evidence of transcript diversity. The comprehensive capture of transcript variants is achieved by the use of a novel clustering approach that is tolerant of sub-sequence diversity and does not rely on pairwise alignment. This is in contrast with other gene indexing projects. STACK is generated at least four times a year and represents the exhaustive processing of all publicly available human EST data extracted from GenBank. This processed information can be explored through 15 tissue-specific categories, a disease-related category and a whole-body index
Proper citation: Sequence Tag Alignment and Consensus Knowledgebase Database (RRID:SCR_002156) Copy
The Hepatitis C Virus (HCV) Database Project strives to present HCV-associated genetic and immunologic data in a user-friendly way, by providing access to the central database via web-accessible search interfaces and supplying a number of analysis tools.
Proper citation: HCV Databases (RRID:SCR_002863) Copy
Issue
Software package for analysis of brain imaging data sequences. Sequences can be a series of images from different cohorts, or time-series from same subject. Current release is designed for analysis of fMRI, PET, SPECT, EEG and MEG.
Proper citation: SPM (RRID:SCR_007037) Copy
Professionally curated repository for genetics, genomics and related data resources for soybean that contains the most current genetic, physical and genomic sequence maps integrated with qualitative and quantitative traits. SoyBase includes annotated Williams 82 genomic sequence and associated data mining tools. The genetic and sequence views of the soybean chromosomes and the extensive data on traits and phenotypes are extensively interlinked. This allows entry to the database using almost any kind of available information, such as genetic map symbols, soybean gene names or phenotypic traits. The repository maintains controlled vocabularies for soybean growth, development, and traits that are linked to more general plant ontologies. Contributions to SoyBase or the Breeder''s Toolbox are welcome.
Proper citation: SoyBase (RRID:SCR_005096) Copy
A database of human mitochondrial genomes containing mtDNA sequences, polymorphic sites, and the ability to search for specific variants. It contains 1865 complete sequences and 839 coding region sequences.
Proper citation: mtDB - Human Mitochondrial Genome Database (RRID:SCR_002945) Copy
Database of polymorphisms and mutations of the human mitochondrial DNA. It reports published and unpublished data on human mitochondrial DNA variation. All data is curated by hand. If you would like to submit published articles to be included in mitomap, please send them the citation and a pdf.
Proper citation: MITOMAP - A human mitochondrial genome database (RRID:SCR_002996) Copy
http://gladyshevlab.org/SelenoproteinPredictionServer/
Web server to predict eukaryotic selenoproteins and SECIS (SElenoCysteine Insertion Sequences) elements along nucleotide sequences. SECISearch3 replaces its predecessor SECISearch as a tool for prediction of eukaryotic SECIS elements. Seblastian is a method for selenoprotein gene detection that uses SECISearch3 and then predicts selenoprotein sequences encoded upstream of SECIS elements. Seblastian is able to both identify known selenoproteins and predict new selenoproteins.
Proper citation: SECISearch3 and Seblastian (RRID:SCR_003186) Copy
A database of three-dimensional structural information about nucleic acids and their complexes. In addition to primary data, it contains derived geometric data, classifications of structures and motifs, standards for describing nucleic acid features, as well as tools and software for the analysis of nucleic acids. A variety of search capabilities are available, as are many different types of reports. NDB maintains the macromolecular Crystallographic Information File (mmCIF).
Proper citation: Nucleic Acid Database (RRID:SCR_003255) Copy
http://www.ncbi.nlm.nih.gov/taxonomy/
Database for a curated classification and nomenclature that contains the names of all organisms that are represented in the public sequence databases with at least one nucleotide or protein sequence. Data provided encompasses archaea, bacteria, eukaryota, viroids and viruses. The NCBI taxonomy database is not a primary source for taxonomic or phylogenetic information. Furthermore, the database does not follow a single taxonomic treatise but rather attempts to incorporate phylogenetic and taxonomic knowledge from a variety of sources, including the published literature, web-based databases, and the advice of sequence submitters and outside taxonomy experts. Consequently, the NCBI taxonomy database is not a phylogenetic or taxonomic authority and should not be cited as such.
Proper citation: NCBI Taxonomy (RRID:SCR_003256) Copy
http://bioinfo.mbi.ucla.edu/ASAP/
THIS RESOURCE IS NO LONGER IN SERVICE, documented on 8/12/13. Database to access and mine alternative splicing information coming from genomics and proteomics based on genome-wide analyses of alternative splicing in human (30 793 alternative splice relationships found) from detailed alignment of expressed sequences onto the genomic sequence. ASAP provides precise gene exon-intron structure, alternative splicing, tissue specificity of alternative splice forms, and protein isoform sequences resulting from alternative splicing. They developed an automated method for discovering human tissue-specific regulation of alternative splicing through a genome-wide analysis of expressed sequence tags (ESTs), which involves classifying human EST libraries according to tissue categories and Bayesian statistical analysis. They use the UniGene clusters of human Expressed Sequence Tags (ESTs) to identify splices. The UniGene EST's are clustered so that a single cluster roughly corresponds to a gene (or at least a part of a gene). A single EST represents a portion of a processed (already spliced) mRNA. A given cluster contains many ESTs, each representing an outcome of a series of splicing events. The ESTs in UniGene contain the different mRNA isoforms transcribed from an alternatively spliced gene. They are not predicting alternative splicing, but locating it based on EST analysis. The discovered splices are further analyzed to determine alternative splicing events. They have identified 6201 alternative splice relationships in human genes, through a genome-wide analysis of expressed sequence tags (ESTs). Starting with 2.1 million human mRNA and EST sequences, they mapped expressed sequences onto the draft human genome sequence and only accepted splices that obeyed the standard splice site consensus. After constructing a tissue list of 46 human tissues with 2 million human ESTs, they generated a database of novel human alternative splices that is four times larger than our previous report, and used Bayesian statistics to compare the relative abundance of every pair of alternative splices in these tissues. Using several statistical criteria for tissue specificity, they have identified 667 tissue-specific alternative splicing relationships and analyzed their distribution in human tissues. They have validated our results by comparison with independent studies. This genome-wide analysis of tissue specificity of alternative splicing will provide a useful resource to study the tissue-specific functions of transcripts and the association of tissue-specific variants with human diseases.
Proper citation: ASAP: the Alternative Splicing Annotation Project (RRID:SCR_003415) Copy
http://caps.ncbs.res.in/3dswap/index.html
Curated knowledegbase of protein structures that are reported to be involved in 3-dimensional domain swapping. 3DSwap provides literature curated information and structure related information about 3D domain swapping in proteins. Information about swapping, hinge region, swapped region, extent of swapping, etc. are extracted from original research publications after extensive literature curation.
Proper citation: 3DSwap (RRID:SCR_004133) Copy
http://www.hgsc.bcm.tmc.edu/content/hapmap-3-and-encode-3
Draft release 3 for genome-wide SNP genotyping and targeted sequencing in DNA samples from a variety of human populations (sometimes referred to as the HapMap 3 samples). This release contains the following data: * SNP genotype data generated from 1184 samples, collected using two platforms: the Illumina Human1M (by the Wellcome Trust Sanger Institute) and the Affymetrix SNP 6.0 (by the Broad Institute). Data from the two platforms have been merged for this release. * PCR-based resequencing data (by Baylor College of Medicine Human Genome Sequencing Center) across ten 100-kb regions (collectively referred to as ENCODE 3) in 712 samples. Since this is a draft release, please check this site regularly for updates and new releases. The HapMap 3 sample collection comprises 1,301 samples (including the original 270 samples used in Phase I and II of the International HapMap Project) from 11 populations, listed below alphabetically by their 3-letter labels. Five of the ten ENCODE 3 regions overlap with the HapMap-ENCODE regions; the other five are regions selected at random from the ENCODE target regions (excluding the 10 HapMap-ENCODE regions). All ENCODE 3 regions are 100-kb in size, and are centered within each respective ENCODE region. The HapMap 3 and ENCORE 3 data are downloadable from the ftp site.
Proper citation: HapMap 3 and ENCODE 3 (RRID:SCR_004563) Copy
http://www.ncbi.nlm.nih.gov/mapview/map_search.cgi?taxid=7165
THIS RESOURCE IS NO LONGER IN SERVICE. Documented on January 11, 2023. A database for the Anopheles gambiae str. PEST genome that was sequenced using a whole genome shotgun approach. The database aims to contribute to the understanding of mosquito genome structure and organization and will assist the development of malaria control strategies and improved anti-malarial drugs and vaccines. Sequences were generated and assembled into contigs for submission to GenBank.
Proper citation: Anopheles gambiae (African malaria mosquito) genome view (RRID:SCR_004402) Copy
http://www.uniprot.org/taxonomy/
NEWT is the taxonomy database maintained by the UniProt group. It integrates taxonomy data compiled in the NCBI database and data specific to the UniProt Knowledgebase. Browse by hierarchy, List all, or Complete proteomes. Organisms are classified in a hierarchical tree structure. Our taxonomy database contains every node (taxon) of the tree. UniProtKB taxonomy data is manually curated: next to manually verified organism names, we provide a selection of external links, organism strains and viral host information. Species with protein sequences stored in the UniProt Knowledgebase are named according to UniProt nomenclature. We endeavour to maintain a list of manually curated species names for which protein sequence data is available. In particular, we have adopted a systematic convention for naming viral and bacterial strains and isolates. Links to external sites are chosen by the UniProt taxonomy team and show pictures and various scientific data of interest (taxonomy, biology, physiology,...).
Proper citation: NEWT (RRID:SCR_004477) Copy
http://www.ebi.ac.uk/biosamples/
Database that aggregates sample information for reference samples (e.g. Coriell Cell lines) and samples for which data exist in one of the EBI''''s assay databases such as ArrayExpress, the European Nucleotide Archive or PRoteomics Identificates DatabasE. It provides links to assays for specific samples, and accepts direct submissions of sample information. The goals of the BioSample Database include: # recording and linking of sample information consistently within EBI databases such as ENA, ArrayExpress and PRIDE; # minimizing data entry efforts for EBI database submitters by enabling submitting sample descriptions once and referencing them later in data submissions to assay databases and # supporting cross database queries by sample characteristics. The database includes a growing set of reference samples, such as cell lines, which are repeatedly used in experiments and can be easily referenced from any database by their accession numbers. Accession numbers for the reference samples will be exchanged with a similar database at NCBI. The samples in the database can be queried by their attributes, such as sample types, disease names or sample providers. A simple tab-delimited format facilitates submissions of sample information to the database, initially via email to biosamples (at) ebi.ac.uk. Current data sources: * European Nucleotide Archive (424,811 samples) * PRIDE (17,001 samples) * ArrayExpress (1,187,884 samples) * ENCODE cell lines (119 samples) * CORIELL cell lines (27,002 samples) * Thousand Genome (2,628 samples) * HapMap (1,417 samples) * IMSR (248,660 samples)
Proper citation: BioSample Database at EBI (RRID:SCR_004856) Copy
A database of protein families, each represented by multiple sequence alignments and hidden Markov models (HMMs). Users can analyze protein sequences for Pfam matches, view Pfam family annotation and alignments, see groups of related families, look at the domain organization of a protein sequence, find the domains on a PDB structure, and query Pfam by keywords. There are two components to Pfam: Pfam-A and Pfam-B. Pfam-A entries are high quality, manually curated families that may automatically generate a supplement using the ADDA database. These automatically generated entries are called Pfam-B. Although of lower quality, Pfam-B families can be useful for identifying functionally conserved regions when no Pfam-A entries are found. Pfam also generates higher-level groupings of related families, known as clans (collections of Pfam-A entries which are related by similarity of sequence, structure or profile-HMM).
Proper citation: Pfam (RRID:SCR_004726) Copy
THIS RESOURCE IS NO LONGER IN SERVICE. Documented on January 11,2023. SuperCAT hosts typing databases for the Bacillus cereus group of bacteria. The databases contain MultiLocus Sequence Typing (MLST), MultiLocus Enzyme Electrophoresis (MLEE), and Amplified Fragment Length Polymorphism (AFLP) phylogenetic data. multilocus, sequence, Bacillus cereus, bacteria, Genomics, non-vertebrate, taxonomy, identification
Proper citation: SuperCAT (RRID:SCR_004882) Copy
A clade oriented, community curated database containing genomic, genetic, phenotypic and taxonomic information for plant genomes. Genomic information is presented in a comparative format and tied to important plant model species such as Arabidopsis. SGN provides tools such as: BLAST searches, the SolCyc biochemical pathways database, a CAPS experiment designer, an intron detection tool, an advanced Alignment Analyzer, and a browser for phylogenetic trees. The SGN code and database are developed as an open source project, and is based on database schemas developed by the GMOD project and SGN-specific extensions.
Proper citation: SGN (RRID:SCR_004933) Copy
http://cgi-www.daimi.au.dk/cgi-chili/datfap/frontdoor.py
A database of transcription factors from 13 plant species, and PCR primers for around 90% of them.
Proper citation: DATFAP (RRID:SCR_005413) Copy
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the NIF Resources search. From here you can search through a compilation of resources used by NIF and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that NIF has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on NIF then you can log in from here to get additional features in NIF such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into NIF you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the sources that were queried against in your search that you can investigate further.
Here are the categories present within NIF that you can filter your data on
Here are the subcategories present within this category that you can filter your data on
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.