Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
http://www.ncbi.nlm.nih.gov/biosystems/
Database that provides access to biological systems and their component genes, proteins, and small molecules, as well as literature describing those biosystems and other related data throughout Entrez. A biosystem, or biological system, is a group of molecules that interact directly or indirectly, where the grouping is relevant to the characterization of living matter. BioSystem records list and categorize components, such as the genes, proteins, and small molecules involved in a biological system. The companion FLink tool, in turn, allows you to input a list of proteins, genes, or small molecules and retrieve a ranked list of biosystems. A number of databases provide diagrams showing the components and products of biological pathways along with corresponding annotations and links to literature. This database was developed as a complementary project to (1) serve as a centralized repository of data; (2) connect the biosystem records with associated literature, molecular, and chemical data throughout the Entrez system; and (3) facilitate computation on biosystems data. The NCBI BioSystems Database currently contains records from several source databases: KEGG, BioCyc (including its Tier 1 EcoCyc and MetaCyc databases, and its Tier 2 databases), Reactome, the National Cancer Institute's Pathway Interaction Database, WikiPathways, and Gene Ontology (GO). It includes several types of records such as pathways, structural complexes, and functional sets, and is desiged to accomodate other record types, such as diseases, as data become available. Through these collaborations, the BioSystems database facilitates access to, and provides the ability to compute on, a wide range of biosystems data. If you are interested in depositing data into the BioSystems database, please contact them.
Proper citation: NCBI BioSystems Database (RRID:SCR_004690) Copy
http://www.proconsortium.org/pro/
An ontological representation of protein-related entities by explicitly defining them and showing the relationships between them. Each PRO term represents a distinct class of entities (including specific modified forms, orthologous isoforms, and protein complexes) ranging from the taxon-neutral to the taxon-specific. The ontology has a meta-structure encompassing three areas: proteins based on evolutionary relatedness (ProEvo); protein forms produced from a given gene locus (ProForm); and protein-containing complexes (ProComp). NOTICE: The PRO ID format has changed from PRO: to PR: (e.g. PRO:000000563 is now PR:000000563).
Proper citation: PR (RRID:SCR_004964) Copy
GenMAPP is a free computer application designed to visualize gene expression and other genomic data on maps representing biological pathways and groupings of genes. Integrated with GenMAPP are programs to perform a global analysis of gene expression or genomic data in the context of hundreds of pathway MAPPs and thousands of Gene Ontology Terms (MAPPFinder), import lists of genes/proteins to build new MAPPs (MAPPBuilder), and export archives of MAPPs and expression/genomic data to the web. The main features underlying GenMAPP are: *Draw pathways with easy to use graphics tools *Color genes on MAPP files based on user-imported genomic data *Query data against MAPPs and the GeneOntology Enhanced features include the simultaneous view of multiple color sets, expanded species-specific gene databases and custom database options.
Proper citation: Gene Map Annotator and Pathway Profiler (RRID:SCR_005094) Copy
https://www.rostlab.org/services/snpdbe/
A database to fill the annotation gap left by the high cost of experimental testing for functional significance of protein variants. It joins related bits of knowledge, currently distributed throughout various databases, into a consistent, easily accessible, and updatable resource. It currently covers over 155,000 protein sequences which come from more than 2,600 organisms. Overall more than one million single amino acid substitutions (SAASs) are referenced consisting of natural variants, SAASs from mutagenesis experiments and sequencing conflicts. SNPdbe offers the following pieces of information (if available) on each SAAS: * Experimentally derived functional and structural impact * Predicted functional effect * Associated disease * Average heterozygosity * Experimental evidence of the nsSNP * Evolutionary conservation of wildtype and mutant amino acid * Link-outs to external databases A convenient webinterface to query SAASs on the following levels is offered: * Protein and gene identifiers and keywords * Disease keywords * Protein sequence on different sequence identity thresholds * Variant identifier (dbSNP rs, SwissVar, PMD) or specific mutant like XposY and specified sequence They offer the possibility to submit protein sequences along with experimentally substantiated mutations in order to predict their functional effect and inclusion into our database.
Proper citation: SNPdbe (RRID:SCR_005190) Copy
http://www.jcvi.org/cgi-bin/tigrfams/index.cgi
Consists curated multiple sequence alignments, Hidden Markov Models (HMMs) for protein sequence classification, and associated information designed to support automated annotation of (mostly prokaryotic) proteins. Starting with release 10.0, TIGRFAMs models use HMMER3, which provides excellent search speed as well as exquisite search sensitivity. See the "TIGRFAMs Complete Listing" page to review the accession, protein name, model type, and EC number (if assigned) of all models. TIGRFAMs is a member database in InterPro. The HMM libraries and supporting files are available to download and use for free from our FTP site.
Proper citation: TIGRFAMS (RRID:SCR_005493) Copy
http://llama.mshri.on.ca/synergizer/translate/
The Synergizer database is a growing repository of gene and protein identifier synonym relationships. This tool facilitates the conversion of identifiers from one naming scheme (a.k.a namespace) to another. The Synergizer is a service for translating between sets of biological identifiers. It can, for example, translate Ensembl Gene IDs to Entrez Gene IDs, or IPI IDs to HGNC gene symbols, and much more. Unlike some other tools for this purpose, The Synergizer is simple and easy to learn. The Synergizer works via a web interface (for users who are not programmers) or through a web service (for programmatic access).
Proper citation: Synergizer (RRID:SCR_005308) Copy
http://www.ncbi.nlm.nih.gov/gtr/
Central location for voluntary submission of genetic test information by providers including the test''s purpose, methodology, validity, evidence of the test''s usefulness, and laboratory contacts and credentials. GTR aims to advance the public health and research into the genetic basis of health and disease. GTR is accepting registration of clinical tests for Mendelian disorders, complex tests and arrays, and pharmacogenetic tests. These tests may include multiple methods and may include multiple major method categories such as biochemical, cytogenetic, and molecular tests. GTR is not currently accepting registration of tests for somatic disorders, research tests or direct-to-consumer tests.
Proper citation: Genetic Testing Registry (RRID:SCR_005565) Copy
Software package for Bayesian analysis of protein, DNA and RNA sequences. It utilizes multiple alignments, phylogenetic trees and evolutionary parameters to quantify uncertainty in these analyses. It is written in Java.
Proper citation: StatAlign (RRID:SCR_001892) Copy
http://abi.inf.uni-tuebingen.de/Services/YLoc/webloc.cgi
An interpretable web server for predicting subcellular localization. In addition to the predicted location, YLoc gives a reasoning why this prediction was made and which biological properties of the protein sequence lead to this prediction. Moreover, a confidence estimate helps users to rate predictions as trustworthy. YLoc+ is able to predict the location of multiple-targeted proteins with high accuracy. The YLoc webserver is also accessible via SOAP.
Proper citation: YLoc (RRID:SCR_002464) Copy
http://genome.unmc.edu/ngLOC/index.html
THIS RESOURCE IS NO LONGER IN SERVICE. Documented on January 5, 2023.An n-gram-based Bayesian classifier that predicts subcellular localization of proteins both in prokaryotes and eukaryotes. The downloadable version of this software with source code is freely available for academic use under the GNU General Public License.
Proper citation: ngLOC (RRID:SCR_003150) Copy
http://www.nactem.ac.uk/facta/
Text mining tool to discover associations between biomedical concepts from MEDLINE articles. Use the service from your browser or via a Web Service. The whole MEDLINE corpus containing more than 20 million articles is indexed with an efficient text search engine, and it allows you to navigate such associations and their textual evidence in a highly interactive manner - the system accepts arbitrary query terms and displays relevant concepts immediately. A broad range of important biomedical concepts are covered by the combination of a machine learning-based term recognizer and large-scale dictionaries for genes, proteins, diseases, and chemical compounds. There is also a FACTA+ visualization service that can be found here: http://www.nactem.ac.uk/facta-visualizer/
Proper citation: FACTA+. (RRID:SCR_001767) Copy
https://www.ncbi.nlm.nih.gov/geo/
Functional genomics data repository supporting MIAME-compliant data submissions. Includes microarray-based experiments measuring the abundance of mRNA, genomic DNA, and protein molecules, as well as non-array-based technologies such as serial analysis of gene expression (SAGE) and mass spectrometry proteomic technology. Array- and sequence-based data are accepted. Collection of curated gene expression DataSets, as well as original Series and Platform records. The database can be searched using keywords, organism, DataSet type and authors. DataSet records contain additional resources including cluster tools and differential expression queries.
Proper citation: Gene Expression Omnibus (GEO) (RRID:SCR_005012) Copy
http://bioinformatics.biol.uoa.gr/hPATM/
A web tool, based on a heuristic transformation of the original global pairwise and local pairwise alignment algorithms, offers objective alignments for transmembrane protein sequences. hPATM takes advantage of the information offered by the knowledge of the position of transmembrane segmets, by experiment or prediction. The heuristic approach may reveal similarities between diverge sequences with low percentages of identity and similarity. The produced alignments, based on common structural scaffolds derived by the transmembrane segments of the sequence, can be used to spot conserved non-transmembrane segments or as a basis for the production of 3-D models via homology modelling. The hPAFAG algorithm is based on the heuristic transformation of the Needleman & Wunsch and Smith & Waterman algorithms, featuring affine gap penalties. The heuristic transformation is based on two extra features: * a heuristic bonus, added to the score when two amino acids that belong to transmembrane segmens are aligned. * a heuristic gap penalty, substracted from the score when a gap is opened in a transmembrane segment. This way transmembrane segments are anchored (not by force, but by more strict alignment) together, allowing the pairwise alignment to focus on non-transmembrane segments. This web server offers a friendly interface for the hPATM command line version. The algorithm was implemented in PERL and the source code of the command line version is available on request by the authors.
Proper citation: hPATM (RRID:SCR_006224) Copy
http://jilab.biostat.jhsph.edu/database/cgi-bin/hmChIP.pl
A database of genome-wide chromatin immunoprecipitation (ChIP) data in human and mouse. Currently, the database contains >2000 samples from >500 ChIP-seq and ChIP-chip experiments, representing a total of >170 proteins and >10,000,000 protein-DNA interactions (March 2014). A web server provides an interface for database query. Protein-DNA binding intensities can be retrieved from individual samples for user-provided genomic regions. The retrieved intensities can be used to cluster samples and genomic regions to facilitate exploration of combinatorial patterns, cell type dependencies, and cross-sample variability of protein-DNA interactions.
Proper citation: hmChIP (RRID:SCR_005407) Copy
http://cmr.jcvi.org/tigr-scripts/CMR/CmrHomePage.cgi
Database of all of the publicly available, complete prokaryotic genomes. In addition to having all of the organisms on a single website, common data types across all genomes in the CMR make searches more meaningful, and cross genome analysis highlight differences and similarities between the genomes. CMR offers a wide variety of tools and resources, all of which are available off of our menu bar at the top of each page. Below is an explanation and link for each of these menu options. * Genome Tools: Find organism lists as well as summary information and analyses for selected genomes. * Searches: Search CMR for genes, genomes, sequence regions, and evidence. * Comparative Tools: Compare multiple genomes based on a variety of criteria, including sequence homology and gene attributes. SNP data is also found under this menu. * Lists: Select and download gene, evidence, and genomic element lists. * Downloads: Download gene sequences or attributes for CMR organisms, or go to our FTP site. * Carts: Select genome preferences from our Genome Cart or download your Gene Cart genes. The Omniome is the relational database underlying the CMR and it holds all of the annotation for each of the CMR genomes, including DNA sequences, proteins, RNA genes and many other types of features. Associated with each of these DNA features in the Omniome are the feature coordinates, nucleotide and protein sequences (where appropriate), and the DNA molecule and organism with which the feature is associated. Also available are evidence types associated with annotation such as HMMs, BLAST, InterPro, COG, and Prosite, as well as individual gene attributes. In addition, the database stores identifiers from other centers such as GenBank and SwissProt, as well as manually curated information on each genome or each DNA molecule including website links. Also stored in the Omniome are precomputed homology data, called All vs All searches, used throughout the CMR for comparative analysis.
Proper citation: JCVI CMR (RRID:SCR_005398) Copy
http://www.proteomexchange.org
A data repository for proteomic data sets. The ProteomeExchange consortium, as a whole, aims to provide a coordinated submission of MS proteomics data to the main existing proteomics repositories, as well as to encourage optimal data dissemination. ProteomeXchange provides access to a number of public databases, and users can access and submit data sets to the consortium's PRIDE database and PASSEL/PeptideAtlas.
Proper citation: ProteomeXchange (RRID:SCR_004055) Copy
http://www.ideal.force.cs.is.nagoya-u.ac.jp/IDEAL/
IDEAL, Intrinsically Disordered proteins with Extensive Annotations and Literature, is a collection of knowledge on experimentally verified intrinsically disordered proteins (IDPs) or intrinsically disordered regions (IDRs). IDEAL contains manually curated annotations on IDPs in locations, structures, and functional sites such as protein binding regions and posttranslational modification sites together with references and structural domain assignments. Protean segment One of the unique phenomena seen in IDPs is so-called the coupled folding and binding, where a short flexible segment can bind to its binding partner with forming a specific structure to act as a molecular recognition element. IDEAL explicitly annotates these regions as protean segment (ProS) when unstructured and structured information are both available in the region. Access to the data All the entries are tabulated in the list and individual entries can be retrieved by using the search tool at the upper-right corner in this page. IDEAL also provides the BLAST search, which can find homologs in IDEAL. All the information in IDEAL can be downloaded in the XML file.
Proper citation: IDEAL - Intrinsically Disordered proteins with Extensive Annotations and Literature (RRID:SCR_006027) Copy
THIS RESOURCE IS NO LONGER IN SERVICE. Documented on April 15,2025. Human protein knowledge platform. Knowledge platform for human proteins selects and filters high throughput data pertinent to human proteins from UniProtKB. Extends UniProtKB/Swiss-Prot annotations for human proteins to include several new data types.
Proper citation: neXtProt (RRID:SCR_008911) Copy
http://harvester.fzk.de/harvester/
Harvester is a Web-based tool that bulk-collects bioinformatic data on human proteins from various databases and prediction servers. It is a meta search engine for gene and protein information. It searches 16 major databases and prediction servers and combines the results on pregenerated HTML pages. In this way Harvester can provide comprehensive gene-protein information from different servers in a convenient and fast manner. As full text meta search engine, similar to Google trade mark, Harvester allows screening of the whole genome proteome for current protein functions and predictions in a few seconds. With Harvester it is now possible to compare and check the quality of different database entries and prediction algorithms on a single page. Sponsors: This work has been supported by the BMBF with grants 01GR0101 and 01KW0013.
Proper citation: Bioinformatic Harvester IV (beta) at Karlsruhe Institute of Technology (RRID:SCR_008017) Copy
http://hdbase.org/cgi-bin/welcome.cgi
A community website for Huntington''s Disease (HD) research that currently contains Y2H and Mass spectrometry protein-protein interaction data centered around the HD protein (huntingtin) and information on therapeutic studies in mouse. Also available are raw Human and Mouse Affymetrix Microarray data. The protein interaction data is from several sources, including interactions curated from the literature by ISB staff, experimentally determined interactions produced by Bob Hughes and colleagues at Prolexys (currently password protected), and interactions reported in a recent publication by Goehler et al from Eric Wanker''s lab. Content areas that may be covered by the site include the following: * Therapeutic studies in mouse, primarily drug screens. * HD mouse models with a focus on timelines of disease progression. * Antibodies used in HD research. * Microarray gene expression studies. * Genes and proteins relevant to HD research. This includes HD itself, the growing list of proteins thought to interact directly or indirectly with huntingtin (Htt), and other genes and proteins implicated in the disease process. * Molecular pathways thought to be involved in the disease process. * Timelines of disease for Mouse models
Proper citation: HDBase (RRID:SCR_007132) Copy
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the NIF Resources search. From here you can search through a compilation of resources used by NIF and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that NIF has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on NIF then you can log in from here to get additional features in NIF such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into NIF you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the sources that were queried against in your search that you can investigate further.
Here are the categories present within NIF that you can filter your data on
Here are the subcategories present within this category that you can filter your data on
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.