Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
http://umber.sbs.man.ac.uk/dbbrowser/bioie/
BioIE is a rule-based system that extracts informative sentences relating to protein families, their structures, functions and diseases from the biomedical literature. Based on manual definition of templates and rules, it aims at precise sentence extraction rather than wide recall. After uploading source text or retrieving abstracts from MEDLINE, users can extract sentences based on predefined or user-defined template categories. BioIE also provides a brief insight into the syntactic and semantic context of the source-text by looking at word, N-gram and MeSH-term distributions. Important Applications of BioIE are in, for example, annotation of microarray data and of protein databases.
Proper citation: BioIE: Extracting Informative Sentences From the Biomedical Literature (RRID:SCR_013464) Copy
http://commonfund.nih.gov/Proteincapture/
Program that is developing new resources and tools to understand the critical role the multitude of cellular proteins play in normal development and health as well as in disease. These resources will support a wide-range of research and clinical applications that will enable the isolation and tracking of proteins of interest and permit their use as diagnostic biomarkers of disease onset and progression. The program is being implemented in phases, with three Funding Opportunity Announcements (FOAs): * FOA 1: Antigen Production (RFA-RM-10-007) To produce human transcription factor antigens for making monoclonal antibodies or other affinity capture reagents; this effort is already underway. * FOA 2: Anti-Transcription Factor Antibodies Production (RFA-RM-10-017) To optimize and scale anti-transcription factor capture reagent production to develop a community antibody resource. * FOA 3: New Reagent Technology Development and Piloting (RFA-RM-10-018) To develop improvements in the reagent production pipeline with regard to quality, utility, cost, and production scalability. To understand what makes a cell function normally and what may go awry in disease, we need better tools and resources, such as renewable protein capture reagents and probes, to study how proteins work in isolation and how they interact with other proteins, carbohydrates, or DNA regions within a cell. Ideally, this resource would allow us to identify and isolate all proteins within cells, in their various forms the so called proteome to ensure broad application in research and clinical studies aimed at understanding, preventing, detecting and treating disease. Existing protein capture reagents, such monoclonal antibodies, have been developed for a number of protein targets, although these represent only a subset of all proteins comprising the human proteome. In addition, many monoclonal antibodies lack the desired level of specificity and do not reliably target only the protein of interest. This is particularly problematic given the multiple forms of any one protein and the broad range of protein types in the body. The Protein Capture Reagents Program is organized as a pilot program using transcription factors as a test case to examine the feasibility and value of generating a community resource of low cost, renewable affinity reagents for all human proteins. The reagents must be specifically designed for high quality and broad experimental utility in order to meet the growing demands of biomedical researchers. Based on what is learned from these funding initiatives, the program may expand to a larger production effort to provide a broad community resource of human protein capture reagents.
Proper citation: Common Fund Protein Capture Reagents (RRID:SCR_006570) Copy
http://podb.nibb.ac.jp/Organellome/
Database of images, movies, and protocols to promote a comprehensive understanding of plant organelle dynamics, including organelle function, biogenesis, differentiation, movement, and interactions with other organelles. It consists of 5 individual parts, ''Perceptive Organelles Database'', ''The Organelles Movie Database'', ''The Organellome Database'', ''The Functional Analysis Database'', and ''External Links to other databases and Web pages''. All the data and protocols in ''The Organelle Movie Database'', ''The Organellome Database'' and ''The Functional Analysis Database'' are populated by direct submission of experimentally determined data from plant researchers. Your active contributions by submission of data and protocols to our database would also be appreciated. * Perceptive Organelles Database: This database contains images and movies of organelles in various tissues during different developmental stages in response to environmental stimuli. * Organelles Movie Database: This database contains time-lapse images, Z slices and projection images of organelles in various tissues during different developmental stages, visualized using fluorescent and non-fluorescent probes. * Organellome Database: This database contains images for cellular structures that are composed of organelle images in various tissues during different developmental stages, visualized with fluorescent and non-fluorescent probes. * Functional Analysis Database: This database is a collection of protocols for plant organelle research. * External Links: Access to biological databases.
Proper citation: Plant Organelles Database (RRID:SCR_006520) Copy
http://www.ebi.ac.uk/pdbe/emdb/
Repository for electron microscopy density maps of macromolecular complexes and subcellular structures at Protein Data Bank in Europe. Covers techniques, including single-particle analysis, electron tomography, and electron (2D) crystallography.
Proper citation: Electron Microscopy Data Bank at PDBe (MSD-EBI) (RRID:SCR_006506) Copy
http://www.informatics.jax.org/expression.shtml
Community database that collects and integrates the gene expression information in MGI with a primary emphasis on endogenous gene expression during mouse development. The data in GXD are obtained from the literature, from individual laboratories, and from large-scale data providers. All data are annotated and reviewed by GXD curators. GXD stores and integrates different types of expression data (RNA in situ hybridization; Immunohistochemistry; in situ reporter (knock in); RT-PCR; Northern and Western blots; and RNase and Nuclease s1 protection assays) and makes these data freely available in formats appropriate for comprehensive analysis. There is particular emphasis on endogenous gene expression during mouse development. GXD also maintains an index of the literature examining gene expression in the embryonic mouse. It is comprehensive and up-to-date, containing all pertinent journal articles from 1993 to the present and articles from major developmental journals from 1990 to the present. GXD stores primary data from different types of expression assays and by integrating these data, as data accumulate, GXD provides increasingly complete information about the expression profiles of transcripts and proteins in different mouse strains and mutants. GXD describes expression patterns using an extensive, hierarchically-structured dictionary of anatomical terms. In this way, expression results from assays with differing spatial resolution are recorded in a standardized and integrated manner and expression patterns can be queried at different levels of detail. The records are complemented with digitized images of the original expression data. The Anatomical Dictionary for Mouse Development has been developed by our Edinburgh colleagues, as part of the joint Mouse Gene Expression Information Resource project. GXD places the gene expression data in the larger biological context by establishing and maintaining interconnections with many other resources. Integration with MGD enables a combined analysis of genotype, sequence, expression, and phenotype data. Links to PubMed, Online Mendelian Inheritance in Man (OMIM), sequence databases, and databases from other species further enhance the utility of GXD. GXD accepts both published and unpublished data.
Proper citation: Gene Expression Database (RRID:SCR_006539) Copy
It facilitates the search for and dissemination of mass spectra from biologically active metabolites quantified using Gas chromatography (GC) coupled to mass spectrometry (MS). Use the Search Page to search for a compound of your interest, using the name, mass, formula, InChI etc. as query input. Additionally, a Library Search service enables the search of user submitted mass spectra within the GMD. In parallel to the library search, a prediction of chemical sub-groups is performed. This approach has reached beta level and a publication is currently under review. Using several sub-group specific Decision Trees (DTs), mass spectra are classified with respect to the presence of the chemical moieties within the linked (unknown) compound. Prediction of functional groups (ms analysis) facilitates the search of metabolites within the GMD by means of user submitted GC-MS spectra consisting of retention index (n-alkanes, if vailable) and mass intensities ratios. In addition, a functional group prediction will help to characterize those metabolites without available reference mass spectra included in the GMD so far. Instead, the unknown metabolite is characterized by predicted presence or absence of functional groups. For power users this functionality presented here is exposed as soap based web services. Functional group prediction of compounds by means of GC-EI-MS spectra using Microsoft analysis service decision trees All currently available trained decision trees and sub-structure predictions provided by the GMD interface. Table describes the functional group, optional use of an RI system, record date of the trained decision tree, number of MSTs with proportion of MSTs linked to metabolites with the functional group present for each tree. Average and standard deviation of the 50-fold CV error, namely the ratio false over correctly sorted MSTs in the trained DT, are listed. The GMD website offers a range of mass spectral reference libraries to academic users which can be downloaded free of charge in various electronic formats. The libraries are constituted by base peak normalized consensus spectra of single analytes and contain masses in the range 70 to 600 amu, while the ubiquitous mass fragments typically generated from compounds carrying a trimethylsilyl-moiety, namely the fragments at m/z 73, 74, 75, 147, 148, and 149, were excluded.
Proper citation: GMD (RRID:SCR_006625) Copy
http://espript.ibcp.fr/ESPript/ESPript/
A utility, whose output is a PostScript file of aligned sequences with graphical enhancements. Its main input is an ascii file of pre-aligned sequences. Optional files allow further rendering. The program calculates a similarity score for each residue of the aligned sequences. The output shows: * Secondary Structures * Aligned sequences * Similarities * Accessibility * Hydropathy * User-supplied markers * Intermolecular contacts In addition, similarity score can be written in the bfactor column of a pdb file, to enable direct display of highly conserved areas. You can run ESPript from this server with the HTML interface. It is configured for a maximum of 1,000 sequences. Links to webESPript * ENDscript: you can upload a PDB file or enter a PDB code such as 1M85. The programs DSSP and CNS are executed via the interface, so as to obtain an ESPript figure with a lot of structural information (secondary structure elements, intermolecular contacts). You can also find homologous sequences with a BLAST search, perform multiple sequence alignments with MULTALIN or CLUSTALW and create an image with BOBSCRIPT or MOLSCRIPT to show similarities on your 3D structure. * ProDom: you can enter a sequence identifier to find homologous domains, perform multiple sequence alignments with MULTALIN and click on the link to ESPript. * Predict Protein: you can receive a mail in text (do not use the HTML option when you submit your request in Predict Protein) with aligned sequences and numerous information including secondary structure prediction. Click on a special html link to upload your mail in ESPript. * NPS(at): you can execute the programs BLAST and CLUSTALW to obtain multiple alignments. You can predict secondary structure elements and click on the link to ESPript. This program started in the laboratory of Dr Richard Wade at the Institut de Biologie Structurale, Grenoble. It moved later to the Laboratory of Molecular Biophysics in Oxford, then to the Institut de Pharmacologie et de Biologie Structurale in Toulouse. It is now developed in the Laboratoire de BioCristallographie of Dr Richard Haser, Institut de Biologie et de Chimie des Prot��������ines, Lyon and in the Laboratoire de Biologie Mol��������culaire et de Relations Plantes-Organismes, group of Dr Daniel Kahn, Institut National de la Recherche Agronomique de Toulouse.
Proper citation: ESPript 2.2 (RRID:SCR_006587) Copy
The Global Proteome Machine Organization was set up so that scientists involved in proteomics using tandem mass spectrometry could use that data to analyze proteomes. The projects supported by the GPMO have been selected to improve the quality of analysis, make the results portable and to provide a common platform for testing and validating proteomics results. The Global Proteome Machine Database was constructed to utilize the information obtained by GPM servers to aid in the difficult process of validating peptide MS/MS spectra as well as protein coverage patterns. This database has been integrated into GPM server pages, allowing users to quickly compare their experimental results with the best results that have been previously observed by other scientists.
Proper citation: Global Proteome Machine Database (GPM DB) (RRID:SCR_006617) Copy
http://ligand-expo.rutgers.edu/
An integrated data resource for finding chemical and structural information about small molecules bound to proteins and nucleic acids within the structure entries of the Protein Data Bank. Tools are provided to search the PDB dictionary for chemical components, to identify structure entries containing particular small molecules, and to download the 3D structures of the small molecule components in the PDB entry. A sketch tool is also provided for building new chemical definitions from reported PDB chemical components.
Proper citation: Ligand Expo (RRID:SCR_006636) Copy
DPVweb provides a central source of information about viruses, viroids and satellites of plants, fungi and protozoa. Comprehensive taxonomic information, including brief descriptions of each family and genus, and classified lists of virus sequences are provided. The database also holds detailed, curated, information for all sequences of viruses, viroids and satellites of plants, fungi and protozoa that are complete or that contain at least one complete gene. For comparative purposes, it also contains a single representative sequence of all other fully sequenced virus species with an RNA or single-stranded DNA genome. The start and end positions of each feature (gene, non-translated region and the like) have been recorded and checked for accuracy. As far as possible, nomenclature for genes and proteins are standardized within genera and families. Sequences of features (either as DNA or amino acid sequences) can be directly downloaded from the website in FASTA format. The sequence information can also be accessed via client software for PC computers (freely downloadable from the website) that enable users to make an easy selection of sequences and features of a chosen virus for further analyses. The public sequence databases contain vast amounts of data on virus genomes but accessing and comparing the data, except for relatively small sets of related viruses can be very time consuming. The procedure is made difficult because some of the sequences on these databases are incorrectly named, poorly annotated or redundant. The NCBI Reference Sequence project (1) provides a comprehensive, integrated, non-redundant set of sequences, including genomic DNA, transcript (RNA) and protein products, for major research organisms. This now includes curated information for a single sequence of each fully sequenced virus species. While this is a welcome development, it can only deal with complete sequences. An important feature of DPV is the opportunity to access genes (and other features) of multiple sequences quickly and accurately. Thus, for example, it is easy to obtain the nucleotide or amino acid sequences of all the available accessions of the coat protein gene of a given virus species or for a group of viruses. To increase its usefulness further, DPVweb also contains a single representative sequence of all other fully sequenced virus species with an RNA or single-stranded DNA (ssDNA) genome. Sponsors: This site is supported by the Association of Applied Biologists and the Zhejiang Academy of Agricultural Sciences, Hangzhou, People''s Republic of China.
Proper citation: Descriptions of Plant Viruses (RRID:SCR_006656) Copy
Publicly available database of the genes, proteins, experimentally-verified interactions and signaling pathways involved in the innate immune response of humans, mice and bovines to microbial infection. The database captures coverage of the innate immunity interactome by integrating known interactions and pathways from major public databases together with manually-curated data into a centralized resource. The database can be mined as a knowledgebase or used with the integrated bioinformatics and visualization tools for the systems level analysis of the innate immune response. Although InnateDB curation focuses on innate immunity-relevant interactions and pathways, it also incorporates detailed annotation on the entire human, mouse and bovine interactomes by integrating data (178,000+ interactions & 3,900+ pathways) from several of the major public interaction and pathway databases. InnateDB also has integrated human, mouse and bovine orthology predictions generated using Ortholgue software. Ortholgue uses a phylogenetic distance-based method to identify possible paralogs in high-throughput orthology predictions. Integrated human and mouse conserved gene order and synteny information has also been determined to provide further support for orthology predictions. InnateDB Capabilities: * View statistics for manually-curated innate immunity relevant molecular interactions. New manually curated interactions are submitted weekly. * Search for genes and proteins of interest. * Search for experimentally-verified molecular interactions by gene/protein name, interaction type, cell type, etc. * Search genes/interactions belonging to 3,900 pathways. * Visualize interactions using an intuitive subcellular localization-based layout in Cerebral. * Upload your own list of genes along with associated gene expression data (from up to 10 experimental conditions) to interactively analyze this data in a molecular interaction network context. Once you have uploaded your data, you will be able to interactively visualize interaction networks with expression data overlaid; carry out Pathway, Gene Ontology and Transcription Factor Binding Site over-representation analyses; construct orthologous interaction networks in other species; and much more. * Access curated interaction data via a dedicated PSICQUIC webservice.
Proper citation: InnateDB (RRID:SCR_006714) Copy
http://inparanoid.sbc.su.se/cgi-bin/index.cgi
Collection of pairwise comparisons between 100 whole genomes generated by a fully automatic method for finding orthologs and in-paralogs between TWO species. Ortholog clusters in the InParanoid are seeded with a two-way best pairwise match, after which an algorithm for adding in-paralogs is applied. The method bypasses multiple alignments and phylogenetic trees, which can be slow and error-prone steps in classical ortholog detection. Still, it robustly detects complex orthologous relationships and assigns confidence values for in-paralogs. The original data sets can be downloaded.
Proper citation: InParanoid: Eukaryotic Ortholog Groups (RRID:SCR_006801) Copy
Software for a Laboratory Information Management System (LIMS) developed to support the unpredictable workflows of Molecular biology and Protein production labs of all sizes.
Proper citation: PiMS (RRID:SCR_011816) Copy
Web based instant protein network modeler for newly sequenced species. Web server designed to instantly construct genome scale protein networks using protein sequence data. Provides network visualization, analysis pages and solution for instant network modeling of newly sequenced species.
Proper citation: JiffyNet (RRID:SCR_011954) Copy
http://pathways.mcdb.ucla.edu/algal/
Tools to search gene lists for functional term enrichment as well as to dynamically visualize proteins onto pathway maps. Additionally, integrated expression data may be used to discover similarly expressed genes based on a starting gene of interest.
Proper citation: Algal Functional Annotation Tool (RRID:SCR_012034) Copy
Software package for Bayesian analysis of protein, DNA and RNA sequences. It utilizes multiple alignments, phylogenetic trees and evolutionary parameters to quantify uncertainty in these analyses. It is written in Java.
Proper citation: StatAlign (RRID:SCR_001892) Copy
http://abi.inf.uni-tuebingen.de/Services/YLoc/webloc.cgi
An interpretable web server for predicting subcellular localization. In addition to the predicted location, YLoc gives a reasoning why this prediction was made and which biological properties of the protein sequence lead to this prediction. Moreover, a confidence estimate helps users to rate predictions as trustworthy. YLoc+ is able to predict the location of multiple-targeted proteins with high accuracy. The YLoc webserver is also accessible via SOAP.
Proper citation: YLoc (RRID:SCR_002464) Copy
http://genome.unmc.edu/ngLOC/index.html
THIS RESOURCE IS NO LONGER IN SERVICE. Documented on January 5, 2023.An n-gram-based Bayesian classifier that predicts subcellular localization of proteins both in prokaryotes and eukaryotes. The downloadable version of this software with source code is freely available for academic use under the GNU General Public License.
Proper citation: ngLOC (RRID:SCR_003150) Copy
http://www.nactem.ac.uk/facta/
Text mining tool to discover associations between biomedical concepts from MEDLINE articles. Use the service from your browser or via a Web Service. The whole MEDLINE corpus containing more than 20 million articles is indexed with an efficient text search engine, and it allows you to navigate such associations and their textual evidence in a highly interactive manner - the system accepts arbitrary query terms and displays relevant concepts immediately. A broad range of important biomedical concepts are covered by the combination of a machine learning-based term recognizer and large-scale dictionaries for genes, proteins, diseases, and chemical compounds. There is also a FACTA+ visualization service that can be found here: http://www.nactem.ac.uk/facta-visualizer/
Proper citation: FACTA+. (RRID:SCR_001767) Copy
https://www.ncbi.nlm.nih.gov/geo/
Functional genomics data repository supporting MIAME-compliant data submissions. Includes microarray-based experiments measuring the abundance of mRNA, genomic DNA, and protein molecules, as well as non-array-based technologies such as serial analysis of gene expression (SAGE) and mass spectrometry proteomic technology. Array- and sequence-based data are accepted. Collection of curated gene expression DataSets, as well as original Series and Platform records. The database can be searched using keywords, organism, DataSet type and authors. DataSet records contain additional resources including cluster tools and differential expression queries.
Proper citation: Gene Expression Omnibus (GEO) (RRID:SCR_005012) Copy
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the NIF Resources search. From here you can search through a compilation of resources used by NIF and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that NIF has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on NIF then you can log in from here to get additional features in NIF such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into NIF you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the sources that were queried against in your search that you can investigate further.
Here are the categories present within NIF that you can filter your data on
Here are the subcategories present within this category that you can filter your data on
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.