Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
http://ccmbweb.ccv.brown.edu/gibbs/gibbs.html
Software to identify motifs, conserved regions, in DNA or protein sequences.
Proper citation: Gibbs Motif Sampler (RRID:SCR_002550) Copy
Maintains and provides archival, retrieval and analytical resources for biological information. Central DDBJ resource consists of public, open-access nucleotide sequence databases including raw sequence reads, assembly information and functional annotation. Database content is exchanged with EBI and NCBI within the framework of the International Nucleotide Sequence Database Collaboration (INSDC). In 2011, DDBJ launched two new resources: DDBJ Omics Archive and BioProject. DOR is archival database of functional genomics data generated by microarray and highly parallel new generation sequencers. Data are exchanged between the ArrayExpress at EBI and DOR in the common MAGE-TAB format. BioProject provides organizational framework to access metadata about research projects and data from projects that are deposited into different databases.
Proper citation: DNA DataBank of Japan (DDBJ) (RRID:SCR_002359) Copy
http://funsimmat.bioinf.mpi-inf.mpg.de
FunSimMat is a comprehensive resource of semantic and functional similarity values. It allows ranking disease candidate proteins for OMIM diseases and searching for functional similarity values for proteins (extracted from UniProt), and protein families (Pfam, SMART). FunSimMat provides several different semantic and functional similarity measures for each protein pair using the Gene Ontology annotation from UniProtKB and the Gene Ontology Annotation project at EBI (GOA). There are several search options available: Disease candidate prioritization: * Rank candidate proteins using any OMIM disease entry * Compare a list of proteins to any OMIM disease entry * Compare all human proteins to any OMIM disease entry Functional similarity: * Compare one protein / protein family to a list of proteins / protein families * Compare a list of GO terms to a list of proteins / protein families Semantic similarity: * For a list of GO terms, FunSimMat performs an all-against-all comparison and displays the semantic similarity values. FunSimMat provides an XML-RPC interface for performing automatic queries and processing of the results as well as a RestLike Interface. Platform: Online tool
Proper citation: FunSimMat (RRID:SCR_002729) Copy
Database of known and predicted mammalian and eukaryotic protein-protein interactions, it is designed to be both a resource for the laboratory scientist to explore known and predicted protein-protein interactions, and to facilitate bioinformatics initiatives exploring protein interaction networks. It has been built by mapping high-throughput (HTP) data between species. Thus, until experimentally verified, these interactions should be considered predictions. It remains one of the most comprehensive sources of known and predicted eukaryotic PPI. It contains 490,600 Source Interactions, 370,002 Predicted Interactions, for a total of 846,116 interactions, and continues to expand as new protein-protein interaction data becomes available.
Proper citation: I2D (RRID:SCR_002957) Copy
http://www.humanproteinpedia.org/
A community portal for sharing and integration of human protein data that allows research laboratories to contribute and maintain protein annotations. The Human Protein Reference Database (HPRD) integrates data that is deposited along with the existing literature curated information in the context of an individual protein. Data pertaining to post-translational modifications, protein-protein interactions, tissue expression, expression in cell lines, subcellular localization and enzyme substrate relationships can be submitted.
Proper citation: Human Proteinpedia (RRID:SCR_002948) Copy
Computational biology research at Memorial Sloan-Kettering Cancer Center (MSKCC) pursues computational biology research projects and the development of bioinformatics resources in the areas of: sequence-structure analysis; gene regulation; molecular pathways and networks, and diagnostic and prognostic indicators. The mission of cBio is to move the theoretical methods and genome-scale data resources of computational biology into everyday laboratory practice and use, and is reflected in the organization of cBio into research and service components ~ the intention being that new computational methods created through the process of scientific inquiry should be generalized and supported as open-source and shared community resources. Faculty from cBio participate in graduate training provided through the following graduate programs: * Gerstner Sloan-Kettering Graduate School of Biomedical Sciences * Graduate Training Program in Computational Biology and Medicine Integral to much of the research and service work performed by cBio is the creation and use of software tools and data resources. The tools that we have created and utilize provide evidence of our involvement in the following areas: * Cancer Genomics * Data Repositories * iPhone & iPod Touch * microRNAs * Pathways * Protein Function * Text Analysis * Transcription Profiling
Proper citation: Computational Biology Center (RRID:SCR_002877) Copy
http://bibiserv.techfak.uni-bielefeld.de/dialign/
Tool for multiple sequence alignment using various sources of external information that is particularly useful to detect local homologies in sequences with low overall similarity. While standard alignment methods rely on comparing single residues and imposing gap penalties, DIALIGN constructs pairwise and multiple alignments by comparing entire segments of the sequences. No gap penalty is used. This approach can be used for both global and local alignment, but it is particularly successful in situations where sequences share only local homologies. Several versions of DIALIGN are available online at GOBICS, http://dialign.gobics.de/
Proper citation: DIALIGN (RRID:SCR_003041) Copy
http://lab.rockefeller.edu/tuschl/
RNA is not only a carrier of genetic information, but also a catalyst and a guide for sequence-specific recognition and processing of other RNA molecules. This lab investigates the regulatory mechanisms of RNA interference, RNA-mediated translational control, and nuclear pre-mRNA splicing. Classical and combinatorial biochemical techniques are used to analyze the function of the RNA- and protein-components involved in those processes.
Proper citation: Tuschl Laboratory: RNA Molecular Biology (RRID:SCR_002866) Copy
It facilitates the search for and dissemination of mass spectra from biologically active metabolites quantified using Gas chromatography (GC) coupled to mass spectrometry (MS). Use the Search Page to search for a compound of your interest, using the name, mass, formula, InChI etc. as query input. Additionally, a Library Search service enables the search of user submitted mass spectra within the GMD. In parallel to the library search, a prediction of chemical sub-groups is performed. This approach has reached beta level and a publication is currently under review. Using several sub-group specific Decision Trees (DTs), mass spectra are classified with respect to the presence of the chemical moieties within the linked (unknown) compound. Prediction of functional groups (ms analysis) facilitates the search of metabolites within the GMD by means of user submitted GC-MS spectra consisting of retention index (n-alkanes, if vailable) and mass intensities ratios. In addition, a functional group prediction will help to characterize those metabolites without available reference mass spectra included in the GMD so far. Instead, the unknown metabolite is characterized by predicted presence or absence of functional groups. For power users this functionality presented here is exposed as soap based web services. Functional group prediction of compounds by means of GC-EI-MS spectra using Microsoft analysis service decision trees All currently available trained decision trees and sub-structure predictions provided by the GMD interface. Table describes the functional group, optional use of an RI system, record date of the trained decision tree, number of MSTs with proportion of MSTs linked to metabolites with the functional group present for each tree. Average and standard deviation of the 50-fold CV error, namely the ratio false over correctly sorted MSTs in the trained DT, are listed. The GMD website offers a range of mass spectral reference libraries to academic users which can be downloaded free of charge in various electronic formats. The libraries are constituted by base peak normalized consensus spectra of single analytes and contain masses in the range 70 to 600 amu, while the ubiquitous mass fragments typically generated from compounds carrying a trimethylsilyl-moiety, namely the fragments at m/z 73, 74, 75, 147, 148, and 149, were excluded.
Proper citation: GMD (RRID:SCR_006625) Copy
http://espript.ibcp.fr/ESPript/ESPript/
A utility, whose output is a PostScript file of aligned sequences with graphical enhancements. Its main input is an ascii file of pre-aligned sequences. Optional files allow further rendering. The program calculates a similarity score for each residue of the aligned sequences. The output shows: * Secondary Structures * Aligned sequences * Similarities * Accessibility * Hydropathy * User-supplied markers * Intermolecular contacts In addition, similarity score can be written in the bfactor column of a pdb file, to enable direct display of highly conserved areas. You can run ESPript from this server with the HTML interface. It is configured for a maximum of 1,000 sequences. Links to webESPript * ENDscript: you can upload a PDB file or enter a PDB code such as 1M85. The programs DSSP and CNS are executed via the interface, so as to obtain an ESPript figure with a lot of structural information (secondary structure elements, intermolecular contacts). You can also find homologous sequences with a BLAST search, perform multiple sequence alignments with MULTALIN or CLUSTALW and create an image with BOBSCRIPT or MOLSCRIPT to show similarities on your 3D structure. * ProDom: you can enter a sequence identifier to find homologous domains, perform multiple sequence alignments with MULTALIN and click on the link to ESPript. * Predict Protein: you can receive a mail in text (do not use the HTML option when you submit your request in Predict Protein) with aligned sequences and numerous information including secondary structure prediction. Click on a special html link to upload your mail in ESPript. * NPS(at): you can execute the programs BLAST and CLUSTALW to obtain multiple alignments. You can predict secondary structure elements and click on the link to ESPript. This program started in the laboratory of Dr Richard Wade at the Institut de Biologie Structurale, Grenoble. It moved later to the Laboratory of Molecular Biophysics in Oxford, then to the Institut de Pharmacologie et de Biologie Structurale in Toulouse. It is now developed in the Laboratoire de BioCristallographie of Dr Richard Haser, Institut de Biologie et de Chimie des Prot��������ines, Lyon and in the Laboratoire de Biologie Mol��������culaire et de Relations Plantes-Organismes, group of Dr Daniel Kahn, Institut National de la Recherche Agronomique de Toulouse.
Proper citation: ESPript 2.2 (RRID:SCR_006587) Copy
The Global Proteome Machine Organization was set up so that scientists involved in proteomics using tandem mass spectrometry could use that data to analyze proteomes. The projects supported by the GPMO have been selected to improve the quality of analysis, make the results portable and to provide a common platform for testing and validating proteomics results. The Global Proteome Machine Database was constructed to utilize the information obtained by GPM servers to aid in the difficult process of validating peptide MS/MS spectra as well as protein coverage patterns. This database has been integrated into GPM server pages, allowing users to quickly compare their experimental results with the best results that have been previously observed by other scientists.
Proper citation: Global Proteome Machine Database (GPM DB) (RRID:SCR_006617) Copy
http://commonfund.nih.gov/Proteincapture/
Program that is developing new resources and tools to understand the critical role the multitude of cellular proteins play in normal development and health as well as in disease. These resources will support a wide-range of research and clinical applications that will enable the isolation and tracking of proteins of interest and permit their use as diagnostic biomarkers of disease onset and progression. The program is being implemented in phases, with three Funding Opportunity Announcements (FOAs): * FOA 1: Antigen Production (RFA-RM-10-007) To produce human transcription factor antigens for making monoclonal antibodies or other affinity capture reagents; this effort is already underway. * FOA 2: Anti-Transcription Factor Antibodies Production (RFA-RM-10-017) To optimize and scale anti-transcription factor capture reagent production to develop a community antibody resource. * FOA 3: New Reagent Technology Development and Piloting (RFA-RM-10-018) To develop improvements in the reagent production pipeline with regard to quality, utility, cost, and production scalability. To understand what makes a cell function normally and what may go awry in disease, we need better tools and resources, such as renewable protein capture reagents and probes, to study how proteins work in isolation and how they interact with other proteins, carbohydrates, or DNA regions within a cell. Ideally, this resource would allow us to identify and isolate all proteins within cells, in their various forms the so called proteome to ensure broad application in research and clinical studies aimed at understanding, preventing, detecting and treating disease. Existing protein capture reagents, such monoclonal antibodies, have been developed for a number of protein targets, although these represent only a subset of all proteins comprising the human proteome. In addition, many monoclonal antibodies lack the desired level of specificity and do not reliably target only the protein of interest. This is particularly problematic given the multiple forms of any one protein and the broad range of protein types in the body. The Protein Capture Reagents Program is organized as a pilot program using transcription factors as a test case to examine the feasibility and value of generating a community resource of low cost, renewable affinity reagents for all human proteins. The reagents must be specifically designed for high quality and broad experimental utility in order to meet the growing demands of biomedical researchers. Based on what is learned from these funding initiatives, the program may expand to a larger production effort to provide a broad community resource of human protein capture reagents.
Proper citation: Common Fund Protein Capture Reagents (RRID:SCR_006570) Copy
http://podb.nibb.ac.jp/Organellome/
Database of images, movies, and protocols to promote a comprehensive understanding of plant organelle dynamics, including organelle function, biogenesis, differentiation, movement, and interactions with other organelles. It consists of 5 individual parts, ''Perceptive Organelles Database'', ''The Organelles Movie Database'', ''The Organellome Database'', ''The Functional Analysis Database'', and ''External Links to other databases and Web pages''. All the data and protocols in ''The Organelle Movie Database'', ''The Organellome Database'' and ''The Functional Analysis Database'' are populated by direct submission of experimentally determined data from plant researchers. Your active contributions by submission of data and protocols to our database would also be appreciated. * Perceptive Organelles Database: This database contains images and movies of organelles in various tissues during different developmental stages in response to environmental stimuli. * Organelles Movie Database: This database contains time-lapse images, Z slices and projection images of organelles in various tissues during different developmental stages, visualized using fluorescent and non-fluorescent probes. * Organellome Database: This database contains images for cellular structures that are composed of organelle images in various tissues during different developmental stages, visualized with fluorescent and non-fluorescent probes. * Functional Analysis Database: This database is a collection of protocols for plant organelle research. * External Links: Access to biological databases.
Proper citation: Plant Organelles Database (RRID:SCR_006520) Copy
http://www.ebi.ac.uk/pdbe/emdb/
Repository for electron microscopy density maps of macromolecular complexes and subcellular structures at Protein Data Bank in Europe. Covers techniques, including single-particle analysis, electron tomography, and electron (2D) crystallography.
Proper citation: Electron Microscopy Data Bank at PDBe (MSD-EBI) (RRID:SCR_006506) Copy
http://www.informatics.jax.org/expression.shtml
Community database that collects and integrates the gene expression information in MGI with a primary emphasis on endogenous gene expression during mouse development. The data in GXD are obtained from the literature, from individual laboratories, and from large-scale data providers. All data are annotated and reviewed by GXD curators. GXD stores and integrates different types of expression data (RNA in situ hybridization; Immunohistochemistry; in situ reporter (knock in); RT-PCR; Northern and Western blots; and RNase and Nuclease s1 protection assays) and makes these data freely available in formats appropriate for comprehensive analysis. There is particular emphasis on endogenous gene expression during mouse development. GXD also maintains an index of the literature examining gene expression in the embryonic mouse. It is comprehensive and up-to-date, containing all pertinent journal articles from 1993 to the present and articles from major developmental journals from 1990 to the present. GXD stores primary data from different types of expression assays and by integrating these data, as data accumulate, GXD provides increasingly complete information about the expression profiles of transcripts and proteins in different mouse strains and mutants. GXD describes expression patterns using an extensive, hierarchically-structured dictionary of anatomical terms. In this way, expression results from assays with differing spatial resolution are recorded in a standardized and integrated manner and expression patterns can be queried at different levels of detail. The records are complemented with digitized images of the original expression data. The Anatomical Dictionary for Mouse Development has been developed by our Edinburgh colleagues, as part of the joint Mouse Gene Expression Information Resource project. GXD places the gene expression data in the larger biological context by establishing and maintaining interconnections with many other resources. Integration with MGD enables a combined analysis of genotype, sequence, expression, and phenotype data. Links to PubMed, Online Mendelian Inheritance in Man (OMIM), sequence databases, and databases from other species further enhance the utility of GXD. GXD accepts both published and unpublished data.
Proper citation: Gene Expression Database (RRID:SCR_006539) Copy
DPVweb provides a central source of information about viruses, viroids and satellites of plants, fungi and protozoa. Comprehensive taxonomic information, including brief descriptions of each family and genus, and classified lists of virus sequences are provided. The database also holds detailed, curated, information for all sequences of viruses, viroids and satellites of plants, fungi and protozoa that are complete or that contain at least one complete gene. For comparative purposes, it also contains a single representative sequence of all other fully sequenced virus species with an RNA or single-stranded DNA genome. The start and end positions of each feature (gene, non-translated region and the like) have been recorded and checked for accuracy. As far as possible, nomenclature for genes and proteins are standardized within genera and families. Sequences of features (either as DNA or amino acid sequences) can be directly downloaded from the website in FASTA format. The sequence information can also be accessed via client software for PC computers (freely downloadable from the website) that enable users to make an easy selection of sequences and features of a chosen virus for further analyses. The public sequence databases contain vast amounts of data on virus genomes but accessing and comparing the data, except for relatively small sets of related viruses can be very time consuming. The procedure is made difficult because some of the sequences on these databases are incorrectly named, poorly annotated or redundant. The NCBI Reference Sequence project (1) provides a comprehensive, integrated, non-redundant set of sequences, including genomic DNA, transcript (RNA) and protein products, for major research organisms. This now includes curated information for a single sequence of each fully sequenced virus species. While this is a welcome development, it can only deal with complete sequences. An important feature of DPV is the opportunity to access genes (and other features) of multiple sequences quickly and accurately. Thus, for example, it is easy to obtain the nucleotide or amino acid sequences of all the available accessions of the coat protein gene of a given virus species or for a group of viruses. To increase its usefulness further, DPVweb also contains a single representative sequence of all other fully sequenced virus species with an RNA or single-stranded DNA (ssDNA) genome. Sponsors: This site is supported by the Association of Applied Biologists and the Zhejiang Academy of Agricultural Sciences, Hangzhou, People''s Republic of China.
Proper citation: Descriptions of Plant Viruses (RRID:SCR_006656) Copy
http://inparanoid.sbc.su.se/cgi-bin/index.cgi
Collection of pairwise comparisons between 100 whole genomes generated by a fully automatic method for finding orthologs and in-paralogs between TWO species. Ortholog clusters in the InParanoid are seeded with a two-way best pairwise match, after which an algorithm for adding in-paralogs is applied. The method bypasses multiple alignments and phylogenetic trees, which can be slow and error-prone steps in classical ortholog detection. Still, it robustly detects complex orthologous relationships and assigns confidence values for in-paralogs. The original data sets can be downloaded.
Proper citation: InParanoid: Eukaryotic Ortholog Groups (RRID:SCR_006801) Copy
THIS RESOURCE IS NO LONGER IN SERVICE, documented on August 27, 2019.
Database for those interested in the consequences of Factor VIII genetic variation at the DNA and protein level, it provides access to data on the molecular pathology of haemophilia A. The database presents a review of the structure and function of factor VIII and the molecular genetics of haemophilia A, a real time update of the biostatistics of each parameter in the database, a molecular model of the A1, A2 and A3 domains of the factor VIII protein (based on the crystal structure of caeruloplasmin) and a bulletin board for discussion of issues in the molecular biology of factor VIII. The database is completely updated with easy submission of point mutations, deletions and insertions via e-mail of custom-designed forms. A methods section devoted to mutation detection is available, highlighting issues such as choice of technique and PCR primer sequences. The FVIII structure section now includes a download of a FVIII A domain homology model in Protein Data Bank format and a multiple alignment of the FVIII amino-acid sequences from four species (human, murine, porcine and canine) in addition to the virtual reality simulations, secondary structural data and FVIII animation already available. Finally, to aid navigation across this site, a clickable roadmap of the main features provides easy access to the page desired. Their intention is that continued development and updating of the site shall provide workers in the fields of molecular and structural biology with a one-stop resource site to facilitate FVIII research and education. To submit your mutants to the Haemophilia A Mutation Database email the details. (Refer to Submission Guidelines)
Proper citation: HAMSTeRS - The Haemophilia A Mutation Structure Test and Resource Site (RRID:SCR_006883) Copy
http://sourceforge.net/p/fastsemsim/home/Home/
A package that implements several semantic similarity measures. It is both a library and an end-user application, featuring an intuitive graphical user interface (GUI). It has been implemented with the aim of being fast, expandable, and easy to use. It allows the user to work with the most updated version of GO database and customizable annotation corpora. It provides a set of logically-organized classes that can be easily exploited to both integrate semantic similarity into different analysis pipelines and extend the library with new measures. Platform: Windows compatible, Mac OS X compatible, Linux compatible, Unix compatible
Proper citation: FastSemSim (RRID:SCR_006919) Copy
Multi-organism, publicly accessible compendium of peptides identified in a large set of tandem mass spectrometry proteomics experiments. Mass spectrometer output files are collected for human, mouse, yeast, and several other organisms, and searched using the latest search engines and protein sequences. All results of sequence and spectral library searching are subsequently processed through the Trans Proteomic Pipeline to derive a probability of correct identification for all results in a uniform manner to insure a high quality database, along with false discovery rates at the whole atlas level. The raw data, search results, and full builds can be downloaded for other uses. All results of sequence searching are processed through PeptideProphet to derive a probability of correct identification for all results in a uniform manner ensuring a high quality database. All peptides are mapped to Ensembl and can be viewed as custom tracks on the Ensembl genome browser. The long term goal of the project is full annotation of eukaryotic genomes through a thorough validation of expressed proteins. The PeptideAtlas provides a method and a framework to accommodate proteome information coming from high-throughput proteomics technologies. The online database administers experimental data in the public domain. You are encouraged to contribute to the database.
Proper citation: PeptideAtlas (RRID:SCR_006783) Copy
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the nidm-terms Resources search. From here you can search through a compilation of resources used by nidm-terms and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that nidm-terms has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on nidm-terms then you can log in from here to get additional features in nidm-terms such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into nidm-terms you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the sources that were queried against in your search that you can investigate further.
Here are the categories present within nidm-terms that you can filter your data on
Here are the subcategories present within this category that you can filter your data on
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.