Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
Software that generates, analyses and compares k-mer spectra produced from sequence files. Used to quality control NGS datasets and genome assemblies.
Proper citation: KAT (RRID:SCR_016741) Copy
http://bio-bwa.sourceforge.net/
Software for aligning sequencing reads against large reference genome. Consists of three algorithms: BWA-backtrack, BWA-SW and BWA-MEM. First for sequence reads up to 100bp, and other two for longer sequences ranged from 70bp to 1Mbp.
Proper citation: BWA (RRID:SCR_010910) Copy
A web program that can locate residue periodicities in either amino acid or DNA sequences. It is based on an algorithm of Dr. A.D. McLachlan (1977). NOTE: You must use a Java compatible browser to run the application.
Proper citation: FT (RRID:SCR_006228) Copy
The Kabat Database determines the combining site of antibodies based on the available amino acid sequences. The precise delineation of complementarity determining regions (CDR) of both light and heavy chains provides the first example of how properly aligned sequences can be used to derive structural and functional information of biological macromolecules. The Kabat database now includes nucleotide sequences, sequences of T cell receptors for antigens (TCR), major histocompatibility complex (MHC) class I and II molecules, and other proteins of immunological interest. The Kabat Database searching and analysis tools package is an ASP.NET web-based portal containing lookup tools, sequence matching tools, alignment tools, length distribution tools, positional correlation tools and much more. The searching and analysis tools are custom made for the aligned data sets contained in both the SQL Server and ASCII text flat file formats. The searching and analysis tools may be run on a single PC workstation or in a distributed environment. The analysis tools are written in ASP.NET and C# and are available in Visual Studio .NET 2003/2005/2008 formats. The Kabat Database was initially started in 1970 to determine the combining site of antibodies based on the available amino acid sequences at that time. Bence Jones proteins, mostly from human, were aligned, using the now-known Kabat numbering system, and a quantitative measure, variability, was calculated for every position. Three peaks, at positions 24-34, 50-56 and 89-97, were identified and proposed to form the complementarity determining regions (CDR) of light chains. Subsequently, antibody heavy chain amino acid sequences were also aligned using a different numbering system, since the locations of their CDRs (31-35B, 50-65 and 95-102) are different from those of the light chains. CDRL1 starts right after the first invariant Cys 23 of light chains, while CDRH1 is eight amino acid residues away from the first invariant Cys 22 of heavy chains. During the past 30 years, the Kabat database has grown to include nucleotide sequences, sequences of T cell receptors for antigens (TCR), major histocompatibility complex (MHC) class I and II molecules and other proteins of immunological interest. It has been used extensively by immunologists to derive useful structural and functional information from the primary sequences of these proteins.
Proper citation: Kabat Database of Sequences of Proteins of Immunological Interest (RRID:SCR_006465) Copy
http://athina.biol.uoa.gr/bioinformatics/NON-RED/index.html
A web tool to select biological sequences from a given set, with similarity / homology less than a user-defined level. This web-based application takes as input a set of N sequences and outputs a set of sequences of user-determined redundancy. Initially, the algorithm runs an all-against-all BLAST alignment on the input data set and creates an NxN matrix of pairwise distances defined by the similarity percentages. In the next step, the algorithm removes the sequence with the largest number of neighbors, causing that sequence not to be counted as a neighbor of any other sequence during the next iterations. It then reassesses the number of neighbors of each sequence and repeats the previous step until the sequences left over have no more neighbors. The user can specify the similarity (%) threshold and the minimum coverage length of the alignments. Sequences with a similarity below the threshold or a smaller coverage than the minimum length are not considered to be neighbors.
Proper citation: NON-RED (RRID:SCR_006225) Copy
http://athina.biol.uoa.gr/bioinformatics/waveTM/
A web tool for the prediction of transmembrane segments in alpha-helical membrane proteins. A sliding window of 20 residues is used in order to calculate an average residue hydrophobicity profile, using a hydrophobicity scale. Discrete Wavelet Transform is applied on the average residue hydrophobicity signal and the different frequency coefficients produced are adaptively thresholded so that a denoised signal is reconstructed. A dynamic programming algorithm processes the denoised signal to provide the optimal model for the number, the length and the location of membrane-spanning segments. The end points of the predicted segments are extended to include flanking hydrophobic residues. Topology prediction can also be obtained in conjunction with OrienTM (Liakopoulos et al, 2001). Analysis of a non-redundant test set, provides a ~95% per segment accuracy and ~90% per residue accuracy. Now, you can: * Run waveTM on a sequence * Browse the results obtained with the algorithm * View additional material concerning the hydrophobicity scale
Proper citation: waveTM (RRID:SCR_006199) Copy
http://athina.biol.uoa.gr/PRED-TMR/
A web server that predicts transmembrane domains in proteins using solely information contained in the sequence itself. The algorithm refines a standard hydrophobicity analysis with a detection of potential termini (edges, starts and ends) of transmembrane regions. This allows both to discard highly hydrophobic regions not delimited by clear start and end configurations and to confirm putative transmembrane segments not distinguishable by their hydrophobic composition. The accuracy obtained on a test set of 101 non homologous transmembranes proteins with reliable topologies compares well with that of other popular existing methods. Only a slight decrease in prediction accuracy was observed when the algorithm was applied to all transmembrane proteins of the SwissProt database (release 35).
Proper citation: PRED-TMR (RRID:SCR_006203) Copy
Database that provides the genome sequence assembly of the International Rice Genome Sequencing Project (IRGSP), manually curated annotation of the sequence, and other genomics information that could be useful for comprehensive understanding of the rice biology. RAP-DB contains clone positions, structures and functions of genes validated by cDNAs, RNA genes detected by massively parallel signature sequencing (MPSS) technology and sequence similarity, flanking sequences of mutant lines, transposable elements, etc. Other annotation data such as Gnomon can be displayed along with those of RAP for comparison.
Proper citation: RAP-DB (RRID:SCR_006610) Copy
http://www.ncbi.nlm.nih.gov/CCDS/
Database (anonymous FTP) resulting from a collaborative effort to identify a core set of human and mouse protein coding regions that are consistently annotated and of high quality. The long term goal is to support convergence towards a standard set of gene annotations. Collaborators are EBI, NCBI, UCSC, WTSI and the initial results are also available from the participants'''' genome browser Web sites. In addition, CCDS identifiers are indicated on the relevant NCBI RefSeq and Entrez Gene records and in Map Viewer displays of RNA (RefSeq) and Gene annotations on the reference assembly.
Proper citation: Consensus CDS (RRID:SCR_006729) Copy
Database devoted to protein domains. It is also a collection of tools for the investigation of the relationships between protein sequences and motifs described on them.
Proper citation: MyHits (RRID:SCR_006757) Copy
http://igs-server.cnrs-mrs.fr/mgdb/Rickettsia/
THIS RESOURCE IS NO LONGER IN SERVICE, documented August 18, 2016. Rickettsia are obligate intracellular bacteria living in arthropods. They occasionally cause diseases in humans. To understand their pathogenicity, physiologies and evolutionary mechanisms, RicBase is sequencing different species of Rickettsia. Up to now we have determined the genome sequences of R. conorii, R. felis, R. bellii, R. africae, and R. massiliae. The RicBase aims to organize the genomic data to assist followup studies of Rickettsia. This website contains information on R. conorii and R. prowazekii. A R. conorii and R. prowazekii comparative genome map is also available. Images of genome maps, dendrogram, and sequence alignment allow users to gain a visualization of the diagrams.
Proper citation: Rickettsia Genome Database (RRID:SCR_007102) Copy
http://www.ncbi.nlm.nih.gov/unists
THIS RESOURCE IS NO LONGER IN SERVICE, documented August 22, 2016. Database of sequence tagged sites (STSs) derived from STS-based maps and other experiments. STSs are defined by PCR primer pairs and are associated with additional information such as genomic position, genes, and sequences. Chromosome maps are labeled by name of the originating organism, the map title, total markers, total UniSTSs and links to view maps as well as research documents available through PubMed, another NCBI database. The search functions within UniSTS allow the user to search by gene marker, chromosome, gene symbol and gene description terms to locate markers on specified genes. A representation of the UniSTS datasets is available by ftp. NOTE: All data from this resource have been moved to the Probe database, http://www.ncbi.nlm.nih.gov/probe. You can retrieve all UniSTS records by searching the probe database using the search term unists(properties). (use brackets insead of parenthesis). Additionally, legacy data remain on the NCBI FTP Site in the UniSTS Repository (ftp://ftp.ncbi.nih.gov/pub/ProbeDB/legacy_unists).
Proper citation: UniSTS (RRID:SCR_006843) Copy
http://www.mbio.ncsu.edu/RNaseP/home.html
Ribonuclease P is responsible for the 5''-maturation of tRNA precursors. Ribonuclease P is a ribonucleoprotein, and in bacteria (and some Archaea) the RNA subunit alone is catalytically active in vitro, i.e. it is a ribozyme. The Ribonuclease P Database is a compilation of ribonuclease P sequences, sequence alignments, secondary structures, three-dimensional models and accessory information. The database contains information on bacterial, archaeal, and eukaryotic RNase P. The RNase P and protein sequences are available from phylogentically-arranged lists, individual sequences, or aligned in GenBank format. The database also provides secondary structures and 3D models, as well as movies, still images, and other accessory information.
Proper citation: RNase P Database (RRID:SCR_006680) Copy
http://bond.unleashedinformatics.com/
THIS RESOURCE IS NO LONGER IN SERVICE, documented May 10, 2017. A pilot effort that has developed a centralized, web-based biospecimen locator that presents biospecimens collected and stored at participating Arizona hospitals and biospecimen banks, which are available for acquisition and use by researchers. Researchers may use this site to browse, search and request biospecimens to use in qualified studies. The development of the ABL was guided by the Arizona Biospecimen Consortium (ABC), a consortium of hospitals and medical centers in the Phoenix area, and is now being piloted by this Consortium under the direction of ABRC. You may browse by type (cells, fluid, molecular, tissue) or disease. Common data elements decided by the ABC Standards Committee, based on data elements on the National Cancer Institute''s (NCI''s) Common Biorepository Model (CBM), are displayed. These describe the minimum set of data elements that the NCI determined were most important for a researcher to see about a biospecimen. The ABL currently does not display information on whether or not clinical data is available to accompany the biospecimens. However, a requester has the ability to solicit clinical data in the request. Once a request is approved, the biospecimen provider will contact the requester to discuss the request (and the requester''s questions) before finalizing the invoice and shipment. The ABL is available to the public to browse. In order to request biospecimens from the ABL, the researcher will be required to submit the requested required information. Upon submission of the information, shipment of the requested biospecimen(s) will be dependent on the scientific and institutional review approval. Account required. Registration is open to everyone.. Documented on August 19,2019.BOND, which requires registration of a free account, is a resource used to perform cross-database searches of available sequence, interaction, complex and pathway information. BOND integrates a range of component databases including GenBank and BIND, the Biomolecular Interaction Network Database. BOND contains 70+ million biological sequences, 33,000 structures, 38,000 GO terms, and over 200,000 human curated interactions contained in BIND, and is open access. BOND serves the interests of the developing global interactome effort encompassing the genomic, proteomic and metabolomic research communities. BOND is the first open access search resource to integrate sequence and interaction information. BOND integrates BLAST functionality, and contains a well-documented API. BOND also stores annotation links for sequences, including links to Genome Ontology descriptions, MedLine abstracts, taxon identifiers, associated structures, redundant sequences, sequence neighbors, conserved domains, data base cross-references, Online Mendalian Inheritance in Man identifiers, LocusLink identifiers and complete genomes. BIND on BOND The Biomolecular Interaction Network Database (BIND), a component database of BOND, is a collection of records documenting molecular interactions. The contents of BIND include high-throughput data submissions and hand-curated information gathered from the scientific literature. BIND is an interaction database with three classifications for molecular associations: molecules that associate with each other to form interactions, molecular complexes that are formed from one or more interaction(s) and pathways that are defined by a specific sequence of two or more interactions.Interactions A BIND record represents an interaction between two or more objects that is believed to occur in a living organism. A biological object can be a protein, DNA, RNA, ligand, molecular complex, gene, photon or an unclassified biological entity. BIND records are created for interactions which have been shown experimentally and published in at least one peer-reviewed journal. A record also references any papers with experimental evidence that support or dispute the associated interaction. Interactions are the basic units of BIND and can be linked together to form molecular complexes or pathways. The BIND interaction viewer is a tool to visualize and analyze molecular interactions, complexes and pathways. The BIND interaction viewer uses Ontoglyphs to display information about a protein via attributes such as molecular function, biological process and sub-cellular localization. Ontoglyphs allow to graphically and interactively explore interaction networks, by visualizing interactions in the context of 34 functional, 25 binding specificity and 24 sub-cellular localization Ontoglyphs categories. We will continue to provide an open access version of BOND, providing its subscribers with free, unlimited access to a core content set. But we are confident you will soon want to upgrade to BONDplus.
Proper citation: Biomolecular Object Network Databank (RRID:SCR_007433) Copy
http://biobases.ibch.poznan.pl/5SData/
A database on nucleotide sequences of 5S rRNAs and their genes. The database contains 1985 primary structures of 5S rRNA and 5S rDNA, and was last updated in 2002, according to the website. They include 60 archaebacterial, 470 eubacterial, 63 plastid, nine mitochondrial and 1383 eukaryotic sequences. The nucleotide sequences of the 5S rRNAs or 5S rDNAs are divided according to the taxonomic position of the source organisms. The sequences for particular organisms can be retrieved as single files using a taxonomic browser or in multiple sequence structural alignments. The multiple sequence alignments of 5S ribosomal RNAs can be downloaded in TAB-delimited and FASTA formats.
Proper citation: 5S Ribosomal RNA Database (RRID:SCR_007545) Copy
http://mips.gsf.de/genre/proj/ustilago/
The MIPS Ustilago maydis Genome Database aims to present information on the molecular structure and functional network of the entirely sequenced, filamentous fungus Ustilago maydis. The underlying sequence is the initial release of the high quality draft sequence of the Broad Institute. The goal of the MIPS database is to provide a comprehensive genome database in the Genome Research Environment in parallel with other fungal genomes to enable in depth fungal comparative analysis. The specific aims are to: 1. Generate and assemble Whole Genome Shotgun sequence reads yielding 10X coverage of the U. maydis genome 2. Integrate the genomic sequence assembly with physical maps generated by Bayer CropScience 3. Perform automated annotation of the sequence assembly 4. Align the strain 521 assembly with the FB1 assembly provided by Exelixis 5. Release the sequence assembly and results of our annotation and analysis to public Ustilago maydis is a basidiomycete fungal pathogen of maize and teosinte. The genome size is approximately 20 Mb. The fungus induces tumors on host plants and forms masses of diploid teliospores. These spores germinate and form haploid meiotic products that can be propagated in culture as yeast-like cells. Haploid strains of opposite mating type fuse and form a filamentous, dikaryotic cell type that invades plant tissue to reinitiate infection. Ustilago maydis is an important model system for studying pathogen-host interactions and has been studied for more than 100 years by plant pathologists. Molecular genetic research with U. maydis focuses on recombination, the role of mating in pathogenesis, and signaling pathways that influence virulence. Recently, the fungus has emerged as an excellent experimental model for the molecular genetic analysis of phytopathogenesis, particularly in the characterization of infection-specific morphogenesis in response to signals from host plants. Ustilago maydis also serves as an important model for other basidiomycete plant pathogens that are more difficult to work with in the laboratory, such as the rust and bunt fungi. Genomic sequence of U. maydis will also be valuable for comparative analysis of other fungal genomes, especially with respect to understanding the host range of fungal phytopathogens. The analysis of U. maydis would provide a framework for studying the hundreds of other Ustilago species that attack important crops, such as barley, wheat, sorghum, and sugarcane. Comparisons would also be possible with other basidiomycete fungi, such as the important human pathogen C. neoformans. Commercially, U. maydis is an excellent model for the discovery of antifungal drugs. In addition, maize tumors caused by U. maydis are prized in Hispanic cuisine and there is interest in improving commercial production. The complete putative gene set of the Broad Institute''s second release is loaded into the database and in addition all deviating putative genes from a putative gene set produced by MIPS with different gene prediction parameters are also loaded. The complete dataset will then be analysed, gene predictions will be manually corrected due to combined information derived from different gene prediction algorithms and, more important, protein and EST comparisons. Gene prediction will be restricted to ORFs larger than 50 codons; smaller ORFs will be included only if similarities to other proteins or EST matches confirm their existence or if a coding region was postulated by all prediction programs used. The resulting proteins will be annotated. They will be classified according to the MIPS classification catalogue receiving appropriate descriptions. All proteins with a known, characterized homolog will be automatically assigned to functional categories using the MIPS functional catalog. All extracted proteins are in addition automatically analysed and annotated by the PEDANT suite.
Proper citation: MIPS Ustilago maydis Database (RRID:SCR_007563) Copy
Collection of transmembrane protein datasets containing experimentally derived topology information from the literature and from public databases. Web interface of TOPDB includes tools for searching, relational querying and data browsing, visualisation tools for topology data.
Proper citation: Topology Data Bank of Transmembrane Proteins (RRID:SCR_007964) Copy
It provides a database based on a pre-computed similarity matrix covering the similarity space formed by >4 million amino acid sequences from public databases and completely sequenced genomes. The database is capable of handling very large datasets and is updated incrementally. For sequence similarity searches and pairwise alignments, we implemented a grid-enabled software system, which is based on FASTA heuristics and the Smith Waterman algorithm. SimpleSIMAP and AdvancedSIMAP retrieve homologs for given protein sequences that need to be contained in the SIMAP database. While SimpleSIMAP provides only selected parameters and preconfigured search spaces, the AdvancedSIMAP allows the user to specify search space, filtering and sorting parameters in a flexible manner. Both types of queries result in lists of homologs that are linked in turn to their homologs. So the web interfaces allow users to explore quickly and interactively the protein world by homology. Sponsors: SIMAP is supported by the Department of Genome Oriented Bioinformatics of the Technische Universitt Mnchen and the Institute for Bioinformatics of the GSF-National Research Center for Environment and Health.
Proper citation: SIMAP (RRID:SCR_007927) Copy
http://icebox.lbl.gov:8080/ApolloWebDemo/jbrowse/
WebApollo is an extensible web-based sequence annotation editor for community annotation. No software download is required and the annotations are saved to a centralized database with real-time annotation updating. (The edit server mediates annotation changes made by multiple users.) The Web based client uses JBrowse, is fast and highly interactive. WebApollo accesses many types of genomic data including access to public data from UCSC, Ensembl, and GMOD Chado databases. Source code (BSD License) * Client source code: https://github.com/berkeleybop/jbrowse * Annotation editing engine: http://code.google.com/p/apollo-web * Data model and I/O layer: http://code.google.com/p/gbol * Trellis server code: http://code.google.com/p/genomancer
Proper citation: WebApollo: A Web-Based Sequence Annotation Editor for Community Annotation (RRID:SCR_005321) Copy
http://www.ch.embnet.org/software/COILS_form.html
COILS is a program that compares a sequence to a database of known parallel two-stranded coiled-coils and derives a similarity score. By comparing this score to the distribution of scores in globular and coiled-coil proteins, the program then calculates the probability that the sequence will adopt a coiled-coil conformation.
Proper citation: COILS: Prediction of Coiled Coil Regions in Proteins (RRID:SCR_008440) Copy
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the RRID Resources search. From here you can search through a compilation of resources used by RRID and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that RRID has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on RRID then you can log in from here to get additional features in RRID such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into RRID you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the sources that were queried against in your search that you can investigate further.
Here are the categories present within RRID that you can filter your data on
Here are the subcategories present within this category that you can filter your data on
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.