Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
The HUGE protein database has been created to publicize the Human cDNA project at the Kazusa DNA Research Institute. This project will sequence and analyze long (>4 kb) human cDNAs and establish methods by using the sequence data how to predict the primary structure of proteins of various biological activities. Currently, it focuses on the analysis of cDNA clones encoding particularly large proteins (>50 kDa). The HUGE protein database contains various types of information derived from the predicted primary structure data of newly identified human proteins. The HUGE protein database are expected to cover various sets of large human proteins of hitherto unidentified functions. They are likely to be involved in cellular structure/motility (such as cytoskeleton, membrane skeleton, and motor proteins), gene expression and nucleic acid metabolism, cell signaling/communication (such as cellular adhesion, signal transduction, channels, and receptors), and so on.
Proper citation: HUGE - Human Unidentified Gene-Encoded large proteins (RRID:SCR_013482) Copy
http://db.systemsbiology.net/kaviar/
A database containing a compilation of SNVs, indels, and complex variants observed in humans, designed to facilitate testing for the novelty and frequency of observed variants.
Proper citation: KAVIAR (RRID:SCR_013737) Copy
A database of protein disorder and mobility annotations. The database features three levels of annotation: manually curated data (which are extracted from the DisProt database), indirect data, and predicted data. Additional annotations are included from external sources, including UniProt, Pfam, PDB, and STRING.
Proper citation: MobiDB (RRID:SCR_014542) Copy
https://rtips.cancer.gov/rtips/index.do
Database of cancer control interventions and program materials. It is designed to provide program planners and public health practitioners easy and immediate access to research-tested materials.
Proper citation: Research-tested Intervention Programs (RTIPs) (RRID:SCR_016042) Copy
http://floresta.eead.csic.es/3dfootprint
Database of DNA-binding protein structures that is updated with Protein Data Bank complexes. It provides structure-based binding specificities and sequence logos, classification and clusters of protein-DNA interfaces, and downloads/stats.
Proper citation: 3D-footprint (RRID:SCR_015713) Copy
Database for the identification of the human proteome and its use across the scientific community. Users can browse proteins and chromosomes and contribute to the data repository.
Proper citation: ProteomicsDB (RRID:SCR_015562) Copy
http://amp.pharm.mssm.edu/datasets2tools/
Database for the discovery and evaluation of biomedical digital objects. It includes a wide variety of enrichment analyses, gene interaction networks, interactive data visualizations, datasets, and computational tools.
Proper citation: Datasets2Tools (RRID:SCR_016174) Copy
Collection of transcription factor microRNA regulations. TransmiR v2.0 manually curated TF-miRNA regulations from publications during 2013-2017 and included ChIP-seq-derived TF-miRNA regulation data.
Proper citation: TransmiR (RRID:SCR_017499) Copy
Collection of chemical compounds and associated information that were automatically extracted by text mining content of PubMed and PubChem databases. Unifies chemical lists from metabolomics, systems biology, environmental epidemiology, occupational expossure, toxiology and nutrition fields.
Proper citation: Blood Exposome Database (RRID:SCR_017610) Copy
A comprehensive analysis and visualization software package for gene expression experiments that provides: a number of clustering and analysis techniques; integrated gene expression and analysis result visualizations, integration with the Gene Expression Omnibus; and an optional data sharing architecture. GO is used to assign functional enrichment scores to clusters, using a combination of specially developed techniques and general statistical methods. These results can be explored using the in built ontology browsing tool or through the generated web pages. SeqExpress also supports numerous data transformation, projection, visualization, file export/import, searching, integration (with R), and clustering options.
Proper citation: SeqExpress (RRID:SCR_007075) Copy
https://database.riken.jp/sw/en/The_RIKEN_integrated_database_of_mammals/ria254i/
THIS RESOURCE IS NO LONGER IN SERVICE. Documented on August 16, 2019.
A database that integrates not only RIKEN''''s original large-scale mammalian databases, such as FANTOM, the ENU mutagenesis program, the RIKEN Cerebellar Development Transcriptome Database and the Bioresource Database, but also imported data from public databases, such as Ensembl, MGI and biomedical ontologies. Our integrated database has been implemented on the infrastructure of publication medium for databases, termed SciNetS/SciNeS, or the Scientists'''' Networking System, where the data and metadata are structured as a semantic web and are downloadable in various standardized formats. The top-level ontology-based implementation of mammal-related data directly integrates the representative knowledge and individual data records in existing databases to ensure advanced cross-database searches and reduced unevenness of the data management operations. Through the development of this database, we propose a novel methodology for the development of standardized comprehensive management of heterogeneous data sets in multiple databases to improve the sustainability, accessibility, utility and publicity of the data of biomedical information.
Proper citation: RIKEN integrated database of mammals (RRID:SCR_006890) Copy
http://www.bioconductor.org/packages/2.13/bioc/html/epigenomix.html
Software package for the integrative analysis of microarray based gene expression and histone modification data obtained by ChIP-seq. The package provides methods for data preprocessing and matching as well as methods for fitting bayesian mixture models in order to detect genes with differences in both data types.
Proper citation: epigenomix (RRID:SCR_006407) Copy
http://soap.genomics.org.cn/soapaligner.html
THIS RESOURCE IS NO LONGER IN SERVICE. Documented on April 12,2024. Updated version of SOAP software for short oligonucleotide alignment that features in super fast and accurate alignment for huge amounts of short reads generated by Illumina/Solexa Genome Analyzer., THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025.
Proper citation: SOAPaligner/soap2 (RRID:SCR_005503) Copy
THIS RESOURCE IS NO LONGER IN SERVICE. Documented on April 12,2024. Software application for pedigree drawing (entry from Genetic Analysis Software)
Proper citation: Pedigree-Draw (RRID:SCR_008302) Copy
http://www.ch.embnet.org/software/COILS_form.html
COILS is a program that compares a sequence to a database of known parallel two-stranded coiled-coils and derives a similarity score. By comparing this score to the distribution of scores in globular and coiled-coil proteins, the program then calculates the probability that the sequence will adopt a coiled-coil conformation.
Proper citation: COILS: Prediction of Coiled Coil Regions in Proteins (RRID:SCR_008440) Copy
Software package of molecular simulation programs. It is distributed into AmberTools15 and Amber14. AmberTools15 is a software package which can carry out complete molecular dynamics simulations with either explicit water or generalized Born solvent models. It is distributed in source code format and must be compiled in order to be used. Amber14 builds on AmberTools15 by adding the pmemd program, which provides better performance on multiple CPUs and dramatic speed improvements on GPUs compared to sander (molecular dynamics). GPU info, manuals, and tutorials are available on the website.
Proper citation: Assisted Model Building with Energy Refinement (AMBER) (RRID:SCR_014230) Copy
http://subread.sourceforge.net/
Software package for high-performance read alignment, quantification and mutation discovery.General purpose read aligner which can be used to map both genomic DNA-seq reads and RNA-seq reads. Subread aligner as fast, accurate and scalable read mapping by seed-and-vote.These programs were also implemented in Bioconductor R package Rsubread.
Proper citation: Subread (RRID:SCR_009803) Copy
http://sourceforge.net/projects/gsa-snp/
A tool for the gene-set (or pathway) analysis of a genome-wide association study result. It accepts a genome-wide list of SNPs and their association P-values. It summarizes the SNP P-values into nearby genes. The gene-by-gene summary results are then further summarized by gene-sets such as Gene Ontology, KEGG pathways, or user-created gene-sets. Various standardization and statistical tests can be performed and the resulting gene-sets that pass a significance level after multiple-testing correction are reported. The tool is written in Java and is available as a standalone version.
Proper citation: GSA-SNP (RRID:SCR_013109) Copy
Ratings or validation data are available for this resource
http://www.bioinformatics.babraham.ac.uk/projects/trim_galore/
Software tool to automate quality and adapter trimming as well as quality control, with some added functionality to remove biased methylation positions for RRBS sequence files for directional, non-directional or paired-end sequencing. Wrapper around Cutadapt and FastQC to consistently apply adapter and quality trimming to FastQ files, with extra functionality for Reduced Representation Bisulfite Sequencing data.
Proper citation: Trim Galore (RRID:SCR_011847) Copy
https://github.com/MikkelSchubert/paleomix
THIS RESOURCE IS NO LONGER IN SERVICE. Documented on February 28,2023. Software toolkit for the processing of ancient and modern HTS data. PALEOMIX also aids in metagenomic analysis of the extracts from the HTS processing.
Proper citation: PALEOMIX (RRID:SCR_015057) Copy
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the SPARC SAWG Resources search. From here you can search through a compilation of resources used by SPARC SAWG and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that SPARC SAWG has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on SPARC SAWG then you can log in from here to get additional features in SPARC SAWG such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into SPARC SAWG you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the sources that were queried against in your search that you can investigate further.
Here are the categories present within SPARC SAWG that you can filter your data on
Here are the subcategories present within this category that you can filter your data on
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.