Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
http://bio-bigdata.hrbmu.edu.cn/diseasemeth/
Human disease methylation database. DiseaseMeth version 2.0 is focused on aberrant methylomes of human diseases. Used for understanding of DNA methylation driven human diseases.
Proper citation: DiseaseMeth (RRID:SCR_005942) Copy
The DistiLD database aims to increase the usage of existing genome-wide association studies (GWAS) results by making it easy to query and visualize disease-associated SNPs and genes in their chromosomal context. The database performs three important tasks: # published GWAS are collected from several sources and linked to standardized, international disease codes ICD10 codes) # data from the International HapMap Project are analyzed to define linkage disequilibrium (LD) blocks onto which SNPs and genes are mapped # the web interface makes it easy to query and visualize disease-associated SNPs and genes within LD blocks. Users can query the database by diseases, SNPs or genes. No matter which of the three query modes was used, an intermediate page will be shown listing all the studies that matched the search with a link to the corresponding publication. The user can select either all studies related to a certain disease or one specific study for which to view the related LD blocks. The DistiLD resource integrates information on: * Associations between Single Nucleotide Polymorphisms (SNPs) and diseases from genome-wide association studies (GWAS) * Links between SNPs and genes based on linkage disequilibrium (LD) data from HapMap For convenience, we provide the complete datasets as two (zipped) tab-delimited files. The first file contains GWAS results mapped to LD blocks. The second file contains all SNPs and genes assigned to each LD block.
Proper citation: DistiLD - Diseases and Traits in LD (RRID:SCR_005943) Copy
https://compbio.dfci.harvard.edu/predictivenetworks//
A flexible, open-source, web-based application and data services framework that enables the integration, navigation, visualization and analysis of gene interaction networks. The primary goal of PN is to allow biomedical researchers to evaluate experimentally derived gene lists in the context of large-scale gene interaction networks. The PN analytical pipeline involves two key steps. The first is the collection of a comprehensive set of known gene interactions derived from a variety of publicly available sources. The second is to use these ''known'' interactions together with gene expression data to infer robust gene networks. The regression-based network inference algorithm creates a graph of gene interactions in which cycles may be present (but no self-loops). Based on information-theoretic techniques, a causal gene interaction network is inferred from both prior knowledge (interactions extracted from biomedical literature and structured biological databases) and gene expression data. A prediction model is fitted for each gene, given its parents, enabling assessment of the predictive ability of the network model.
Proper citation: Predictive Networks (RRID:SCR_006110) Copy
http://newt-omics.mpi-bn.mpg.de/index.php
Newt-omics is a database, which enables researchers to locate, retrieve and store data sets dedicated to the molecular characterization of newts. Newt-omics is a transcript-centered database, based on an Expressed Sequence Tag (EST) data set from the newt, covering ~50,000 Sanger sequenced transcripts and a set of high-density microarray data, generated from regenerating hearts. Newt-omics also contains a large set of peptides identified by mass spectrometry, which was used to validate 13,810 ESTs as true protein coding. Newt-omics is open to implement additional high-throughput data sets without changing the database structure. Via a user-friendly interface Newt-omics allows access to a huge set of molecular data without the need for prior bioinformatical expertise. The newt Notopthalmus viridescens is the master of regeneration. This organism is known for more than 200 years for its exceptional regenerative capabilities. Newts can completely replace lost appendages like limb and tail, lens and retina and parts of the central nervous system. Moreover, after cardiac injury newts can rebuild the functional myocardium with no scar formation. To date only very limited information from public databases is available. Newt-Omics aims to provide a comprehensive platform of expressed genes during tissue regeneration, including extensive annotations, expression data and experimentally verified peptide sequences with yet no homology to other publicly available gene sequences. The goal is to obtain a detailed understanding of the molecular processes underlying tissue regeneration in the newt, that may lead to the development of approaches, efficiently stimulating regenerative pathways in mammalians. * Number of contigs: 26594 * Number of est in contigs: 48537 * Number of transcripts with verified peptide: 5291 * Number of peptides: 15169
Proper citation: Newtomics (RRID:SCR_006073) Copy
http://www.nematodes.org/nembase4/
NEMBASE is a comprehensive Nematode Transcriptome Database including 63 nematode species, over 600,000 ESTs and over 250,000 proteins. Nematode parasites are of major importance in human health and agriculture, and free-living species deliver essential ecosystem services. The genomics revolution has resulted in the production of many datasets of expressed sequence tags (ESTs) from a phylogenetically wide range of nematode species, but these are not easily compared. NEMBASE4 presents a single portal into extensively functionally annotated, EST-derived transcriptomes from over 60 species of nematodes, including plant and animal parasites and free-living taxa. Using the PartiGene suite of tools, we have assembled the publicly available ESTs for each species into a high-quality set of putative transcripts. These transcripts have been translated to produce a protein sequence resource and each is annotated with functional information derived from comparison with well-studied nematode species such as Caenorhabditis elegans and other non-nematode resources. By cross-comparing the sequences within NEMBASE4, we have also generated a protein family assignment for each translation. The data are presented in an openly accessible, interactive database. An example of the utility of NEMBASE4 is that it can examine the uniqueness of the transcriptomes of major clades of parasitic nematodes, identifying lineage-restricted genes that may underpin particular parasitic phenotypes, possible viral pathogens of nematodes, and nematode-unique protein families that may be developed as drug targets.
Proper citation: NEMBASE (RRID:SCR_006070) Copy
http://hfv.lanl.gov/content/index
The Hemorrhagic Fever Viruses (HFV) sequence database collects and stores sequence data and provides a user-friendly search interface and a large number of sequence analysis tools, following the model of the highly regarded and widely used Los Alamos HIV database. The database uses an algorithm that aligns each sequence to a species-wide reference sequence. The NCBI RefSeq database is used for this; if a reference sequence is not available, a Blast search finds the best candidate. Using this method, sequences in each genus can be retrieved pre-aligned. Hemorrhagic fever viruses (HFVs) are a diverse set of over 80 viral species, found in 10 different genera comprising five different families: arena-, bunya-, flavi-, filo- and togaviridae. All these viruses are highly variable and evolve rapidly, making them elusive targets for the immune system and for vaccine and drug design. About 55,000 HFV sequences exist in the public domain today. A central website that provides annotated sequences and analysis tools will be helpful to HFV researchers worldwide.
Proper citation: HFV Database (RRID:SCR_006017) Copy
http://sourceforge.net/projects/bless-ec/
Software tool for Bloom-filter-based error correction for next-generation sequencing (NGS) reads. The algorithm produces accurate correction results with much less memory.
Proper citation: BLESS (RRID:SCR_005963) Copy
http://jjwanglab.org:8080/gwasdb/
Combines collections of genetic variants (GVs) from GWAS and their comprehensive functional annotations, as well as disease classifications. Used to maximize utilility of GWAS data to gain biological insights through integrative, multi-dimensional functional annotation portal. In addition to all GVs annotated in NHGRI GWAS Catalog, we manually curate GVs that are marginally significant (P value < 10-3) by looking into supplementary materials of each original publication and provide extensive functional annotations for these GVs. GVs are manually classified by diseases according to Disease Ontology Lite and HPO (Human Phenotype Ontology) for easy access. Database can also conduct gene based pathway enrichment and PPI network association analysis for those diseases with sufficient variants. SOAP services are available. You may Download GWASdb SNP. (This file contains all of the significant SNP in GWASdb. In the pvalue column, 0 means this P-value is not reported in the study but it is significant SNP. In the source column, GWAS:A represents the original data in GWAS catalog, while GWAS:B is our curation data which P-value < 10-3)
Proper citation: GWASdb (RRID:SCR_006015) Copy
http://202.38.126.151:8080/SDisease/
Curated database of experimentally supported data of RNA Splicing mutation and disease. The RNA Splicing mutations include cis-acting mutations that disrupt splicing and trans-acting mutations that affecting RNA-dependent functions that cause disease. Information such as EntrezGeneID, gene genomic sequence, mutation (nucleotide substitutions, deletions and insertions), mutation location within the gene, organism, detailed description of the splicing mutation and references are also given. Users are able to submit new entries to the database. This database integrating RNA splicing and disease associations would be helpful for understanding not only the RNA splicing but also its contribution to disease. In SpliceDisease database, they manually curated 2337 splicing mutation disease entries involving 303 genes and 370 diseases, which have been supported experimentally in 898 publications. The SpliceDisease database provides information including the change of the nucleotide in the sequence, the location of the mutation on the gene, the reference PubMed ID and detailed description for the relationship among gene mutations, splicing defects and diseases. They standardized the names of the diseases and genes and provided links for these genes to NCBI and UCSC genome browser for further annotation and genomic sequences. For the location of the mutation, they give direct links of the entry to the respective position/region in the genome browser.
Proper citation: SpliceDisease (RRID:SCR_006130) Copy
http://equilibrator.weizmann.ac.il/
Web interface designed for thermodynamic analysis of biochemical systems. eQuilibrator enables free-text search for biochemical compounds and reactions and provides thermodynamic estimates for both in a variety of conditions. It can provide estimates for compounds in the KEGG database, and individual compounds and enzymes can be searched for by their common names (water, glucosamine, hexokinase). Reactions can be entered in a free-text format that eQuilibrator parses automatically. eQuilibrator also allows manipulation of the conditions of a reaction - pH, ionic strength, and reactant and product concentrations.
Proper citation: eQuilibrator (RRID:SCR_006011) Copy
http://aias.biol.uoa.gr/OMPdb/
A database of Beta-barrel outer membrane proteins from Gram-negative bacteria. The web interface of OMPdb offers the user the ability not only to view the available data, but also to submit advanced queries for text search within the database''s protein entries or run BLAST searches against the database. The most up-to-date version of the database (as well as all past versions) can be downloaded in various formats (flat text, XML format or raw FASTA sequences). For constructing OMPdb, multiple freely accessible resources were combined and a detailed literature search was performed. The classification of OMPdb''s protein entries into families is based mainly on structural and functional criteria. Information included in the database consists of sequence data, as well as annotation for structural characteristics (such as the transmembrane segments), literature references and links to other public databases, features that are unique worldwide. Along with the database, a collection of profile Hidden Markov Models that were shown to be characteristic for Beta-barrel outer membrane proteins was also compiled. This set, when used in combination with our previously developed algorithms (PRED-TMBB, MCMBB and ConBBPRED) will serve as a powerful tool in matters of discrimination and classification of novel Beta-barrel proteins and whole-genome analyses., THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025.
Proper citation: OMPdb (RRID:SCR_006221) Copy
http://evolution.genetics.washington.edu/phylip.html
A free package of software programs for inferring phylogenies (evolutionary trees). The source code is distributed (in C), and executables are also distributed. In particular, already-compiled executables are available for Windows (95/98/NT/2000/me/xp/Vista), Mac OS X, and Linux systems. Older executables are also available for Mac OS 8 or 9 systems.
Proper citation: PHYLIP (RRID:SCR_006244) Copy
Collection of chemical structures. Provides access to structures, properties and associated information from hundreds of data sources to find compounds of interest and provides services to improve this data by curation and annotation and to integrate it with users applications.
Proper citation: ChemSpider (RRID:SCR_006360) Copy
http://www.geenivaramu.ee/en/tools/gwama
THIS RESOURCE IS NO LONGER IN SERVICE. Documented on February 28,2023. Software tool for meta analysis of whole genome association data.
Proper citation: GWAMA (RRID:SCR_006624) Copy
Repository contains antibody/B cell and T cell epitope information and epitope prediction and analysis tools. Immune epitopes are defined as molecular structures recognized by specific antigen receptors of the immune system, namely antibodies, B cell receptors, and T cell receptors. Immune epitopes from infectious diseases, excluding HIV, and immune-mediated diseases and the accompanying biological information are included.
Proper citation: Immune Epitope Database and Analysis Resource (IEDB) (RRID:SCR_006604) Copy
Model organism database for the social amoeba Dictyostelium discoideum that provides the biomedical research community with integrated, high quality data and tools for Dictyostelium discoideum and related species. dictyBase houses the complete genome sequence, ESTs, and the entire body of literature relevant to Dictyostelium. This information is curated to provide accurate gene models and functional annotations, with the goal of fully annotating the genome to provide a ''''reference genome'''' in the Amoebozoa clade. They highlight several new features in the present update: (i) new annotations; (ii) improved interface with web 2.0 functionality; (iii) the initial steps towards a genome portal for the Amoebozoa; (iv) ortholog display; and (v) the complete integration of the Dicty Stock Center with dictyBase. The Dicty Stock Center currently holds over 1500 strains targeting over 930 different genes. There are over 100 different distinct amoebozoan species. In addition, the collection contains nearly 600 plasmids and other materials such as antibodies and cDNA libraries. The strain collection includes: * strain catalog * natural isolates * MNNG chemical mutants * tester strains for parasexual genetics * auxotroph strains * null mutants * GFP-labeled strains for cell biology * plasmid catalog The Dicty Stock Center can accept Dictyostelium strains, plasmids, and other materials relevant for research using Dictyostelium such as antibodies and cDNA or genomic libraries.
Proper citation: Dictyostelium discoideum genome database (RRID:SCR_006643) Copy
A comparative platform for green plant genomics. Families of orthologous and paralogous genes that represent the modern descendents of ancestral gene sets are constructed at key phylogenetic nodes. These families allow easy access to clade specific orthology / paralogy relationships as well as clade specific genes and gene expansions. As of release v9.1, Phytozome provides access to forty-one sequenced and annotated green plant genomes which have been clustered into gene families at 20 evolutionarily significant nodes. Where possible, each gene has been annotated with PFAM, KOG, KEGG, and PANTHER assignments, and publicly available annotations from RefSeq, UniProt, TAIR, JGI are hyper-linked and searchable., THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025.
Proper citation: Phytozome (RRID:SCR_006507) Copy
http://www.informatics.jax.org/expression.shtml
Community database that collects and integrates the gene expression information in MGI with a primary emphasis on endogenous gene expression during mouse development. The data in GXD are obtained from the literature, from individual laboratories, and from large-scale data providers. All data are annotated and reviewed by GXD curators. GXD stores and integrates different types of expression data (RNA in situ hybridization; Immunohistochemistry; in situ reporter (knock in); RT-PCR; Northern and Western blots; and RNase and Nuclease s1 protection assays) and makes these data freely available in formats appropriate for comprehensive analysis. There is particular emphasis on endogenous gene expression during mouse development. GXD also maintains an index of the literature examining gene expression in the embryonic mouse. It is comprehensive and up-to-date, containing all pertinent journal articles from 1993 to the present and articles from major developmental journals from 1990 to the present. GXD stores primary data from different types of expression assays and by integrating these data, as data accumulate, GXD provides increasingly complete information about the expression profiles of transcripts and proteins in different mouse strains and mutants. GXD describes expression patterns using an extensive, hierarchically-structured dictionary of anatomical terms. In this way, expression results from assays with differing spatial resolution are recorded in a standardized and integrated manner and expression patterns can be queried at different levels of detail. The records are complemented with digitized images of the original expression data. The Anatomical Dictionary for Mouse Development has been developed by our Edinburgh colleagues, as part of the joint Mouse Gene Expression Information Resource project. GXD places the gene expression data in the larger biological context by establishing and maintaining interconnections with many other resources. Integration with MGD enables a combined analysis of genotype, sequence, expression, and phenotype data. Links to PubMed, Online Mendelian Inheritance in Man (OMIM), sequence databases, and databases from other species further enhance the utility of GXD. GXD accepts both published and unpublished data.
Proper citation: Gene Expression Database (RRID:SCR_006539) Copy
DPVweb provides a central source of information about viruses, viroids and satellites of plants, fungi and protozoa. Comprehensive taxonomic information, including brief descriptions of each family and genus, and classified lists of virus sequences are provided. The database also holds detailed, curated, information for all sequences of viruses, viroids and satellites of plants, fungi and protozoa that are complete or that contain at least one complete gene. For comparative purposes, it also contains a single representative sequence of all other fully sequenced virus species with an RNA or single-stranded DNA genome. The start and end positions of each feature (gene, non-translated region and the like) have been recorded and checked for accuracy. As far as possible, nomenclature for genes and proteins are standardized within genera and families. Sequences of features (either as DNA or amino acid sequences) can be directly downloaded from the website in FASTA format. The sequence information can also be accessed via client software for PC computers (freely downloadable from the website) that enable users to make an easy selection of sequences and features of a chosen virus for further analyses. The public sequence databases contain vast amounts of data on virus genomes but accessing and comparing the data, except for relatively small sets of related viruses can be very time consuming. The procedure is made difficult because some of the sequences on these databases are incorrectly named, poorly annotated or redundant. The NCBI Reference Sequence project (1) provides a comprehensive, integrated, non-redundant set of sequences, including genomic DNA, transcript (RNA) and protein products, for major research organisms. This now includes curated information for a single sequence of each fully sequenced virus species. While this is a welcome development, it can only deal with complete sequences. An important feature of DPV is the opportunity to access genes (and other features) of multiple sequences quickly and accurately. Thus, for example, it is easy to obtain the nucleotide or amino acid sequences of all the available accessions of the coat protein gene of a given virus species or for a group of viruses. To increase its usefulness further, DPVweb also contains a single representative sequence of all other fully sequenced virus species with an RNA or single-stranded DNA (ssDNA) genome. Sponsors: This site is supported by the Association of Applied Biologists and the Zhejiang Academy of Agricultural Sciences, Hangzhou, People''s Republic of China.
Proper citation: Descriptions of Plant Viruses (RRID:SCR_006656) Copy
Software package for sequence alignment, assembly and analysis. Integrated and extendable desktop software platform for organization and analysis of sequence data. Bioinformatics software platform packed with molecular biology and sequence analysis tools.
Proper citation: Geneious (RRID:SCR_010519) Copy
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the SPARC SAWG Resources search. From here you can search through a compilation of resources used by SPARC SAWG and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that SPARC SAWG has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on SPARC SAWG then you can log in from here to get additional features in SPARC SAWG such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into SPARC SAWG you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the sources that were queried against in your search that you can investigate further.
Here are the categories present within SPARC SAWG that you can filter your data on
Here are the subcategories present within this category that you can filter your data on
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.