Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
THIS RESOURCE IS NO LONGER IN SERVICE, documented October 13, 2014. The resource has moved to the NIDDKInformation Network (dkNET) project. Contact them at info_at_dknet.org with any questions. Database of large pools of data relevant to the mission of NIDDKwith the goal of developing a community-based network for integration across disciplines to include the larger DKuniverse of diseases, investigators, and potential users. The focus is on greater use of this data with the objective of adding value by breaking down barriers between sites to facilitate linking of different datasets. To date (2013/06/10), a total of 1,195 resources have been associated with one or more genes. Of 11,580 total genes associated with resources, the ten most represented are associated with 359 distinct resources. The main method by which they currently interconnect resources between the providers is via EntrezGene identifiers. A total of 780 unique genes provide the connectivity between 3,159 resource pairs across consortia. To further increase interconnectivity, the groups have been further annotating their data with additional gene identifiers, publications, and ontology terms from selected Open Biological and Biomedical Ontologies (OBO).
Proper citation: dkCOIN (RRID:SCR_004438) Copy
http://www.omicsoft.com/fusionmap/
An efficient fusion aligner which aligns reads spanning fusion junctions directly to the genome without prior knowledge of potential fusion regions. It detects and characterizes fusion junctions at base-pair resolution. FusionMap can be applied to detect fusion junctions in both single- and paired-end dataset from either gDNA-Seq or RNA-Seq studies. FusionMap runs under both Windows and Linux (requiring MONO) environments. Although it can run on 32 bit machine, it is recommended to run on 64-bit machine with 8GB RAM or more. If you have an ArrayStudio License, you can run the fusion detection easily through its GUI.
Proper citation: FusionMap (RRID:SCR_005242) Copy
http://sourceforge.net/projects/cova/
A variant annotation and comparison tool for next-generation sequencing. It annotates the effects of variants on genes and compares those among multiple samples, which helps to pinpoint causal variation(s) relating to phenotype.
Proper citation: COVA (RRID:SCR_005175) Copy
http://stothard.afns.ualberta.ca/downloads/NGS-SNP/
A collection of command-line scripts for providing rich annotations for SNPs identified by the sequencing of transcripts or whole genomes from organisms with reference sequences in Ensembl. Included among the annotations, several of which are not available from any existing SNP annotation tools, are the results of detailed comparisons with orthologous sequences. These comparisons allow, for example, SNPs to be sorted or filtered based on how drastically the SNP changes the score of a protein alignment. Other fields indicate the names of overlapping protein domains or features, and the conservation of both the SNP site and flanking regions. NCBI, Ensembl, and Uniprot IDs are provided for genes, transcripts, and proteins when applicable, along with Gene Ontology terms, a gene description, phenotypes linked to the gene, and an indication of whether the SNP is novel or known. A ?Model_Annotations? field provides several annotations obtained by transferring in silico the SNP to an orthologous gene, typically in a well-characterized species.
Proper citation: NGS-SNP (RRID:SCR_005182) Copy
http://www.cbcb.umd.edu/software/glimmer-mg/
A software system for finding genes in environmental shotgun DNA sequences.
Proper citation: Glimmer-MG (RRID:SCR_011932) Copy
http://statgenpro.psychiatry.hku.hk/limx/kggseq/
A biological Knowledge-based mining platform for Genomic and Genetic studies using Sequence data. The software platform, constituted of bioinformatics and statistical genetics functions, makes use of valuable biologic resources and knowledge for sequencing-based genetic mapping of variants / genes responsible for human diseases / traits. It facilitates geneticists to fish for the genetic determinants of human diseases / traits in the big sea of DNA sequences. KGGSeq has paid attention to downstream analysis of genetic mapping. The framework was implemented to filter and prioritize genetic variants from whole exome sequencing data.
Proper citation: KGGSeq (RRID:SCR_005311) Copy
http://bioapps.sabanciuniv.edu/mugex/v02/
Service that automatically extracts mutation-gene pairs from MEDLINE abstracts for a given disease.
Proper citation: MuGeX (RRID:SCR_005306) Copy
http://sourceforge.net/projects/netclassr/
An R package for network-based feature (gene) selection for biomarkers discovery via integrating biological information. The package adapts the following 5 algorithms for classifying and predicting gene expression data using prior knowledge: # average gene expression of pathway (aep); # pathway activities classification (PAC); # Hub network classification (hubc); # filter via top ranked genes (FrSVM); # network smoothed t-statistic (stSVM).
Proper citation: netClass (RRID:SCR_005672) Copy
http://snp-magma.sourceforge.net
Software that utilizes a multiobjective evolutionary algorithm for genetic mapping. It is based on a the ECJ evolutionary software package written by Sean Luke and includes the Strength Pareto Evoluationary Algorithm Version 2 changes for multiobjective analysis. The code runs on any platform with Java Version 2. A genetic mapping project, typically implemented during a search for genes responsible for a disease, requires the acquisition of a set of data from each of a large number of individuals. This data set includes the values of multiple genetic markers. These genetic markers occur at discrete positions along the genome, which is a collection of one or more linear chromosomes. Typing the value of a marker in an individual carries a cost; one seeks to minimize the number of markers typed without excessively jeopardizing the probability of detecting an association between a marker and a disease phenotype. MAGMA is a project which employ''s a multiobjective evolutionary algorithm to solve this problem.
Proper citation: MAGMA (RRID:SCR_005757) Copy
http://www.ici.upmc.fr/cluego/
A Cytoscape plug-in that visualizes the non-redundant biological terms for large clusters of genes in a functionally grouped network. It can be used in combination with GOlorize. The identifiers can be uploaded from a text file or interactively from a network of Cytoscape. The type of identifiers supported can be easily extended by the user. ClueGO performs single cluster analysis and comparison of clusters. From the ontology sources used, the terms are selected by different filter criteria. The related terms which share similar associated genes can be combined to reduce redundancy. The ClueGO network is created with kappa statistics and reflects the relationships between the terms based on the similarity of their associated genes. On the network, the node colour can be switched between functional groups and clusters distribution. ClueGO charts are underlying the specificity and the common aspects of the biological role. The significance of the terms and groups is automatically calculated. ClueGO is easy updatable with the newest files from Gene Ontology and KEGG. Platform: Windows compatible, Mac OS X compatible, Linux compatible, Unix compatible, THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025.
Proper citation: ClueGO (RRID:SCR_005748) Copy
Curated protein-protein and genetic interaction repository of raw protein and genetic interactions from major model organism species, with data compiled through comprehensive curation efforts.
Proper citation: Biological General Repository for Interaction Datasets (BioGRID) (RRID:SCR_007393) Copy
http://medgene.med.harvard.edu/MEDGENE/
An algorithm that generates lists of genes associated with a gene or one or more disorders. The algorithm can be used in high-throughput screening experiments, can create disease-specific micro-arrays, and can sort the results of gene profiling data. Based on the co-citations of all Medline records, MedGene can retrieve the following relationships: 1. A list of human genes associated with a particular human disease in ranking order 2. A list of human genes associated with multiple human diseases in ranking order 3. A list of human diseases associated with a particular human gene in ranking order 4. A list of human genes associated with a particular human gene in ranking order 5. The sorted gene list from other disease related high-throughput experiments, such as micro-array 6. The sorted gene list from other gene related high-throughput experiments, such as micro-array
Proper citation: MedGene (RRID:SCR_008122) Copy
http://www.bioconductor.org/packages/devel/bioc/html/ChIPXpress.html
A R package designed to improve ChIP-seq and ChIP-chip target gene ranking using publicly available gene expression data. It takes as input predicted transcription factor (TF) bound genes from ChIPx data and uses a corresponding database of gene expression profiles downloaded from NCBI GEO to rank the TF bound targets in order of which gene is most likely to be functional TF target.
Proper citation: ChIPXpress (RRID:SCR_006653) Copy
A semantically annotated corpus of 240 MEDLINE abstracts (167 on the subject of E. coli species and 73 on the subject of the Human species) intended for training information extraction (IE) systems and/or resources which are used to extract events from biomedical literature. The corpus has been manually annotated with events relating to gene regulation by biologists. Each event is centered on either a verb (e.g. transcribe) or nominalized verb (e.g. transcription) and annotation consists of identifying, as exhaustively as possible, the structurally-related arguments of the verb or nominalized verb within the same sentence. Each event argument is then assigned the following information: * A semantic role from a fixed set of 13 roles which are tailored to the biomedical domain. * A biomedical concept type (where appropriate). The corpus in available for download in 2 formats: * A standoff format, based on the BioNLP'09 Shared Task format * An XML format, based on the GENIA event annotation format
Proper citation: GREC Corpus (RRID:SCR_006719) Copy
Publicly available database of the genes, proteins, experimentally-verified interactions and signaling pathways involved in the innate immune response of humans, mice and bovines to microbial infection. The database captures coverage of the innate immunity interactome by integrating known interactions and pathways from major public databases together with manually-curated data into a centralized resource. The database can be mined as a knowledgebase or used with the integrated bioinformatics and visualization tools for the systems level analysis of the innate immune response. Although InnateDB curation focuses on innate immunity-relevant interactions and pathways, it also incorporates detailed annotation on the entire human, mouse and bovine interactomes by integrating data (178,000+ interactions & 3,900+ pathways) from several of the major public interaction and pathway databases. InnateDB also has integrated human, mouse and bovine orthology predictions generated using Ortholgue software. Ortholgue uses a phylogenetic distance-based method to identify possible paralogs in high-throughput orthology predictions. Integrated human and mouse conserved gene order and synteny information has also been determined to provide further support for orthology predictions. InnateDB Capabilities: * View statistics for manually-curated innate immunity relevant molecular interactions. New manually curated interactions are submitted weekly. * Search for genes and proteins of interest. * Search for experimentally-verified molecular interactions by gene/protein name, interaction type, cell type, etc. * Search genes/interactions belonging to 3,900 pathways. * Visualize interactions using an intuitive subcellular localization-based layout in Cerebral. * Upload your own list of genes along with associated gene expression data (from up to 10 experimental conditions) to interactively analyze this data in a molecular interaction network context. Once you have uploaded your data, you will be able to interactively visualize interaction networks with expression data overlaid; carry out Pathway, Gene Ontology and Transcription Factor Binding Site over-representation analyses; construct orthologous interaction networks in other species; and much more. * Access curated interaction data via a dedicated PSICQUIC webservice.
Proper citation: InnateDB (RRID:SCR_006714) Copy
http://bioweb.ensam.inra.fr/esther
Database and tools for analysis of protein and nucleic acid sequences belonging to superfamily of alpha/beta hydrolases homologous to cholinesterases. Covers multiple species, including human, mouse caenorhabditis and drosophila., THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025.
Proper citation: ESTHER (RRID:SCR_002621) Copy
THIS RESOURCE IS NO LONGER IN SERVICE. Documented on January 14,2026. Integrated database of genomic, expression and protein data for Drosophila, Anopheles, C. elegans and other organisms. You can run flexible queries, export results and analyze lists of data. FlyMine presents data in categories, with each providing information on a particular type of data (for example Gene Expression or Protein Interactions). Template queries, as well as the QueryBuilder itself, allow you to perform searches that span data from more than one category. Advanced users can use a flexible query interface to construct their own data mining queries across the multiple integrated data sources, to modify existing template queries or to create your own template queries. Access our FlyMine data via our Application Programming Interface (API). We provide client libraries in the following languages: Perl, Python, Ruby and & Java API
Proper citation: FlyMine (RRID:SCR_002694) Copy
A biotechnology company that has developed technology for synthesizing custom microarrays, the FlexArrayer. Its is a desk-top sized instrument which allows the researcher to generate, in their own laboratory, either a custom oligonucleotide array in a single day or oligonucleotide pool in a few days. Recent developments in synthesis chemistry allows many modifications to be incorporated or for alternative chemistries to be considered., THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025.
Proper citation: FlexGen (RRID:SCR_003902) Copy
http://sourceforge.net/projects/hlaseq/
An open-source software tool for accurate genotyping the human HLA genes from Illumina GA high-throughput sequencing data.
Proper citation: HLASeq (RRID:SCR_004185) Copy
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the SPARC SAWG Resources search. From here you can search through a compilation of resources used by SPARC SAWG and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that SPARC SAWG has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on SPARC SAWG then you can log in from here to get additional features in SPARC SAWG such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into SPARC SAWG you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the sources that were queried against in your search that you can investigate further.
Here are the categories present within SPARC SAWG that you can filter your data on
Here are the subcategories present within this category that you can filter your data on
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.