Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
http://cmb.gis.a-star.edu.sg/ChIPSeq/paperCCAT.htm
THIS RESOURCE IS OUT OF SERVICE, documented on April 5, 2017, A software package for the analysis of ChIP-seq data with negative control., THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025.
Proper citation: CCAT (RRID:SCR_001843) Copy
https://urgi.versailles.inra.fr/Tools/S-Mart
Software toolbox that manages your RNA-Seq and ChIP-Seq data and also produces many different plots to visualize your data. It performs several tasks that are usually required during the analysis of mapped RNA-Seq and ChIP-Seq reads, including data selection and data visualization. It includes the selection (or the exclusion) of the data that overlaps with a reference set, clustering and comparative analysis. It also provides many ways to visualize data: size of the reads, density on the genome, distance with respect to a reference set, and the correlation of two data sets (with cloud plots). A computer science background is not required to run it through a graphical interface and it can be run on any personal computer, yielding results within an hour for most queries.
Proper citation: S-MART (RRID:SCR_001908) Copy
http://www.bioconductor.org/packages/release/bioc/html/SamSPECTRAL.html
Software that identifies cell population in flow cytometry data. It demonstrates significant advantages in proper identification of populations with non-elliptical shapes, low density populations close to dense ones, minor subpopulations of a major population and rare populations. It samples large data such that spectral clustering is possible while preserving density information in edge weights. More specifically, given a matrix of coordinates as input, SamSPECTRAL first builds the communities to sample the data points. Then, it builds a graph and after weighting the edges by conductance computation, the graph is passed to a classic spectral clustering algorithm to find the spectral clusters. The last stage of SamSPECTRAL is to combine the spectral clusters. The resulting connected components estimate biological cell populations in the data sample.
Proper citation: SamSPECTRAL (RRID:SCR_001858) Copy
http://www.bioconductor.org/packages/2.13/bioc/html/cqn.html
A normalization tool for RNA-Seq data, implementing the conditional quantile normalization method.
Proper citation: CQN (RRID:SCR_001786) Copy
Suite of motif-based sequence analysis tools to discover motifs using MEME, DREME (DNA only) or GLAM2 on groups of related DNA or protein sequences; search sequence databases with motifs using MAST, FIMO, MCAST or GLAM2SCAN; compare a motif to all motifs in a database of motifs; associate motifs with Gene Ontology terms via their putative target genes, and analyze motif enrichment using SpaMo or CentriMo. Source code, binaries and a web server are freely available for noncommercial use.
Proper citation: MEME Suite - Motif-based sequence analysis tools (RRID:SCR_001783) Copy
Software package for a DNA assembly program designed for de novo assembly of 25-40mer input fragments and deep sequence coverage.
Proper citation: SHARCGS (RRID:SCR_002026) Copy
http://www.cs.sunysb.edu/~skiena/shorty/
Software for targeted de novo assembly of microreads with mate pair information and sequencing errors.
Proper citation: SHORTY (RRID:SCR_002048) Copy
Database of genetic and molecular biological information about the filamentous fungi of the genus Aspergillus including information about genes and proteins of Aspergillus nidulans and Aspergillus fumigatus; descriptions and classifications of their biological roles, molecular functions, and subcellular localizations; gene, protein, and chromosome sequence information; tools for analysis and comparison of sequences; and links to literature information; as well as a multispecies comparative genomics browser tool (Sybil) for exploration of orthology and synteny across multiple sequenced Sgenus species. Also available are Gene Ontology (GO) and community resources. Based on the Candida Genome Database, the Aspergillus Genome Database is a resource for genomic sequence data and gene and protein information for Aspergilli. Among its many species, the genus contains an excellent model organism (A. nidulans, or its teleomorph Emericella nidulans), an important pathogen of the immunocompromised (A. fumigatus), an agriculturally important toxin producer (A. flavus), and two species used in industrial processes (A. niger and A. oryzae). Search options allow you to: *Search AspGD database using keywords. *Find chromosomal features that match specific properties or annotations. *Find AspGD web pages using keywords located on the page. *Find information on one gene from many databases. *Search for keywords related to a phenotype (e.g., conidiation), an allele (such as veA1), or an experimental condition (e.g., light). Analysis and Tools allow you to: *Find similarities between a sequence of interest and Aspergillus DNA or protein sequences. *Display and analyze an Aspergillus sequence (or other sequence) in many ways. *Navigate the chromosomes set. View nucleotide and protein sequence. *Find short DNA/protein sequence matches in Aspergillus. *Design sequencing and PCR primers for Aspergillus or other input sequences. *Display the restriction map for a Aspergillus or other input sequence. *Find similarities between a sequence of interest and fungal nucleotide or protein sequences. AspGD welcomes data submissions.
Proper citation: ASPGD (RRID:SCR_002047) Copy
http://www.bioconductor.org/packages/release/bioc/html/ADaCGH2.html
Software for analysis and plotting of array comparative genomic hybridization (CGH) data. It allows usage of Circular Binary Segementation, wavelet-based smoothing (both as in Liu et al., and HaarSeg as in Ben-Yaacov and Eldar), HMM, BioHMM, GLAD, CGHseg. Most computations are parallelized (either via forking or with clusters, including MPI and sockets clusters) and use ff for storing data.
Proper citation: ADaCGH2 (RRID:SCR_001981) Copy
https://cran.r-project.org/src/contrib/Archive/PurBayes/
An MCMC-based algorithm that uses next-generation sequencing data to estimate tumor purity and clonality for paired tumor-normal data.
Proper citation: PurBayes (RRID:SCR_002068) Copy
Multi-organism, publicly accessible compendium of peptides identified in a large set of tandem mass spectrometry proteomics experiments. Mass spectrometer output files are collected for human, mouse, yeast, and several other organisms, and searched using the latest search engines and protein sequences. All results of sequence and spectral library searching are subsequently processed through the Trans Proteomic Pipeline to derive a probability of correct identification for all results in a uniform manner to insure a high quality database, along with false discovery rates at the whole atlas level. The raw data, search results, and full builds can be downloaded for other uses. All results of sequence searching are processed through PeptideProphet to derive a probability of correct identification for all results in a uniform manner ensuring a high quality database. All peptides are mapped to Ensembl and can be viewed as custom tracks on the Ensembl genome browser. The long term goal of the project is full annotation of eukaryotic genomes through a thorough validation of expressed proteins. The PeptideAtlas provides a method and a framework to accommodate proteome information coming from high-throughput proteomics technologies. The online database administers experimental data in the public domain. You are encouraged to contribute to the database.
Proper citation: PeptideAtlas (RRID:SCR_006783) Copy
http://www.ensemblgenomes.org/
Database portal offering integrated access to genome-scale data from non-vertebrate species of scientific interest, developed using the Ensembl genome annotation and visualization platform. Ensembl Genomes consists of five sub-portals (for bacteria, protists, fungi, plants and invertebrate metazoa) designed to complement the availability of vertebrate genomes in Ensembl. Many of the databases supporting the portal have been built in close collaboration with the scientific community - essential for maintaining the accuracy and usefulness of the resource. A common set of user interfaces (which include a graphical genome browser, FTP, BLAST search, a query optimized data warehouse, programmatic access, and a Perl API) is provided for all domains. Data types incorporated include annotation of (protein and non-protein coding) genes, cross references to external resources, and high throughput experimental data (e.g. data from large scale studies of gene expression and polymorphism visualized in their genomic context). Additionally, extensive comparative analysis has been performed, both within defined clades and across the wider taxonomy, and sequence alignments and gene trees resulting from this can be accessed through the site.
Proper citation: Ensembl Genomes (RRID:SCR_006773) Copy
canSAR is an integrated database that brings together biological, chemical, pharmacological (and eventually clinical) data. Its goal is to integrate this data and make it accessible to cancer research scientists from multiple disciplines, in order to help with hypothesis generation in cancer research and support translational research. This cancer research and drug discovery resource was developed to utilize the growing publicly available biological annotation, chemical screening, RNA interference screening, expression, amplification and 3D structural data. Scientists can, in a single place, rapidly identify biological annotation of a target, its structural characterization, expression levels and protein interaction data, as well as suitable cell lines for experiments, potential tool compounds and similarity to known drug targets. canSAR has, from the outset, been completely use-case driven which has dramatically influenced the design of the back-end and the functionality provided through the interfaces. The Web interface provides flexible, multipoint entry into canSAR. This allows easy access to the multidisciplinary data within, including target and compound synopses, bioactivity views and expert tools for chemogenomic, expression and protein interaction network data.
Proper citation: canSAR (RRID:SCR_006794) Copy
Curated collection of known Drosophila transcriptional cis-regulatory modules (CRMs) and transcription factor binding sites (TFBSs). Includes experimentally verified fly regulatory elements along with their DNA sequence, associated genes, and expression patterns they direct. Submission of experimentally verified cis-regulatory elements that are not included in REDfly database are welcome.
Proper citation: REDfly Regulatory Element Database for Drosophilia (RRID:SCR_006790) Copy
https://github.com/friend1ws/EBCall
A software package for somatic mutation detection (including InDels). EBCall uses not only paired tumor/normal sequence data of a target sample, but also multiple non-paired normal reference samples for evaluating distribution of sequencing errors, which leads to an accurate mutaiton detection even in case of low sequencing depths and low allele frequencies.
Proper citation: EBCall (RRID:SCR_006791) Copy
http://bioconductor.org/packages/2.9/bioc/html/RamiGO.html
Software package with an R interface sending requests to AmiGO visualize, retrieving DAG GO trees, parsing GraphViz DOT format files and exporting GML files for Cytoscape. Also uses RCytoscape to interactively display AmiGO trees in Cytoscape.
Proper citation: RamiGO (RRID:SCR_006922) Copy
http://seqbarracuda.sourceforge.net/
A sequence mapping software that utilizes the massive parallelism of graphics processing units to accelerate the inexact alignment of short sequence reads to a particular location on a reference genome. It can align a paired-end library containing 14 million pairs of 76bp reads to the Human genome in about 27 minutes (from fastq files to SAM alignment) using a ��380 NVIDIA Geforce GTX 680*. The alignment throughput can be boosted further by using multiple GPUs (up to 8) at the same time. Being based on BWA (http://bio-bwa.sf.net) from the Sanger Institute, BarraCUDA delivers a high level of alignment fidelity and is comparable to other mainstream alignment programs. It can perform gapped alignment with gap extensions, in order to minimise the number of false variant calls in re-sequencing studies.
Proper citation: BarraCUDA (RRID:SCR_006881) Copy
http://autismkb.cbi.pku.edu.cn/
Genetic factors contribute significantly to ASD. AutismKB is an evidence-based knowledgebase of Autism spectrum disorder (ASD) genetics. The current version contains 2193 genes (99 syndromic autism related genes and 2135 non-syndromic autism related genes), 4617 Copy Number Variations (CNVs) and 158 linkage regions associated with ASD by one or more of the following six experimental methods: # Genome-Wide Association Studies (GWAS); # Genome-wide CNV studies; # Linkage analysis; # Low-scale genetic association studies; # Expression profiling; # Other low-scale gene studies. Based on a scoring and ranking system, 99 syndromic autism related genes and 383 non-syndromic autism related genes (434 genes in total) were designated as having high confidence. Autism spectrum disorder (ASD) is a heterogeneous neurodevelopmental disorder with a prevalence of 1.0-2.6%. The three core symptoms of ASD are: # impairments in reciprocal social interaction; # communication impairments; # presence of restricted, repetitive and stereotyped patterns of behavior, interests and activities.
Proper citation: AutismKB (RRID:SCR_006937) Copy
http://bowtie-bio.sourceforge.net/myrna/index.shtml
A cloud computing tool for calculating differential gene expression in large RNA-seq datasets. It uses Bowtie for short read alignment and R/Bioconductor for interval calculations, normalization, and statistical testing. These tools are combined in an automatic, parallel pipeline that runs in the cloud (Elastic MapReduce in this case) on a local Hadoop cluster, or on a single computer, exploiting multiple computers and CPUs wherever possible.
Proper citation: Myrna (RRID:SCR_006951) Copy
https://github.com/jstjohn/SimSeq
An illumina paired-end and mate-pair short read simulator. This project attempts to model as many of the quirks that exist in Illumina data as possible. Some of these quirks include the potential for chimeric reads, and non-biotinylated fragment pull down in mate-pair libraries .
Proper citation: SimSeq (RRID:SCR_006947) Copy
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the dkNET Resources search. From here you can search through a compilation of resources used by dkNET and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that dkNET has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on dkNET then you can log in from here to get additional features in dkNET such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into dkNET you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the sources that were queried against in your search that you can investigate further.
Here are the categories present within dkNET that you can filter your data on
Here are the subcategories present within this category that you can filter your data on
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.