Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
https://github.com/lmrodriguezr/nonpareil
Estimate average coverage and create Nonpareil curves for metagenomic datasets.
Proper citation: Nonpareil (RRID:SCR_004629) Copy
http://omics.informatics.indiana.edu/AbundanceBin/
An abundance-based software tool for binning metagenomic sequences, such that the reads classified in a bin belong to species of identical or very similar abundances. AbundanceBin also gives estimations of species abundances and their genome sizes -two important characteristic parameters for a microbial community.
Proper citation: AbundanceBin (RRID:SCR_004648) Copy
http://compbio.cs.sfu.ca/software-variation-hunter
A software tool for discovery of structural variation in one or more individuals simultaneously using high throughput technologies.
Proper citation: VariationHunter (RRID:SCR_004865) Copy
http://www.cbcb.umd.edu/software/phymm/
Software for Phylogenetic Classification of Metagenomic Data with Interpolated Markov Models to taxonomically classify DNA sequences and accurately classify reads as short as 100 bp. PhymmBL, the hybrid classifier included in this distribution which combines analysis from both Phymm and BLAST, produces even higher accuracy.
Proper citation: Phymm and PhymmBL (RRID:SCR_004751) Copy
https://code.google.com/p/destruct/
A software tool for identifying structural variation in tumour genomes from whole genome illumina sequencing.
Proper citation: deStruct (RRID:SCR_004747) Copy
http://bioinformatics.rutgers.edu/Software/SLiQ/
Software for simple linear inequalities based Mate-Pair reads filtering and scaffolding. A set of simple linear inequalities (SLIQ) derived from the geometry of contigs on the line that can be used to predict the relative positions and orientations of contigs from individual mate pair reads and thus produce a contig digraph. The SLIQ inequalities can also filter out unreliable mate pairs and can be used as a pre-processing step for any scaffolding algorithm. This tool filters mate pairs and then produces a Directed Contig Graph (contig diGraph). Also provided is a Naive scaffolder that can then produce scaffolds out of the contig diGraph.
Proper citation: SLIQ (RRID:SCR_005003) Copy
http://cortexassembler.sourceforge.net/index_cortex_var.html
A tool for genome assembly and variation analysis from sequence data. You can use it to discover and genotype variants on single or multiple haploid or diploid samples. If you have multiple samples, you can use Cortex to look specifically for variants that distinguish one set of samples (eg phenotype=X, cases, parents, tumour) from another set of samples (eg phenotype=Y, controls, child, normal). cortex_var features * Variant discovery by de novo assembly - no reference genome required * Supports multicoloured de Bruijn graphs - have multiple samples loaded into the same graph in different colours, and find variants that distinguish them. * Capable of calling SNPs, indels, inversions, complex variants, small haplotypes * Extremely accurate variant calling - see our paper for base-pair-resolution validation of entire alleles (rather than just breakpoints) of SNPs, indels and complex variants by comparison with fully sequenced (and finished) fosmids - a level of validation beyond that demanded of any other variant caller we are aware of - currently cortex_var is the most accurate variant caller for indels and complex variants. * Capable of aligning a reference genome to a graph and using that to call variants * Support for comparing cases/controls or phenotyped strains * Typical memory use: 1 high coverage human in under 80Gb of RAM, 1000 yeasts in under 64Gb RAM, 10 humans in under 256 Gb RAM
Proper citation: cortex var (RRID:SCR_005081) Copy
http://www.physics.rutgers.edu/~anirvans/SOPRA/
Software tool to exploit the mate pair/paired-end information for assembly of short reads from high throughput sequencing platforms, e.g. Illumina and SOLiD.
Proper citation: SOPRA (RRID:SCR_005035) Copy
http://www.baseclear.com/landingpages/basetools-a-wide-range-of-bioinformatics-solutions/sspacev12/
A stand-alone software program for scaffolding pre-assembled contigs using paired-read data. Main features are: a short runtime, multiple library input of paired-end and/or mate pair datasets and possible contig extension with unmapped sequence reads.
Proper citation: SSPACE (RRID:SCR_005056) Copy
http://meringlab.org/software/hpc-clust/
A set of tools designed to cluster large numbers (>1 million) of pre-aligned nucleotide sequences. It performs the clustering of sequences using the Hierarchical Clustering Algorithm (HCA). There are currently three different cluster metrics implemented: single-linkage, complete-linkage, and average-linkage. In addition, there are currently four sequence distance functions implemented, these are: identity (gap-gap counting as match), nogap (gap-gap being ignored), nogap-single (like nogap, but consecutive gap-nogap''s count as a single mismatch), tamura (distance is calculated with the knowledge that transitions are more likely than transversions). One advantage that HCA has over other algorithms is that instead of producing only the clustering at a given threshold, it produces the set of merges occuring at each threshold. With this approach, the clusters can afterwards very quickly be reported for every arbitrary threshold with little extra computation. This approach also allows the plotting of the variation of number of clusters with clustering threshold without requiring the clustering to be run for each threshold independently. Another feature of the way HPC-CLUST is implemented is that the single-, complete-, and average-linkage clusterings can be computed in a single run with little overhead.
Proper citation: HPC-CLUST (RRID:SCR_005052) Copy
http://plaza.ufl.edu/xywang/Mpick.htm
A modularity-based clustering software for Operational Taxonomic Unit (OTU) picking of 16S rRNA sequences. The algorithm does not require a predetermined cut-off level, and our simulation studies suggest that it is superior to existing methods that require specified distance or variance levels to define OTUs.
Proper citation: M-pick (RRID:SCR_004995) Copy
http://plaza.ufl.edu/sunyijun/ES-Tree.htm
Software for hierarchical Clustering Analysis of Millions of 16S rRNA Pyrosequences in Quasi-linear Time.
Proper citation: ESPRIT-Tree (RRID:SCR_005045) Copy
http://biohealth.snu.ac.kr/software/TRAP/
A comprehensive software package integrating all necessary tasks such as mapping short reads, measuring gene expression levels, finding differentially expressed genes (DEGs), clustering and pathway analysis for time-series data in a single environment.
Proper citation: Time-series RNA-seq Analysis Package (RRID:SCR_002935) Copy
http://pfind.ict.ac.cn/software/pBuild/index.html
A software tool that can compare several search engines' results and combine them together.
Proper citation: pBuild (RRID:SCR_002929) Copy
http://cran.r-project.org/src/contrib/Archive/aCGH.Spline/
An R package for array comparative genomic hybridization (aCGH) dye bias normalization.
Proper citation: aCGH.Spline (RRID:SCR_002927) Copy
https://github.com/GiBacci/StreamingTrim/
A DNA reads trimming software, written in Java, with which researchers are able to analyse the quality of DNA sequences in fastq files and to search for low-quality zones in a very conservative way.
Proper citation: StreamingTrim (RRID:SCR_002922) Copy
http://pfind.ict.ac.cn/software/pLabel/index.html
Mass spectral peak labeling software developed for proteomics research., THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025.
Proper citation: pLabel (RRID:SCR_002923) Copy
https://github.com/itojal/hot_scan
A free software to detect genomic regions unusually rich in translocation breakpoints. More generally, it may be used to detect a region that is unusually rich in a given character of a binary sequence.
Proper citation: hot scan (RRID:SCR_002840) Copy
A fast and versatile Open Source docking software program that can be used to dock small molecules against proteins and nucleic acids.
Proper citation: rDock (RRID:SCR_002838) Copy
http://www.bioconductor.org/packages/release/bioc/html/Basic4Cseq.html
An R/Bioconductor package for basic filtering, analysis and subsequent near-cis visualization of 4C-seq data. Virtual fragment libraries can be created for any BSGenome package, and filter functions for both reads and fragments and basic quality controls are included. Fragment data in the vicinity of the experiment's viewpoint can be visualized as a coverage plot based on a running median approach and a multi-scale contact profile.
Proper citation: Basic4Cseq (RRID:SCR_002836) Copy
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the SPARC SAWG Resources search. From here you can search through a compilation of resources used by SPARC SAWG and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that SPARC SAWG has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on SPARC SAWG then you can log in from here to get additional features in SPARC SAWG such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into SPARC SAWG you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the sources that were queried against in your search that you can investigate further.
Here are the categories present within SPARC SAWG that you can filter your data on
Here are the subcategories present within this category that you can filter your data on
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.