Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
| Resource Name | Proper Citation | Abbreviations | Resource Type |
Description |
Keywords | Resource Relationships | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
FunCluster Resource Report Resource Website 1+ mentions |
FunCluster (RRID:SCR_005774) | FunCluster | data analysis software, software resource, data processing software, software application | FunCluster is a genomic data analysis algorithm which performs functional analysis of gene expression data obtained from cDNA microarray experiments. Besides automated functional annotation of gene expression data, FunCluster functional analysis aims to detect co-regulated biological processes through a specially designed clustering procedure involving biological annotations and gene expression data. FunCluster''''s functional analysis relies on Gene Ontology and KEGG annotations and is currently available for three organisms: Homo Sapiens, Mus Musculus and Saccharomyces Cerevisiae. FunCluster is provided as a standalone R package, which can be run on any operating system for which an R environment implementation is available (Windows, Mac OS, various flavors of Linux and Unix). Download it from the FunCluster website, or from the worldwide mirrors of CRAN. FunCluster is provided freely under the GNU General Public License 2.0. Platform: Windows compatible, Mac OS X compatible, Linux compatible, Unix compatible | genomic, gene, functional analysis, gene expression, cdna microarray, cdna, microarray, function, cluster, annotation, biological process, statistical analysis, bio.tools |
is listed by: Gene Ontology Tools is listed by: bio.tools is listed by: Debian is related to: Gene Ontology has parent organization: Cordelier Research Center |
PMID:17007070 PMID:16506959 PMID:16046292 |
Free for academic use, GNU General Public License, v2 | nlx_149242, biotools:funcluster | https://bio.tools/funcluster | SCR_005774 | FunCluster R Package, FunCluster Algorithm | 2026-02-14 02:00:58 | 2 | |||||
|
InterProScan Resource Report Resource Website 5000+ mentions |
InterProScan (RRID:SCR_005829) | web service, data processing software, data analysis service, analysis service resource, data analysis software, production service resource, service resource, software application, data access protocol, software resource | Software package for functional analysis of sequences by classifying them into families and predicting presence of domains and sites. Scans sequences against InterPro's signatures. Characterizes nucleotide or protein function by matching it with models from several different databases. Used in large scale analysis of whole proteomes, genomes and metagenomes. Available as Web based version and standalone Perl version and SOAP Web Service. | functional, analysis, sequence, protein, nucleotide, predict, presence, domain, site, proteome, genome, metagenome, bio.tools |
is listed by: Gene Ontology Tools is listed by: OMICtools is listed by: bio.tools is listed by: Debian is listed by: SoftCite is related to: Gene Ontology is related to: RARTF is related to: InterPro is related to: LegumeIP is related to: UniProtKB has parent organization: European Bioinformatics Institute |
European Union ; Biotechnology and Biological Sciences Research Council ; EMBL |
PMID:15980438 PMID:17202162 PMID:24451626 |
Free, Available for download, Freely available | OMICS_01479, biotools:interproscan_4, nlx_149337 | https://www.ebi.ac.uk/interpro/download.html https://bio.tools/interproscan_4 |
SCR_005829 | InterProScan Sequence Search, InterProScan 2, InterProScan 3, InterProScan 4, InterProScan 5 | 2026-02-14 02:01:11 | 6936 | |||||
|
ToppGene Suite Resource Report Resource Website 1000+ mentions |
ToppGene Suite (RRID:SCR_005726) | data analysis service, portal, analysis service resource, resource, data or information resource, production service resource, service resource, database | ToppGene Suite is a one-stop portal for gene list enrichment analysis and candidate gene prioritization based on functional annotations and protein interactions network. ToppGene Suite is a one-stop portal for (i) gene list functional enrichment, (ii) candidate gene prioritization using either functional annotations or network analysis and (iii) identification and prioritization of novel disease candidate genes in the interactome. Functional annotation-based disease candidate gene prioritization uses a fuzzy-based similarity measure to compute the similarity between any two genes based on semantic annotations. The similarity scores from individual features are combined into an overall score using statistical meta-analysis. | gene portal, enrichment analysis, functional annotation, gene prioritization, protein interaction, bio.tools, FASEB list |
is listed by: Gene Ontology Tools is listed by: NIDDK Information Network (dkNET) is listed by: GUDMAP Ontology is listed by: Debian is listed by: bio.tools is related to: Gene Ontology is related to: ToppCluster |
State of Ohio Computational Medicine Center ODD TECH 04-042; NIDDK 1U01DK70219; NIDDK P30DK078392 |
PMID:19465376 | Free for academic use | nlx_149183, biotools:toppgene_suite | https://bio.tools/toppgene_suite | SCR_005726 | ToppGene | 2026-02-14 02:01:10 | 1030 | |||||
|
ccPDB - Compilation and Creation of datasets from PDB Resource Report Resource Website 1+ mentions |
ccPDB - Compilation and Creation of datasets from PDB (RRID:SCR_005870) | ccPDB | web service, data or information resource, data access protocol, software resource, database | ccPDB (Compilation and Creation of datasets from PDB) is designed to provide service to scientific community working in the field of function or structure annoation of proteins. This database of datasets is based on Protein Data Bank (PDB), where all datasets were derived from PDB. ccPDB have four modules; i) compilation of datasets, ii) creation of datasets, iii) web services and iv) Important links. * Compilation of Datasets: Datasets at ccPDB can be classified in two categories, i) datasets collected from literature and ii) datasets compiled from PDB. We are in process of collecting PDB datasetsfrom literature and maintaining at ccPDB. We are also requesting community to suggest datasets. In addition, we generate datasets from PDB, these datasets were generated using commonly used standard protocols like non-redundant chains, structures solved at high resolution. * Creation of datasets: This module developed for creating customized datasets where user can create a dataset using his/her conditions from PDB. This module will be useful for those users who wish to create a new dataset as per ones requirement. This module have six steps, which are described in help page. * Web Services: We integrated following web services in ccPDB; i) Analyze of PDB ID service allows user to submit their PDB on around 40 servers from single point, ii) BLAST search allows user to perform BLAST search of their protein against PDB, iii) Structural information service is designed for annotating a protein structure from PDB ID, iv) Search in PDB facilitate user in searching structures in PDB, v)Generate patterns service facility to generate different types of patterns required for machine learning techniques and vi) Download useful information allows user to download various types of information for a given set of proteins (PDB IDs). * Important Links: One of major objectives of this web site is to provide links to web servers related to functional annotation of proteins. In first phase we have collected and compiled these links in different categories. In future attempt will be made to collect as many links as possible. | secondary structure, nucleic acid interaction, ligand interaction, structure, nucleic acid, interaction, ligand, data set, function, protein, annotate, tight-turn, nucleotide interacting residue, metals interacting residue, dna/rna binding residue, bio.tools |
is listed by: Debian is listed by: bio.tools is related to: Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB) has parent organization: Institute of Microbial Technology; Chandigarh; India |
OSDD ; DBT ; Council of Scientific and Industrial Research; New Delhi; India |
PMID:22139939 | biotools:ccpdb, nlx_149416 | https://bio.tools/ccpdb | SCR_005870 | Compilation and Creation of datasets from PDB, ccPDB - Compilation Creation of datasets from PDB | 2026-02-14 02:01:01 | 2 | |||||
|
UTRdb/UTRsite Resource Report Resource Website 10+ mentions |
UTRdb/UTRsite (RRID:SCR_005868) | data or information resource, portal, topical portal | UTRdb/UTRsite is a portal to other databases, including Nucleotide Sequence Databases, Protein Sequence Databases, other Sequence databanks, Untranslated Nucleotide Sequence Databases, Mitochondrial Databases, Mutation Databases, and others. The site also allows users to start long-term permanent projects or just to do quick searches, depending on the user''s needs. | bio.tools, FASEB list |
is listed by: bio.tools is listed by: Debian |
biotools:utrdb, nif-0000-03619 | https://bio.tools/utrdb | http://bighost.area.ba.cnr.it/srs6/ | SCR_005868 | UTRdb/UTRsite | 2026-02-14 02:00:59 | 40 | |||||||
|
ESTScan Resource Report Resource Website 100+ mentions |
ESTScan (RRID:SCR_005742) | ESTScan | data analysis software, software resource, data processing software, software application | ESTScan is a program that can detect coding regions in DNA sequences, even if they are of low quality. ESTScan will also detect and correct sequencing errors that lead to frameshifts. ESTScan is not a gene prediction program , nor is it an open reading frame detector. In fact, its strength lies in the fact that it does not require an open reading frame to detect a coding region. As a result, the program may miss a few translated amino acids at either the N or the C terminus, but will detect coding regions with high selectivity and sensitivity. ESTScan takes advantages of the bias in hexanucleotide usage found in coding regions relative to non-coding regions. This bias is formalized as an inhomogeneous 3-periodic fifth-order Hidden Markov Model (HMM). Additionally, the HMM of ESTScan has been extended to allows insertions and deletions when these improve the coding region statistics. | dna, dna sequence, coding region, perl module, c, btlib perl module |
is listed by: Debian is listed by: OMICtools has parent organization: SourceForge |
PMID:10786296 | OMICS_08423, nlx_149202 | https://sources.debian.org/src/estscan/ | SCR_005742 | ESTScan project | 2026-02-14 02:01:09 | 289 | ||||||
|
PeptideAtlas Resource Report Resource Website 100+ mentions |
PeptideAtlas (RRID:SCR_006783) | PeptideAtlas | data repository, storage service resource, data or information resource, service resource, database | Multi-organism, publicly accessible compendium of peptides identified in a large set of tandem mass spectrometry proteomics experiments. Mass spectrometer output files are collected for human, mouse, yeast, and several other organisms, and searched using the latest search engines and protein sequences. All results of sequence and spectral library searching are subsequently processed through the Trans Proteomic Pipeline to derive a probability of correct identification for all results in a uniform manner to insure a high quality database, along with false discovery rates at the whole atlas level. The raw data, search results, and full builds can be downloaded for other uses. All results of sequence searching are processed through PeptideProphet to derive a probability of correct identification for all results in a uniform manner ensuring a high quality database. All peptides are mapped to Ensembl and can be viewed as custom tracks on the Ensembl genome browser. The long term goal of the project is full annotation of eukaryotic genomes through a thorough validation of expressed proteins. The PeptideAtlas provides a method and a framework to accommodate proteome information coming from high-throughput proteomics technologies. The online database administers experimental data in the public domain. You are encouraged to contribute to the database. | proteomics, peptide, mass spectrometry, annotation, eukaryotic, genome, peptide sequence, high-throughput mass spectrometry, ensembl, peptideprophet, protein sequence, blood plasma, protein, eukaryotic cell, dna, bio.tools, FASEB list |
is used by: NIF Data Federation is used by: ProteomeXchange is recommended by: NIDDK Information Network (dkNET) is recommended by: National Library of Medicine is recommended by: NIDDK - National Institute of Diabetes and Digestive and Kidney Diseases is listed by: Biositemaps is listed by: re3data.org is listed by: bio.tools is listed by: Debian is related to: Ensembl is related to: ProteomeXchange is related to: NIH Data Sharing Repositories is related to: Integrated Manually Extracted Annotation has parent organization: Institute for Systems Biology; Washington; USA |
NCI ; NHGRI ; NIGMS |
PMID:20013378 PMID:23215161 PMID:16381952 PMID:15642101 |
Public, The community can contribute to this resource, Acknowledgement requested | nif-0000-03266, r3d100010889, biotools:peptideatlas | https://bio.tools/peptideatlas https://doi.org/10.17616/R3BK61 |
SCR_006783 | Peptide Atlas, PeptideAtlas | 2026-02-14 02:01:13 | 479 | ||||
|
ShortRead Resource Report Resource Website 100+ mentions |
ShortRead (RRID:SCR_006813) | ShortRead | software resource | Software package for input, quality assessment and exploration of high-throughput sequence data. Used for input, quality assurance, and basic manipulation of `short read'' DNA sequences such as those produced by Solexa, 454, and related technologies, including exible import of common short read data formats. | high throughput sequence data, short read, DNA sequences, short read data |
is listed by: OMICtools is listed by: Debian is listed by: SoftCite has parent organization: Bioconductor |
PMID:19654119 | Free, Available for download, Freely available | OMICS_01076 | https://sources.debian.org/src/r-bioc-shortread/ | SCR_006813 | ShortRead - Classes and methods for high-throughput short-read sequencing data. | 2026-02-14 02:01:24 | 225 | |||||
|
Ensembl Genomes Resource Report Resource Website 100+ mentions |
Ensembl Genomes (RRID:SCR_006773) | web service, data or information resource, data access protocol, software resource, database | Database portal offering integrated access to genome-scale data from non-vertebrate species of scientific interest, developed using the Ensembl genome annotation and visualization platform. Ensembl Genomes consists of five sub-portals (for bacteria, protists, fungi, plants and invertebrate metazoa) designed to complement the availability of vertebrate genomes in Ensembl. Many of the databases supporting the portal have been built in close collaboration with the scientific community - essential for maintaining the accuracy and usefulness of the resource. A common set of user interfaces (which include a graphical genome browser, FTP, BLAST search, a query optimized data warehouse, programmatic access, and a Perl API) is provided for all domains. Data types incorporated include annotation of (protein and non-protein coding) genes, cross references to external resources, and high throughput experimental data (e.g. data from large scale studies of gene expression and polymorphism visualized in their genomic context). Additionally, extensive comparative analysis has been performed, both within defined clades and across the wider taxonomy, and sequence alignments and gene trees resulting from this can be accessed through the site. | genome, gold standard, bio.tools, FASEB list |
is listed by: OMICtools is listed by: bio.tools is listed by: Debian is related to: Ensembl is related to: Ensembl is related to: g:Profiler has parent organization: European Bioinformatics Institute |
European Molecular Biology Laboratory ; European Union FELICS 021902 (RII3); BBSRC BB/F019793/1 |
PMID:24163254 PMID:19884133 |
r3d100011197, OMICS_01648, nlx_65207, biotools:ensembl_genomes | https://bio.tools/ensembl_genomes https://doi.org/10.17616/R3MW6M |
SCR_006773 | Ensembl Genomes: Extending Ensembl across the taxonomic space, EnsemblGenomes, Ensembl Genome | 2026-02-14 02:01:23 | 276 | ||||||
|
canSAR Resource Report Resource Website 50+ mentions |
canSAR (RRID:SCR_006794) | canSAR | data analysis service, analysis service resource, data or information resource, production service resource, service resource, database | canSAR is an integrated database that brings together biological, chemical, pharmacological (and eventually clinical) data. Its goal is to integrate this data and make it accessible to cancer research scientists from multiple disciplines, in order to help with hypothesis generation in cancer research and support translational research. This cancer research and drug discovery resource was developed to utilize the growing publicly available biological annotation, chemical screening, RNA interference screening, expression, amplification and 3D structural data. Scientists can, in a single place, rapidly identify biological annotation of a target, its structural characterization, expression levels and protein interaction data, as well as suitable cell lines for experiments, potential tool compounds and similarity to known drug targets. canSAR has, from the outset, been completely use-case driven which has dramatically influenced the design of the back-end and the functionality provided through the interfaces. The Web interface provides flexible, multipoint entry into canSAR. This allows easy access to the multidisciplinary data within, including target and compound synopses, bioactivity views and expert tools for chemogenomic, expression and protein interaction network data. | molecular target, expression, cell line, compound, molecule, protein, structure, ligand, drug, 3d, genomics, 3d complex, bioactivity, protein affinity, cell line sensitivity, pathway, annotation, bio.tools, FASEB list |
is listed by: Debian is listed by: bio.tools is related to: Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB) is related to: BindingDB is related to: Gene Ontology has parent organization: Cancer Research UK |
Cancer | Cancer Research UK C309/A8274 | PMID:22013161 | CanSAR is freely available to all cancer researchers. By using canSAR you are agreeing to the Terms of Use, Https://cansar.icr.ac.uk/cansar/terms-of-use/ | biotools:cansar, nlx_149410 | https://bio.tools/cansar | SCR_006794 | 2026-02-14 02:01:16 | 54 | ||||
|
REDfly Regulatory Element Database for Drosophilia Resource Report Resource Website 10+ mentions |
REDfly Regulatory Element Database for Drosophilia (RRID:SCR_006790) | REDfly | data repository, storage service resource, data or information resource, service resource, database | Curated collection of known Drosophila transcriptional cis-regulatory modules (CRMs) and transcription factor binding sites (TFBSs). Includes experimentally verified fly regulatory elements along with their DNA sequence, associated genes, and expression patterns they direct. Submission of experimentally verified cis-regulatory elements that are not included in REDfly database are welcome. | transcriptional cis-regulatory module, transcription factor binding site, dna sequence, gene, expression pattern, genome, gene expression, transcription factor, cis-regulatory module, bio.tools |
is listed by: OMICtools is listed by: bio.tools is listed by: Debian is related to: Drosophila anatomy and development ontologies is related to: FlyMine has parent organization: University at Buffalo; New York; USA |
NSF EF0843229; NIGMS U24 GM144232 |
PMID:20965965 PMID:18039705 PMID:16303794 |
Acknowledgement requested | OMICS_01870, biotools:redfly, nif-0000-03393 | https://bio.tools/redfly | SCR_006790 | Regulatory Element Database for Drosophilia, Regulatory Element Database | 2026-02-14 02:01:16 | 14 | ||||
|
EBCall Resource Report Resource Website 10+ mentions |
EBCall (RRID:SCR_006791) | EBCall | software resource | A software package for somatic mutation detection (including InDels). EBCall uses not only paired tumor/normal sequence data of a target sample, but also multiple non-paired normal reference samples for evaluating distribution of sequencing errors, which leads to an accurate mutaiton detection even in case of low sequencing depths and low allele frequencies. | mutation, cancer, genome, sequencing, bio.tools |
is listed by: OMICtools is listed by: bio.tools is listed by: Debian has parent organization: University of Tokyo; Tokyo; Japan |
PMID:23471004 | Copyright conditions, Acknowledgement required | biotools:ebcall, OMICS_00084 | https://bio.tools/ebcall | SCR_006791 | EBCall (Empirical Baysian mutation Calling), Empirical Baysian mutation Calling | 2026-02-14 02:01:23 | 19 | |||||
|
RamiGO Resource Report Resource Website 10+ mentions |
RamiGO (RRID:SCR_006922) | RamiGO | software resource | Software package with an R interface sending requests to AmiGO visualize, retrieving DAG GO trees, parsing GraphViz DOT format files and exporting GML files for Cytoscape. Also uses RCytoscape to interactively display AmiGO trees in Cytoscape. | visualization, analysis, ontology or annotation search engine, ontology or annotation visualization, other analysis, classification, go, graph, network, third party client, windows, mac os x, linux, unix, bio.tools |
is listed by: Gene Ontology Tools is listed by: OMICtools is listed by: Debian is listed by: bio.tools is related to: Gene Ontology is related to: Cytoscape is related to: AmiGO has parent organization: Dana-Farber Cancer Institute has parent organization: Bioconductor |
PMID:23297033 | Artistic License, v2 | biotools:ramigo, OMICS_02267, nlx_149331 | http://bioconductor.org/packages/release/bioc/html/RamiGO.html https://bio.tools/ramigo |
SCR_006922 | ramigo, RamiGO - AmiGO visualize R interface | 2026-02-14 02:01:15 | 11 | |||||
|
ArtificialFastqGenerator Resource Report Resource Website 10+ mentions |
ArtificialFastqGenerator (RRID:SCR_006880) | ArtificialFastqGenerator | software resource | Software to evaluate and improve the accuracy of sequencing error under different experimental conditions. It can identify which components of a system may be suboptimal and which regions of the genome may be problematic. | matlab, java, Next Generation Sequencing, aligns reads, reference genome |
is listed by: OMICtools is listed by: Debian |
PMID:23152858 | GNU GPL v3 | OMICS_00248, SCR_015979 | https://sources.debian.org/src/artfastqgenerator/ | SCR_006880 | Artfastqgenerator - Ouputs artificial FASTQ files derived from a reference genome | 2026-02-14 02:01:24 | 10 | |||||
|
BarraCUDA Resource Report Resource Website 1+ mentions |
BarraCUDA (RRID:SCR_006881) | BarraCUDA | software resource | A sequence mapping software that utilizes the massive parallelism of graphics processing units to accelerate the inexact alignment of short sequence reads to a particular location on a reference genome. It can align a paired-end library containing 14 million pairs of 76bp reads to the Human genome in about 27 minutes (from fastq files to SAM alignment) using a ��380 NVIDIA Geforce GTX 680*. The alignment throughput can be boosted further by using multiple GPUs (up to 8) at the same time. Being based on BWA (http://bio-bwa.sf.net) from the Sanger Institute, BarraCUDA delivers a high level of alignment fidelity and is comparable to other mainstream alignment programs. It can perform gapped alignment with gap extensions, in order to minimise the number of false variant calls in re-sequencing studies. | gpu/cuda, bio.tools |
is listed by: OMICtools is listed by: bio.tools is listed by: Debian has parent organization: University of Cambridge; Cambridge; United Kingdom has parent organization: SourceForge |
PMID:22244497 PMID:19451168 |
Acknowledgement requested | OMICS_00650, biotools:barracuda | https://bio.tools/barracuda | SCR_006881 | 2026-02-14 02:01:17 | 4 | ||||||
|
AutismKB Resource Report Resource Website 10+ mentions |
AutismKB (RRID:SCR_006937) | AutismKB | data analysis service, analysis service resource, data or information resource, production service resource, service resource, database | Genetic factors contribute significantly to ASD. AutismKB is an evidence-based knowledgebase of Autism spectrum disorder (ASD) genetics. The current version contains 2193 genes (99 syndromic autism related genes and 2135 non-syndromic autism related genes), 4617 Copy Number Variations (CNVs) and 158 linkage regions associated with ASD by one or more of the following six experimental methods: # Genome-Wide Association Studies (GWAS); # Genome-wide CNV studies; # Linkage analysis; # Low-scale genetic association studies; # Expression profiling; # Other low-scale gene studies. Based on a scoring and ranking system, 99 syndromic autism related genes and 383 non-syndromic autism related genes (434 genes in total) were designated as having high confidence. Autism spectrum disorder (ASD) is a heterogeneous neurodevelopmental disorder with a prevalence of 1.0-2.6%. The three core symptoms of ASD are: # impairments in reciprocal social interaction; # communication impairments; # presence of restricted, repetitive and stereotyped patterns of behavior, interests and activities. | gene, copy number variation, linkage region, genome-wide association study, family-based association study, case-control association study, expression profile, blast, syndromic, non-syndromic, snp, vntr, bio.tools, FASEB list |
is listed by: Debian is listed by: bio.tools is related to: Gene Ontology has parent organization: Peking University; Beijing; China |
Autism spectrum disorder, Autism | Merck ; Johnson and Johnson ; Natural Science Foundation of China 31025014; Natural Science Foundation of China 2011CBA01102 |
PMID:22139918 | biotools:autismkb, nlx_151318 | https://bio.tools/autismkb | SCR_006937 | Autism Knowledgebase | 2026-02-14 02:01:24 | 33 | ||||
|
Myrna Resource Report Resource Website 1+ mentions |
Myrna (RRID:SCR_006951) | Myrna | software resource | A cloud computing tool for calculating differential gene expression in large RNA-seq datasets. It uses Bowtie for short read alignment and R/Bioconductor for interval calculations, normalization, and statistical testing. These tools are combined in an automatic, parallel pipeline that runs in the cloud (Elastic MapReduce in this case) on a local Hadoop cluster, or on a single computer, exploiting multiple computers and CPUs wherever possible. | mapreduce, hadoop, cloud computing, differential expression, gene expression, rna-seq, bio.tools |
is listed by: OMICtools is listed by: Debian is listed by: bio.tools is related to: ReCount - A multi-experiment resource of analysis-ready RNA-seq gene count datasets has parent organization: Johns Hopkins University; Maryland; USA |
PMID:20701754 | Artistic License | OMICS_01310, biotools:myrna | https://github.com/BenLangmead/myrna https://bio.tools/myrna |
SCR_006951 | Myrna: Cloud-scale differential gene expression for RNA-seq | 2026-02-14 02:01:18 | 2 | |||||
|
SimSeq Resource Report Resource Website 10+ mentions |
SimSeq (RRID:SCR_006947) | SimSeq | software resource, simulation software, software application | An illumina paired-end and mate-pair short read simulator. This project attempts to model as many of the quirks that exist in Illumina data as possible. Some of these quirks include the potential for chimeric reads, and non-biotinylated fragment pull down in mate-pair libraries . | bio.tools |
is listed by: OMICtools is listed by: bio.tools is listed by: Debian |
Free | biotools:simseq, OMICS_00258 | https://bio.tools/simseq | SCR_006947 | 2026-02-14 02:01:26 | 29 | |||||||
|
WSsas - Web Service for the SAS tool Resource Report Resource Website |
WSsas - Web Service for the SAS tool (RRID:SCR_007051) | WSsas | data access protocol, software resource, web service | SAS is a tool for applying structural information to a given protein sequence. It uses FASTA to scan a given protein sequence against all the proteins of known 3D structure in the Protein Data Bank and provides functional residue annotation based on data from the Catalytic Site Atlas and PDBsum. The web service is aimed to facilitate the use of the SAS tool when having a huge number of queries. Currently, the web service provides annotation for binding sites (to ligand, metal or nucleic acid), catalytic residues and amino acids related to protein-protein interactions. | gold standard, bio.tools |
is listed by: bio.tools is listed by: Debian is related to: SAS - Sequence Annotated by Structure has parent organization: European Bioinformatics Institute |
nlx_18182, biotools:wssas | https://bio.tools/wssas | SCR_007051 | Web Service for the SAS tool | 2026-02-14 02:01:27 | 0 | |||||||
|
CD-HIT Resource Report Resource Website 1000+ mentions |
CD-HIT (RRID:SCR_007105) | CD-HIT | software resource, source code, data processing software, software application | THIS RESOURCE IS NO LONGER IN SERVICE. Documented on February 28,2023. Software program for clustering biological sequences with many applications in various fields such as making non-redundant databases, finding duplicates, identifying protein families, filtering sequence errors and improving sequence assembly etc. It is very fast and can handle extremely large databases. CD-HIT helps to significantly reduce the computational and manual efforts in many sequence analysis tasks and aids in understanding the data structure and correct the bias within a dataset. The CD-HIT package has CD-HIT, CD-HIT-2D, CD-HIT-EST, CD-HIT-EST-2D, CD-HIT-454, CD-HIT-PARA, PSI-CD-HIT, CD-HIT-OTU and over a dozen scripts. * CD-HIT (CD-HIT-EST) clusters similar proteins (DNAs) into clusters that meet a user-defined similarity threshold. * CD-HIT-2D (CD-HIT-EST-2D) compares 2 datasets and identifies the sequences in db2 that are similar to db1 above a threshold. * CD-HIT-454 identifies natural and artificial duplicates from pyrosequencing reads. * CD-HIT-OTU cluster rRNA tags into OTUs The usage of other programs and scripts can be found in CD-HIT user''s guide. CD-HIT was originally developed by Dr. Weizhong Li at Dr. Adam Godzik''s Lab at the Burnham Institute (now Sanford-Burnham Medical Research Institute)., THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 16,2025. | cluster, protein, sequence, classification, domain, analysis, nucleotide sequence, dna, protein sequence, bio.tools, FASEB list |
is listed by: Debian is listed by: bio.tools is listed by: OMICtools has parent organization: University of California at San Diego; California; USA has parent organization: Google Code is parent organization of: CD-HIT-OTU |
NCRR 1R01RR025030 | PMID:20053844 PMID:16731699 DOI:10.1093/bioinformatics/btl158 |
THIS RESOURCE IS NO LONGER IN SERVICE | OMICS_05157, biotools:cd-hit, nif-0000-30240 | http://cd-hit.org https://code.google.com/p/cdhit/ https://bio.tools/cd-hit https://sources.debian.org/src/cd-hit/ |
http://bioinformatics.ljcrf.edu/cd-hi/ |
SCR_007105 | CD-HIT Program | 2026-02-14 02:01:21 | 3203 |
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the nidm-terms Resources search. From here you can search through a compilation of resources used by nidm-terms and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that nidm-terms has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on nidm-terms then you can log in from here to get additional features in nidm-terms such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
If you are logged into nidm-terms you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the facets that you can filter the data by.
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.