Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
https://code.google.com/p/destruct/
A software tool for identifying structural variation in tumour genomes from whole genome illumina sequencing.
Proper citation: deStruct (RRID:SCR_004747) Copy
http://www.genedb.org/Homepage/Tbruceibrucei927
Database of the most recent sequence updates and annotations for the T. brucei genome. New annotations are constantly being added to keep up with published manuscripts and feedback from the Trypanosomatid research community. You may search by Protein Length, Molecular Mass, Gene Type, Date, Location, Protein Targeting, Transmembrane Helices, Product, GO, EC, Pfam ID, Curation and Comments, and Dbxrefs. BLAST and other tools are available. T. brucei possesses a two-unit genome, a nuclear genome and a mitochondrial (kinetoplast) genome with a total estimated size of 35Mb/haploid genome. The nuclear genome is split into three classes of chromosomes according to their size on pulsed-field gel electrophoresis, 11 pairs of megabase chromosomes (0.9-5.7 Mb), intermediate (300-900 kb) and minichromosomes (50-100 kb). The T. brucei genome contains a ~0.5Mb segmental duplication affecting chromosomes 4 and 8, which is responsible for some 75 gene duplicates unique to this species. A comparative chromosome map of the duplicons can be accessed here (PubmedID 18036214). Protozoan parasites within the species Trypanosoma brucei are the etiological agent of human sleeping sickness and Nagana in animals. Infections are limited to patches of sub-Saharan Africa where insects vectors of the Glossina genus are endemic. The most recent estimates indicate between 50,000 - 70,000 human cases currently exist, with 17 000 new cases each year (WHO Factsheet, 2006). In collaboration with GeneDB, the EuPathDB genomic sequence data and annotations are regularly deposited on TriTrypDB where they can be integrated with other datasets and queried using customized queries.
Proper citation: GeneDB Tbrucei (RRID:SCR_004786) Copy
http://www.baseclear.com/landingpages/basetools-a-wide-range-of-bioinformatics-solutions/sspacev12/
A stand-alone software program for scaffolding pre-assembled contigs using paired-read data. Main features are: a short runtime, multiple library input of paired-end and/or mate pair datasets and possible contig extension with unmapped sequence reads.
Proper citation: SSPACE (RRID:SCR_005056) Copy
Database of information regarding genome and metagenome sequencing projects, and their associated metadata, around the world. It also provides information related to organism properties such as phenotype, ecotype and disease. Both complete and ongoing projects, along with their associated metadata, can be accessed. Users can also register, annotate and publish genome and metagenome data.
Proper citation: Genomes Online Database (RRID:SCR_002817) Copy
A database designed for plant comparative and functional genomics based on complete genomes. It comprises complete proteome sequences from the major phylum of plant evolution. The clustering of these proteomes was performed to define a consistent and extensive set of homeomorphic plant families. Based on this, lists of gene families such as plant or species specific families and several tools are provided to facilitate comparative genomics within plant genomes. The analyses follow two main steps: gene family clustering and phylogenomic analysis of the generated families. Once a group of sequences (cluster) is validated, phylogenetic analyses are performed to predict homolog relationships such as orthologs and ultraparalogs.
Proper citation: GreenPhylDB (RRID:SCR_002834) Copy
Portal that supports Ambystoma-related research and educational efforts. It is composed of several resources: Salamander Genome Project, Ambystoma EST Database, Ambystoma Gene Collection, Ambystoma Map and Marker Collection, Ambystoma Genetic Stock Center, and Ambystoma Research Coordination Network.
Proper citation: Sal-Site (RRID:SCR_002850) Copy
Computational biology research at Memorial Sloan-Kettering Cancer Center (MSKCC) pursues computational biology research projects and the development of bioinformatics resources in the areas of: sequence-structure analysis; gene regulation; molecular pathways and networks, and diagnostic and prognostic indicators. The mission of cBio is to move the theoretical methods and genome-scale data resources of computational biology into everyday laboratory practice and use, and is reflected in the organization of cBio into research and service components ~ the intention being that new computational methods created through the process of scientific inquiry should be generalized and supported as open-source and shared community resources. Faculty from cBio participate in graduate training provided through the following graduate programs: * Gerstner Sloan-Kettering Graduate School of Biomedical Sciences * Graduate Training Program in Computational Biology and Medicine Integral to much of the research and service work performed by cBio is the creation and use of software tools and data resources. The tools that we have created and utilize provide evidence of our involvement in the following areas: * Cancer Genomics * Data Repositories * iPhone & iPod Touch * microRNAs * Pathways * Protein Function * Text Analysis * Transcription Profiling
Proper citation: Computational Biology Center (RRID:SCR_002877) Copy
http://www.broadinstitute.org/annotation/genome/magnaporthe_comparative/MultiHome.html
The Magnaporthe comparative genomics database provides accesses to multiple fungal genomes from the Magnaporthaceae family to facilitate the comparative analysis. As part of the Broad Fungal Genome Initiative, the Magnaporthe comparative project includes the finished M. oryzae (formerly M. grisea) genome, as well as the draft assemblies of Gaeumannomyces graminis var. tritici and M. poae. It provides users the tools to BLAST search, browse genome regions (to retrieve DNA, find clones, and graphically view sequence regions), and provides gene indexes and genome statistics. We were funded to attempt 7x sequence coverage comprising paired end reads from plasmids, Fosmids and BACs. Our strategy involves Whole Genome Shotgun (WGS) sequencing, in which sequence from the entire genome is generated and reassembled. Our specific aims are as follows: 1. Generate and assemble sequence reads yielding 7X coverage of the Magnaporthe oryzae genome through whole genome shotgun sequencing. 2. Generate and incorporate BAC and Fosmid end sequences into the genome assembly to provide a paired-end of average every 2 kb. 3. Integrate the genome sequence with existing physical and genetic map information. 4. Perform automated annotation of the sequence assembly. 5. Distribute the sequence assembly and results of our annotation and analysis through a freely accessible, public web server and by deposition of the sequence assembly in GenBank.
Proper citation: Magnaporthe comparative Database (RRID:SCR_003079) Copy
Project to determine the gene expression profiles of normal, precancer, and cancer cells, whose generated resources are available to the cancer community. Interconnected modules provide access to all CGAP data, bioinformatic analysis tools, and biological resources allowing the user to find in silico answers to biological questions in a fraction of the time it once took in the laboratory. * Genes * Tissues * Pathways * RNAi * Chromosomes * SAGE Genie * Tools
Proper citation: Cancer Genome Anatomy Project (RRID:SCR_003072) Copy
http://bibiserv.techfak.uni-bielefeld.de/dialign/
Tool for multiple sequence alignment using various sources of external information that is particularly useful to detect local homologies in sequences with low overall similarity. While standard alignment methods rely on comparing single residues and imposing gap penalties, DIALIGN constructs pairwise and multiple alignments by comparing entire segments of the sequences. No gap penalty is used. This approach can be used for both global and local alignment, but it is particularly successful in situations where sequences share only local homologies. Several versions of DIALIGN are available online at GOBICS, http://dialign.gobics.de/
Proper citation: DIALIGN (RRID:SCR_003041) Copy
http://resexomedb.bioinf-dz.org/
THIS RESOURCE IS NO LONGER IN SERVICE. Documented on October 28,2025. An online catalog for whole-exome sequencing (WES) results including mutations and gene-disease associations identified by WES. It is browsable and searchable by mutation, gene, study or publication. In addition, it centralizes all publications, software, platforms related to exome / whole genome sequencing.
Proper citation: resExomeDB (RRID:SCR_003224) Copy
http://compbio.cs.sfu.ca/software-novelseq
Software pipeline to detect novel sequence insertions using high throughput paired-end whole genome sequencing data.
Proper citation: NovelSeq (RRID:SCR_003136) Copy
http://mrcanavar.sourceforge.net/
Copy number caller that analyzes the whole-genome next-generation sequence mapping read depth to discover large segmental duplications and deletions. It also has the capability of predicting absolute copy numbers of genomic intervals.
Proper citation: mrCaNaVaR (RRID:SCR_003135) Copy
Nematode & Neglected Genomics (at) The Blaxter Lab is a nematode related portal including databases and services. Resources include genomic and transcriptomic databases for nematodes and other metazoan phyla and freely downloadable software tools for expressed sequence tag analysis, DNA barcode analysis and phylogenomics. Major categories include: * GenePool * 959 Nematode Genomes * Teaching * Research Projects * Bioinformatics Software Tools * Lab Personnel * Lab Wiki * Genomics Databases * NEMBASE4 * Tardigrada: Hypsibius dujardini * Earthworm: Lumbricus rubellus * MolluscDB * ArthropodDB * other Neglected Genomes
Proper citation: nematodes.org (RRID:SCR_003267) Copy
A biopharmaceutical company applying its discoveries in human genetics to develop drugs and diagnostics for common diseases. They specialize in gene discovery - their population approach and resources have enabled them to isolate key genes contributing to major public health challenges from cardiovascular disease to cancer. The company's genotyping capacity is now one of the highest in the world. They have a large population-based biobank containing whole blood and DNA samples with extensive relevant phenotypic information from around 120.000 Icelanders. In the company's work in more than 50 disease projects, their statistical and informatics departments have established themselves in data processing and analysis. deCODE genetics is widely recognized as a center of excellence in genetic research.
Proper citation: deCODE genetics (RRID:SCR_003334) Copy
http://www.lgm.upmc.fr/parseq/
Statistical software for transcription landscape reconstruction at a basepair resolution from RNA Seq read counts. It is based on a state-space model which describes, in terms of abrupt shifts and more progressive drifts, the transcription level dynamics along the genome. Alongside variations of transcription level, it incorporates a component of short-range variation to pull apart local artifacts causing correlated dispersion. Reconstruction of the transcription level relies on a conditional sequential Monte Carlo approach that is combined with parameter estimation in a Markov chain Monte Carlo algorithm known as particle Gibbs. The method allows to estimate the local transcription level, to call transcribed regions, and to identify the transcript borders.
Proper citation: Parseq (RRID:SCR_003464) Copy
http://compbio.dfci.harvard.edu/tgi/
THIS RESOURCE IS NO LONGER IN SERVICE, documented May 10, 2017. A pilot effort that has developed a centralized, web-based biospecimen locator that presents biospecimens collected and stored at participating Arizona hospitals and biospecimen banks, which are available for acquisition and use by researchers. Researchers may use this site to browse, search and request biospecimens to use in qualified studies. The development of the ABL was guided by the Arizona Biospecimen Consortium (ABC), a consortium of hospitals and medical centers in the Phoenix area, and is now being piloted by this Consortium under the direction of ABRC. You may browse by type (cells, fluid, molecular, tissue) or disease. Common data elements decided by the ABC Standards Committee, based on data elements on the National Cancer Institute''s (NCI''s) Common Biorepository Model (CBM), are displayed. These describe the minimum set of data elements that the NCI determined were most important for a researcher to see about a biospecimen. The ABL currently does not display information on whether or not clinical data is available to accompany the biospecimens. However, a requester has the ability to solicit clinical data in the request. Once a request is approved, the biospecimen provider will contact the requester to discuss the request (and the requester''s questions) before finalizing the invoice and shipment. The ABL is available to the public to browse. In order to request biospecimens from the ABL, the researcher will be required to submit the requested required information. Upon submission of the information, shipment of the requested biospecimen(s) will be dependent on the scientific and institutional review approval. Account required. Registration is open to everyone.. Documented on August 19,2019.The goal of The Gene Index Project is to use the available Expressed Sequence Transcript (EST) and gene sequences, along with the reference genomes wherever available, to provide an inventory of likely genes and their variants and to annotate these with information regarding the functional roles played by these genes and their products. The promise of genome projects has been a complete catalog of genes in a wide range of organisms. While genome projects have been successful in providing reference genome sequences, the problem of finding genes and their variants in genomic sequence remains an ongoing challenge. TGI has created an inventory that contains genes and their variants together with description. In addition, this resource is attempting to use these catalogs to find links between genes and pathways in different species and to provide lists of features within completed genomes that can aid in the understanding of how gene expression is regulated. DATABASES *Eukaryotic Gene Orthologues (formerly known as TOGA - TIGR Orthologous Gene Alignment): Eukaryotic Gene Orthologues (EGO) at DFGI are generated by pair-wise comparison between the Tentative Consensus (TC) sequences that comprise the Dana Farber Gene Indices from individual organisms. The reciprocal pairs of the best match were clustered into individual groups and multiple sequence alignments were displayed for each group. *GeneChip Oncology Database (GCOD):Cancer gene expression database is a collection of publicly available microarray expression data on Affymetrix GeneChip Arrays related to human cancers. Currently only datasets with available raw data (Affymetrix .CEL files) are processed. All processed datasets were subjected to extensive manual curation, uniform processing and consistent quality control. You can browse the experiments in our collection, perform statistical analysis, and download processed data; or to search gene expression profiles using Entrez gene symbol, Unigene ID, or Affymetrix probeset ID. *Gene Indices: As of July 1, 2008, there are 111 publicly available gene indices. They are separated into 4 categories for better organization and easier access. Animal: 41, Plant: 45, Protist: 15, Fungal: 10 *Genomic Maps: Human, mouse, rat, chicken, drosophila melanogaster, zebrafish, mosquito, caenorhabditis elegans, Arabidopsis thaliana, rice, yeast, fission yeast Dana-Farber Cancer Institute (DFCI) Gene Indices Software Tools: *TGI Clustering tools (TGICL): a software system for fast clustering of large EST datasets. *GICL: this package contains the scripts and all the necessary pre-compiled binaries for 32bit Linux systems. *clview: an assembly file viewer. *SeqClean:a script for automated trimming and validation of ESTs or other DNA sequences by screening for various contaminants, low quality and low-complexity sequences. *cdbfasta/cdbyank: fast indexing/retrieval of fasta records from flat file databases. *DAS/XML Genomic Viewer The Genomic viewer borrows modules from http://www.biodas.org (lstein (at) cshl.org) & http://webreference.com.
Proper citation: Gene Index Project (RRID:SCR_002148) Copy
http://mips.gsf.de/genre/proj/yeast/index.jsp
The MIPS Comprehensive Yeast Genome Database (CYGD) aims to present information on the molecular structure and functional network of the entirely sequenced, well-studied model eukaryote, the budding yeast Saccharomyces cerevisiae. In addition, the data of various projects on related yeasts are used for comparative analysis.
Proper citation: CYGD - Comprehensive Yeast Genome Database (RRID:SCR_002289) Copy
http://microbes.ucsc.edu/cgi-bin/hgGateway?db=neisMeni_MC58_1
Portal contains detailed information for Neisseria meningitidis MC58. Information include DNA molecule summary, primary annotation summary, and taxonomy. It is a tool that allows the researcher to access all of the bacterial genome sequences completed to date. Users may access information on all of the bacterial genomes or any subset of them. Information in the website about its DNA molecule includes: total number of DNA molecules, total size of all DNA molecules, number of primary annotation coding bases, and number of G + C bases. Its primary annotation summary include: total genes, protein coding genes, tRNA genes, and rRNA genes. Sponsors: The CMR was previously funded by two grants, one from the U.S. Department of Energy (DOE) and one from the National Science Foundation (NSF). It is currently partially funded by a Microbial Sequence Center (MSC) grant from the National Institute of Allergy and Infectious Diseases (NIAID)
Proper citation: Neisseria meningitidis MC58 Genome Page (RRID:SCR_002200) Copy
A comprehensive collection of experimentally determined and computationally predicted CCCTC-binding factor (CTCF) binding sites (CTCFBS) from the literature. The database is designed to facilitate the studies on insulators and their roles in demarcating functional genomic domains. The CTCFBS Prediction Tool allows users to scan sequences for the single best match to CTCF position weight matrices. Currently (March 2014), the database contains almost 15 million experimentally determined CTCF binding sites across several species. CTCF binding sites were collected from published papers containing CTCF binding sites identified using ChIPSeq or similar methods, data from the ENCODE project, and a set of approximately 100 manually curated binding sites identified by low-throughput experiments. Users can browse insulator sequence features, function annotations, genomic contexts including histone methylation profiles, flanking gene expression patterns and orthologous regions in other mammalian genomes. Users can also retrieve data by text search, sequence search and genomic range search.
Proper citation: CTCFBSDB (RRID:SCR_002279) Copy
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the dkNET Resources search. From here you can search through a compilation of resources used by dkNET and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that dkNET has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on dkNET then you can log in from here to get additional features in dkNET such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into dkNET you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the sources that were queried against in your search that you can investigate further.
Here are the categories present within dkNET that you can filter your data on
Here are the subcategories present within this category that you can filter your data on
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.