Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.
https://data.broadinstitute.org/alkesgroup/Eagle/
Software package for statistical estimation of haplotype phase either within a genotyped cohort or using a phased reference panel in large scale sequencing. The package includes Eagle1 (to harness identity-by-descent among distant relatives to rapidly call phase using a fast scoring approach) and Eagle2 (to analyze a full probabilistic model similar to the diploid Li-Stephens model used by previous HMM-based methods.
Proper citation: Eagle (RRID:SCR_015991) Copy
A manually curated database of both known and predicted metabolic pathways for the laboratory mouse. It has been integrated with genetic and genomic data for the laboratory mouse available from the Mouse Genome Informatics database and with pathway data from other organisms, including human. The database records for 1,060 genes in Mouse Genome Informatics (MGI) are linked directly to 294 pathways with 1,790 compounds and 1,122 enzymatic reactions in MouseCyc. (Aug. 2013) BLAST and other tools are available. The initial focus for the development of MouseCyc is on metabolism and includes such cell level processes as biosynthesis, degradation, energy production, and detoxification. MouseCyc differs from existing pathway databases and software tools because of the extent to which the pathway information in MouseCyc is integrated with the wealth of biological knowledge for the laboratory mouse that is available from the Mouse Genome Informatics (MGI) database.
Proper citation: MouseCyc (RRID:SCR_001791) Copy
https://github.com/hms-dbmi/spp
R analysis and processing package for Illumina platform Chip-Seq data.
Proper citation: SPP (RRID:SCR_001790) Copy
BioPerl is a community effort to produce Perl code which is useful in biology. This toolkit of perl modules is useful in building bioinformatics solutions in Perl. It is built in an object-oriented manner so that many modules depend on each other to achieve a task. The collection of modules in the bioperl-live repository consist of the core of the functionality of bioperl. Additionally auxiliary modules for creating graphical interfaces (bioperl-gui), persistent storage in RDMBS (bioperl-db), running and parsing the results from hundreds of bioinformatics applications (Run package), software to automate bioinformatic analyses (bioperl-pipeline) are all available as Git modules in our repository. The BioPerl toolkit provides a library of hundreds of routines for processing sequence, annotation, alignment, and sequence analysis reports. It often serves as a bridge between different computational biology applications assisting the user to construct analysis pipelines. This chapter illustrates how BioPerl facilitates tasks such as writing scripts summarizing information from BLAST reports or extracting key annotation details from a GenBank sequence record. BioPerl includes modules written by Sohel Merchant of the GO Consortium for parsing and manipulating OBO ontologies. Platform: Windows compatible, Mac OS X compatible, Linux compatible, Unix compatible
Proper citation: BioPerl (RRID:SCR_002989) Copy
Organization that provides biomedical researchers with online tools and a web portal enabling them to access, review, and integrate disparate ontological resources in all aspects of biomedical investigation and clinical practice. A major focus of the work involves the use of biomedical ontologies to aid in the management and analysis of data derived from complex experiments.
Proper citation: National Center for Biomedical Ontology (RRID:SCR_003304) Copy
http://mged.sourceforge.net/ontologies/MGEDontology.php
An ontology including concepts, definitions, terms, and resources for a standardized description of a microarray experiment in support of MAGE v.1. The MGED ontology is divided into the MGED Core ontology which is intended to be stable and in synch with MAGE v.1; and the MGED Extended ontology which adds further associations and classes not found in MAGE v.1. These terms will enable structure queries of elements of the experiments. Furthermore, the terms will also enable unambiguous descriptions of how the experiment was performed.
Proper citation: MGED Ontology (RRID:SCR_004484) Copy
A collaborative ontology for the definition of sequence features used in biological sequence annotation. SO was initially developed by the Gene Ontology Consortium. Contributors to SO include the GMOD community, model organism database groups such as WormBase, FlyBase, Mouse Genome Informatics group, and institutes such as the Sanger Institute and the EBI. Input to SO is welcomed from the sequence annotation community. The OBO revision is available here: http://sourceforge.net/p/song/svn/HEAD/tree/ SO includes different kinds of features which can be located on the sequence. Biological features are those which are defined by their disposition to be involved in a biological process. Biomaterial features are those which are intended for use in an experiment such as aptamer and PCR_product. There are also experimental features which are the result of an experiment. SO also provides a rich set of attributes to describe these features such as polycistronic and maternally imprinted. The Sequence Ontologies use the OBO flat file format specification version 1.2, developed by the Gene Ontology Consortium. The ontology is also available in OWL from Open Biomedical Ontologies. This is updated nightly and may be slightly out of sync with the current obo file. An OWL version of the ontology is also available. The resolvable URI for the current version of SO is http://purl.obolibrary.org/obo/so.owl.
Proper citation: SO (RRID:SCR_004374) Copy
http://bix.ucsd.edu/projects/singlecell/
Software package for short read data from single cells that improves assembly through use of progressively increasing coverage cutoff. Used for single cell Illumina sequences, allows variable coverage datasets to be utilized with assembly of E. coli and S. aureus single cell reads. Assembles single cell genome of uncultivated SAR324 clade of Deltaproteobacteria.
Proper citation: Velvet-SC (RRID:SCR_004377) Copy
THIS RESOURCE IS NO LONGER IN SERVICE. Documented on December 17, 2021. Database to store, annotate, view, analyze and share microarray data. It provides registered users access to their own data, provides users access to public data, and tools with which to analyze those data, to any public user anywhere in the world. The GenePattern software package has been incorporated directly into SMD, providing access to many new analysis tools, as well as a plug-in architecture that allows users to directly integrate and share additional tools through SMD. This extension is available with the SMD source code that is fully and freely available to others under an Open Source license, enabling other groups to create a local installation of SMD with an enriched data analysis capability. SMD search options allow the user to Search By Experiments, Search By Datasets, or Search By Gene Names. Web services are provided using common standards, such as Simple Object Access Protocol (SOAP). This enables both local and remote researchers to connect to an installation of the database and retrieve data using pre-defined methods, without needing to resort to use of a web browser.
Proper citation: SMD (RRID:SCR_004987) Copy
http://ccb.jhu.edu/software/FLASH/
Open source software tool to merge paired-end reads from next-generation sequencing experiments. Designed to merge pairs of reads when original DNA fragments are shorter than twice length of reads. Can improve genome assemblies and transcriptome assembly by merging RNA-seq data.
Proper citation: FLASH (RRID:SCR_005531) Copy
https://github.com/macs3-project/MACS
Software Python package for identifying transcript factor binding sites. Used to evaluate significance of enriched ChIP regions. Improves spatial resolution of binding sites through combining information of both sequencing tag position and orientation. Can be used for ChIP-Seq data alone, or with control sample with increase of specificity.
Proper citation: MACS (RRID:SCR_013291) Copy
The E. coli Genome Project has the goal of completely sequencing the E. coli and human genomes. They began isolation of an overlapping lambda clonebank of E. coli K-12 strain MG1655. Those clones served as the starting material in our initial efforts to sequence the whole genome. Improvements in sequencing technology have since reached the point where whole-genome sequencing of microbial genomes is routine, and the human genome has in fact been completed. They initiated additional sequencing efforts, concentrating on pathogenic members of the family Enterobacteriaceae -- to which E. coli belongs. They also began a systematic functional characterization of E. coli K-12 genes and their regulation, using the whole genome sequence to address how the over 4000 genes of this organism act together to enable its survival in a wide range of environments.
Proper citation: E. coli Genome project (RRID:SCR_008139) Copy
http://www.hgsc.bcm.tmc.edu/content/bovine-genome-project
Downloadable files of the bos taurus genome. Draft assemblies available for download as contigs or linearized scaffolds of the genomic sequence of cow, Bos taurus, including the final draft assembly (7.1 coverage) and the two previous assemblies. The genome is sequenced to 6- to 8-fold sequence depth, with high-quality finished sequence in some areas. Accompanying EST and SNP analyses is also included. The bovine genome assembly and analysis and the study of cattle genetic history were published in April 24, 2009 issue of Science. The Human Genome Sequencing Center provides BLAST searches of the genome assemblies, either as contigs or as linearized chromosome sequences. The WGS sequence enriched BAC assemblies and the unassembled reads (sequencing reads that did not end up in the genome assembly) can also be searched by BLAST. Traces are available from the NCBI Trace Archive by using the link in the sidebar or by using NCBI MegaBLAST with a same species or cross species query.
Proper citation: Bovine Genome Project (RRID:SCR_008370) Copy
http://trans.nih.gov/bmap/index.htm
The Brain Molecular Anatomy Project is a trans-NIH project aimed at understanding gene expression and function in the nervous system. BMAP has two major scientific goals: # Gene discovery: to catalog of all the genes expressed in the nervous system, under both normal and abnormal conditions. # Gene expression analysis: to monitor gene expression patterns in the nervous system as a function of cell type, anatomical location, developmental stage, and physiological state, and thus gain insight into gene function. In pursuit of these goals, BMAP has launched several initiatives to provide resources and funding opportunities for the scientific community. These include several Requests for Applications and Requests for Proposals, descriptions of which can be found in this Web site. BMAP is also in the process of establishing physical and electronic resources for the community, including repositories of cDNA clones for nervous system genes, and databases of gene expression information for the nervous system. Most of the BMAP initiatives so far have focused on the mouse as a model species because of the ease of experimental and genetic manipulation of this organism, and because many models of human disease are available in the mouse. However, research in humans, other mammalian species, non-mammalian vertebrates, and invertebrates is also being funded through BMAP. For the convenience of interested investigators, we have established this Web site as a central information resource, focusing on major NIH-sponsored funding opportunities, initiatives, genomic resources available to the research community, courses and scientific meetings related to BMAP initiatives, and selected reports and publications. When appropriate, we will also post initiatives not directly sponsored by BMAP, but which are deemed relevant to its goals. Posting decisions are made by the Trans-NIH BMAP Committee
Proper citation: BMAP - Brain Molecular Anatomy Project (RRID:SCR_008852) Copy
A software tool which predicts whether an amino acid substitution or indel has an impact on the biological function of a protein.
Proper citation: PROVEAN (RRID:SCR_002182) Copy
Only worldwide authority that provides standardized nomenclature, i.e. gene names and symbols (short form abbreviations), for all known human genes, and stores all approved symbols in the HGNC database. Approved human gene nomenclature. Database of gene symbols and names. Manually curated genes into groups based on shared characteristics such as homology, function or phenotype. Data for protein-coding genes, pseudogenes and non-coding RNAs.
Proper citation: HGNC (RRID:SCR_002827) Copy
Central data repository for nematode biology including complete genomic sequence, gene predictions and orthology assignments from range of related nematodes.Data concerning genetics, genomics and biology of C. elegans and related nematodes. Derived from initial ACeDB database of C. elegans genetic and sequence information, WormBase includes genomic, anatomical and functional information of C. elegans, other Caenorhabditis species and other nematodes. Maintains public FTP site where researchers can find many commonly requested files and datasets, WormBase software and prepackaged databases.
Proper citation: WormBase (RRID:SCR_003098) Copy
http://pubsearch.stanford.edu/
THIS RESOURCE IS NO LONGER IN SERVCE, documented September 2, 2016. PubSearch is a web-based literature curation tool, allowing curators to search and annotate genes to keywords from articles. It has a simple mySQL database backend and uses a set of Java Servlets and JSPs for querying, modifying, and adding gene, gene-annotation, and literature information. PubSearch can be downloaded from GMOD. Platform: Online tool, Windows compatible, Mac OS X compatible, Linux compatible, Unix compatible
Proper citation: PubSearch (RRID:SCR_005830) Copy
http://oligogenome.stanford.edu/
The Stanford Human OligoGenome Project hosts a database of capture oligonucleotides for conducting high-throughput targeted resequencing of the human genome. This set of capture oligonucleotides covers over 92% of the human genome for build 37 / hg19 and over 99% of the coding regions defined by the Consensus Coding Sequence (CCDS). The capture reaction uses a highly multiplexed approach for selectively circularizing and capturing multiple genomic regions using the in-solution method developed in Natsoulis et al, PLoS One 2011. Combined pools of capture oligonucleotides selectively circularize the genomic DNA target, followed by specific PCR amplification of regions of interest using a universal primer pair common to all of the capture oligonucleotides. Unlike multiplexed PCR methods, selective genomic circularization is capable of efficiently amplifying hundreds of genomic regions simultaneously in multiplex without requiring extensive PCR optimization or producing unwanted side reaction products. Benefits of the selective genomic circularization method are the relative robustness of the technique and low costs of synthesizing standard capture oligonucleotide for selecting genomic targets.
Proper citation: OligoGenome (RRID:SCR_006025) Copy
A comprehensive encyclopedia of genomic functional elements in the model organisms C. elegans and D. melanogaster. modENCODE is run as a Research Network and the consortium is formed by 11 primary projects, divided between worm and fly, spanning the domains of gene structure, mRNA and ncRNA expression profiling, transcription factor binding sites, histone modifications and replacement, chromatin structure, DNA replication initiation and timing, and copy number variation. The raw and interpreted data from this project is vetted by a data coordinating center (DCC) to ensure consistency and completeness. The entire modENCODE data corpus is now available on the Amazon Web Services EC2 cloud. What this means is that virtual machines and virtual compute clusters that you run within the EC2 cloud can mount the modENCODE data set in whole or in part. Your software can run analyses against the data files directly without experiencing the long waits and logistics associated with copying the datasets over to your local hardware. You may also view the data using GBrowse, Dataset Search, or download the data via FTP, as well as download pre-release datasets.
Proper citation: modENCODE (RRID:SCR_006206) Copy
Can't find your Tool?
We recommend that you click next to the search bar to check some helpful tips on searches and refine your search firstly. Alternatively, please register your tool with the SciCrunch Registry by adding a little information to a web form, logging in will enable users to create a provisional RRID, but it not required to submit.
Welcome to the dkNET Resources search. From here you can search through a compilation of resources used by dkNET and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that dkNET has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on dkNET then you can log in from here to get additional features in dkNET such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into dkNET you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the sources that were queried against in your search that you can investigate further.
Here are the categories present within dkNET that you can filter your data on
Here are the subcategories present within this category that you can filter your data on
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.