Logo

Software

Meerkat (http://compbio.med.harvard.edu/Meerkat/)

This program identifies structural variations from whole-genome sequencing data using patterns of discordant read clusters

Nozzle: a report generation toolkit for data analysis pipelines (https://confluence.broadinstitute.org/display/GDAC/Nozzle)

Also available at CRAN: http://cran.r-project.org/web/packages/Nozzle.R1/index.html. The source code is available at https://github.com/parklab/Nozzle.

StratomeX: Visual Analysis of Large-Scale Heterogeneous Genomics Data for Cancer Subtype (http://compbio.med.harvard.edu/tcga/stratomex/)

This is a visual exploration tool for identification and characterization of clusters and correlations in genomics data

Tea (Transposable Element Analyzer) (http://compbio.med.harvard.edu/Tea/)

This is a tool for detection of retrotransposition events in whole-genome sequencing data.

BIC-seq (Bayesian Information Criteria-seq) (http://compbio.med.harvard.edu/Supplements/PNAS11.html)

This program (R code) is for identification of copy number variants in whole-genome sequencing data

Antibody Validation Database (http://compbio.med.harvard.edu/antibodies/)

This site contains antibody validation data from ENCODE, modENCODE, and Epigenome Roadmap projects. See Egelhofer et al (2011) for details

modENCODE chromatin data browser (http://compbio.med.harvard.edu/flychromatin/)

This website allows one to explore the enrichment profiles of histone marks and chromosomal proteins in the Drosophila genome.

Repeat enrichment estimator (http://compbio.med.harvard.edu/repeats/)

This tool aims to measure the enrichment of annotated repeat types in ChIP-seq data

Quantized correlation coefficient (QCC) (https://compbio.med.harvard.edu/Supplements/BMCBioinfo10.html)

This R package computes a robust measurement of the reproducibility of ChIP-chip data.

ChIP-seq analysis (SPP) (http://compbio.med.harvard.edu/Supplements/ChIP-seq)

This R package by Peter Kharchenko implements tools for analysis of sequencing data from chromatin immunoprecipitation experiments. It includes normalization of the binding profile, detection of enriched regions, and an estimate of read depth needed to achieve saturation of binding sites. See Kharchenko et al, Nature Biotechnology (2008) for details.

CGHweb (http://compbio.med.harvard.edu/CGHweb)

This tool provides an interface to apply several popular algorithms to segment a copy-number profile from CGH (comparative genomic hybridization) data. It generates a heatmap panel of the segmented profiles for each method as well as a consensus profile. The clickable heatmap can be moved along the chromosome and zoomed in or out. It also displays the time that each algorithm took and provides numerical values of the segmented profiles for download.

nuScore (http://compbio.med.harvard.edu/nuScore)

This allows estimation of the affinity of the histone core to DNA and prediction of nucleosome arrangement on a given sequence. The algorithm is based on assessment of the energy cost of imposing the deformations required to wrap DNA around the histone surface

ChIP-chip normalization

The R package is available here (v1.0.1). Here is the instruction sheet (part of the package).

CrossChip (http://www.crosschip.org)

This tool is for integrating data from multiple generations of Affymetrix GeneChips. Matching probes based on Affymetrix "best-match" is inadequate for most analyses. This tool allows the user to derive a list of similar probes based on the user-specified criteria on probe sequence similarity and the minimum number of probe pairs needed for each probe set.

sigPathway (for finding significant pathways from microarray data)

For the R package, please see the Supplementary Material page for the Tian et al article (Proc Natl Acad Sci, 2005). A web interface will be available shortly.