Logo

Projects

We are interested in computational analysis of genomic data to answer important questions in epigenetics and cancer.

Analysis of data from next-generation sequencing platforms - The new DNA sequencing platforms are producing a large amount of data. We are very excited about this technology and are interested in various analytical issues related to this type of data. We specialize in protein-DNA interactions as measured by chromatin immunoprecipitation followed by sequencing (ChIP-seq) but we also deal with RNA-seq, nucleosome profiling, and other data types.

Statistical methods for integrative analysis and their applications - We are interested in development of statistical methods for combining multiple types of genomic data. These include meta-analysis of expression and copy number data in public databases.

Genome-wide mapping of chromosomal proteins in Drosophila (Model organism ENCODE Consortium) - How do the chromatin modification patterns look in different states and what do they mean? What is the relationship between gene expression patterns and chromatin structure? We are part of the national consortium that aim to identify all sequence-based functional elements in the c. elegans and drosophila melanogaster genomes. Our specific project is in mapping of histone modification and chromosomal proteins in multiple tissues and stages. We are also involved in the integrative analysis of the entire modENCODE data.

Cancer genomics (Cancer Genome Atlas Project) - We are part of a comprehensive and coordinated effort to characterize the cancer genome through the application of a wide range of the latest genome analysis technologies. The data types include gene expression, DNA copy number, microRNA expression, DNA methylation, SNP, and genome sequencing for mutations. We are particularly interested in DNA copy number and their relationships to other data types.

Mechanism of dosage compensation by the MSL complex in Drosophila (with Mitzi Kuroda Lab) - How does the male fly up-regulate gene expression from the single X chromosome to achieve similar level of expression as in females (XX) ? More generally, what is the mechanism for the targeting and regulation of active chromatin domains? We are using expression arrays and tiling arrays to understand the binding properties of the Male Specific Lethal (MSL) complex.

Epigenomic states in differentiation (with Bob Kingston Lab and David Scadden Lab) - We are mapping nucleosome positions and histone modifications in various stages of development in model systems.

Characterizing expression and genomic DNA signatures in brain tumors (with Mark Johnson Lab) - What are the gene that characterize different types of these tumors? What are the differences between Glioblastoma Multiforme and low grade gliomas, or between subtypes of low-grades? We are identifying such signatures both in expression data and in array CGH data.

Informatics for Integrating Biology and the Bedside (i2b2) - A part of a National Center for Biomedical Computing, this project aims to translate the advances in genomics to patient care. We are interested in the development of informatics framework and statistical methodology that will help bridge the gap between genomic technologies and clinical research.

Systems-based consortium for organ design and engineering (SysCODE) - This interdisciplinary consortium brings the experts in organogenesis, gene regulatory networks, progenitor cells and tissue engineering to develop a systematic framework for synthesizing organs in vitro. We are contributing our expertise in computational analysis of genomic data.