Statistical expression deconvolution from mixed tissue samples

Jennifer Clarke, Pearl Seol, Bertrand Clarke

Research output: Contribution to journal › Article › peer-review

52 Scopus citations


Motivation: Global expression patterns within cells are used for purposes ranging from the identification of disease biomarkers to basic understanding of cellular processes. Unfortunately, tissue samples used in cancer studies are usually composed of multiple cell types, and the non-cancerous portions can significantly affect expression profiles. This severely limits the conclusions that can be drawn about the specificity of gene expression in the cell type of interest. However, statistical analysis can be used to identify differentially expressed genes that are related to the biological question being studied.

Results: We propose a statistical approach to expression deconvolution from mixed tissue samples in which the proportion of each component cell type is unknown. Our method estimates the proportion of each component in a mixed tissue sample; this estimate can then be used to provide estimates of gene expression from each component. We demonstrate our technique on xenograft samples from breast cancer research and on publicly available experimental datasets found in the National Center for Biotechnology Information Gene Expression Omnibus repository.

Availability: R code for estimating sample proportions is freely available to non-commercial users.
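
The second step mentioned in the abstract, going from proportion estimates to per-component expression, can be illustrated with a minimal sketch. This is not the authors' method or their R code: it assumes a simple linear mixing model with exactly two components, treats the proportions as already known, and uses ordinary least squares for a single gene. The function name `component_expression` and all numbers are hypothetical.

```python
def component_expression(mixed, proportions):
    """Least-squares estimate of one gene's expression in two components.

    Assumed model (an illustration, not taken from the paper):
        mixed[i] = p[i] * a + (1 - p[i]) * b
    where p[i] is the fraction of component A in sample i, and a, b are
    the gene's expression levels in pure component A and pure component B.
    """
    n = len(mixed)
    if n != len(proportions) or n < 2:
        raise ValueError("need at least two samples with matching proportions")

    mean_p = sum(proportions) / n
    mean_y = sum(mixed) / n

    # The model is a straight line in p: mixed = b + (a - b) * p.
    sxx = sum((p - mean_p) ** 2 for p in proportions)
    if sxx == 0:
        raise ValueError("proportions must vary across samples")
    sxy = sum((p - mean_p) * (y - mean_y) for p, y in zip(proportions, mixed))

    slope = sxy / sxx            # estimates a - b
    b = mean_y - slope * mean_p  # intercept: expression at p = 0
    a = b + slope                # expression at p = 1
    return a, b


# Synthetic, noise-free example with made-up values.
proportions = [0.2, 0.5, 0.8]
mixed = [p * 10.0 + (1 - p) * 4.0 for p in proportions]
a, b = component_expression(mixed, proportions)
print(round(a, 6), round(b, 6))
```

With real data the proportions would themselves be estimates, so their uncertainty carries over into the component-level values; the sketch ignores this.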

Original language: English (US)
Article number: btq097
Pages (from-to): 1043-1049
Number of pages: 7
Issue number: 8
State: Published - Mar 4 2010

ASJC Scopus subject areas

  • Statistics and Probability
  • Biochemistry
  • Molecular Biology
  • Computer Science Applications
  • Computational Theory and Mathematics
  • Computational Mathematics

