Statistical redundancy testing for improved gene selection in cancer classification using microarray data

Simin Hu, J. Sunil Rao

Research output: Contribution to journalArticle

9 Scopus citations

Abstract

In gene selection for cancer classification using microarray data, we define an eigenvalue-ratio statistic to measure a gene's contribution to the joint discriminability when this gene is included into a set of genes. Based on this eigenvalue-ratio statistic, we define a novel hypothesis testing for gene statistical redundancy and propose two gene selection methods. Simulation studies illustrate the agreement between statistical redundancy testing and gene selection methods. Real data examples show the proposed gene selection methods can select a compact gene subset which can not only be used to build high quality cancer classifiers but also show biological relevance.

Original languageEnglish (US)
Pages (from-to)29-41
Number of pages13
JournalCancer Informatics
Volume3
DOIs
StatePublished - 2007

Keywords

  • Cancer classification
  • Gene selection
  • Microarray
  • Statistical redundancy

ASJC Scopus subject areas

  • Cancer Research
  • Oncology

Fingerprint Dive into the research topics of 'Statistical redundancy testing for improved gene selection in cancer classification using microarray data'. Together they form a unique fingerprint.

  • Cite this