Warehousing re-annotated cancer genes for biomarker meta-analysis

M. Orsini, A. Travaglione, E. Capobianco

Research output: Contribution to journalArticlepeer-review

3 Scopus citations

Abstract

Translational research in cancer genomics assigns a fundamental role to bioinformatics in support of candidate gene prioritization with regard to both biomarker discovery and target identification for drug development. Efforts in both such directions rely on the existence and constant update of large repositories of gene expression data and omics records obtained from a variety of experiments. Users who interactively interrogate such repositories may have problems in retrieving sample fields that present limited associated information, due for instance to incomplete entries or sometimes unusable files. Cancer-specific data sources present similar problems. Given that source integration usually improves data quality, one of the objectives is keeping the computational complexity sufficiently low to allow an optimal assimilation and mining of all the information. In particular, the scope of integrating intraomics data can be to improve the exploration of gene co-expression landscapes, while the scope of integrating interomics sources can be that of establishing genotype-phenotype associations. Both integrations are relevant to cancer biomarker meta-analysis, as the proposed study demonstrates. Our approach is based on re-annotating cancer-specific data available at the EBI's ArrayExpress repository and building a data warehouse aimed to biomarker discovery and validation studies. Cancer genes are organized by tissue with biomedical and clinical evidences combined to increase reproducibility and consistency of results. For better comparative evaluation, multiple queries have been designed to efficiently address all types of experiments and platforms, and allow for retrieval of sample-related information, such as cell line, disease state and clinical aspects.

Original languageEnglish (US)
Pages (from-to)166-180
Number of pages15
JournalComputer Methods and Programs in Biomedicine
Volume111
Issue number1
DOIs
StatePublished - Jul 2013

Keywords

  • Cancer transcriptomics
  • Data warehousing
  • Re-annotation

ASJC Scopus subject areas

  • Software
  • Computer Science Applications
  • Health Informatics

Fingerprint

Dive into the research topics of 'Warehousing re-annotated cancer genes for biomarker meta-analysis'. Together they form a unique fingerprint.

Cite this