Applying data mining techniques to the mapping of complex disease genes

W. A. Czika, B. S. Weir, S. R. Edwards, R. W. Thompson, D. M. Nielsen, J. C. Brocklebank, C. Zinkus, Eden R Martin, K. E. Hobler

Research output: Contribution to journalArticle

8 Citations (Scopus)

Abstract

The simulated sequence data for the Genetic Analysis Workshop 12 were analyzed using data mining techniques provided by SAS ENTERPRISE MINER™ Release 4.0 in addition to traditional statistical tests for linkage and association of genetic markers with disease status. We examined two ways of combining these approaches to make use of the covariate data along with the genotypic data. The result of incorporating data mining techniques with more classical methods is an improvement in the analysis, both by correctly classifying the affection status of more individuals and by locating more single nucleotide polymorphisms related to the disease, relative to analyses that use classical methods alone.

Original languageEnglish
JournalGenetic Epidemiology
Volume21
Issue numberSUPPL. 1
StatePublished - Oct 23 2001
Externally publishedYes

Fingerprint

Data Mining
Genetic Markers
Genes
Single Nucleotide Polymorphism
Education

Keywords

  • Association tests
  • Data mining
  • Decision trees
  • Logistic regression
  • RC-TDT

ASJC Scopus subject areas

  • Genetics(clinical)
  • Epidemiology

Cite this

Czika, W. A., Weir, B. S., Edwards, S. R., Thompson, R. W., Nielsen, D. M., Brocklebank, J. C., ... Hobler, K. E. (2001). Applying data mining techniques to the mapping of complex disease genes. Genetic Epidemiology, 21(SUPPL. 1).

Applying data mining techniques to the mapping of complex disease genes. / Czika, W. A.; Weir, B. S.; Edwards, S. R.; Thompson, R. W.; Nielsen, D. M.; Brocklebank, J. C.; Zinkus, C.; Martin, Eden R; Hobler, K. E.

In: Genetic Epidemiology, Vol. 21, No. SUPPL. 1, 23.10.2001.

Research output: Contribution to journalArticle

Czika, WA, Weir, BS, Edwards, SR, Thompson, RW, Nielsen, DM, Brocklebank, JC, Zinkus, C, Martin, ER & Hobler, KE 2001, 'Applying data mining techniques to the mapping of complex disease genes', Genetic Epidemiology, vol. 21, no. SUPPL. 1.
Czika WA, Weir BS, Edwards SR, Thompson RW, Nielsen DM, Brocklebank JC et al. Applying data mining techniques to the mapping of complex disease genes. Genetic Epidemiology. 2001 Oct 23;21(SUPPL. 1).
Czika, W. A. ; Weir, B. S. ; Edwards, S. R. ; Thompson, R. W. ; Nielsen, D. M. ; Brocklebank, J. C. ; Zinkus, C. ; Martin, Eden R ; Hobler, K. E. / Applying data mining techniques to the mapping of complex disease genes. In: Genetic Epidemiology. 2001 ; Vol. 21, No. SUPPL. 1.
@article{2bc04b571f7543bf973b74a4cd04ee77,
title = "Applying data mining techniques to the mapping of complex disease genes",
abstract = "The simulated sequence data for the Genetic Analysis Workshop 12 were analyzed using data mining techniques provided by SAS ENTERPRISE MINER™ Release 4.0 in addition to traditional statistical tests for linkage and association of genetic markers with disease status. We examined two ways of combining these approaches to make use of the covariate data along with the genotypic data. The result of incorporating data mining techniques with more classical methods is an improvement in the analysis, both by correctly classifying the affection status of more individuals and by locating more single nucleotide polymorphisms related to the disease, relative to analyses that use classical methods alone.",
keywords = "Association tests, Data mining, Decision trees, Logistic regression, RC-TDT",
author = "Czika, {W. A.} and Weir, {B. S.} and Edwards, {S. R.} and Thompson, {R. W.} and Nielsen, {D. M.} and Brocklebank, {J. C.} and C. Zinkus and Martin, {Eden R} and Hobler, {K. E.}",
year = "2001",
month = "10",
day = "23",
language = "English",
volume = "21",
journal = "Genetic Epidemiology",
issn = "0741-0395",
publisher = "Wiley-Liss Inc.",
number = "SUPPL. 1",

}

TY - JOUR

T1 - Applying data mining techniques to the mapping of complex disease genes

AU - Czika, W. A.

AU - Weir, B. S.

AU - Edwards, S. R.

AU - Thompson, R. W.

AU - Nielsen, D. M.

AU - Brocklebank, J. C.

AU - Zinkus, C.

AU - Martin, Eden R

AU - Hobler, K. E.

PY - 2001/10/23

Y1 - 2001/10/23

N2 - The simulated sequence data for the Genetic Analysis Workshop 12 were analyzed using data mining techniques provided by SAS ENTERPRISE MINER™ Release 4.0 in addition to traditional statistical tests for linkage and association of genetic markers with disease status. We examined two ways of combining these approaches to make use of the covariate data along with the genotypic data. The result of incorporating data mining techniques with more classical methods is an improvement in the analysis, both by correctly classifying the affection status of more individuals and by locating more single nucleotide polymorphisms related to the disease, relative to analyses that use classical methods alone.

AB - The simulated sequence data for the Genetic Analysis Workshop 12 were analyzed using data mining techniques provided by SAS ENTERPRISE MINER™ Release 4.0 in addition to traditional statistical tests for linkage and association of genetic markers with disease status. We examined two ways of combining these approaches to make use of the covariate data along with the genotypic data. The result of incorporating data mining techniques with more classical methods is an improvement in the analysis, both by correctly classifying the affection status of more individuals and by locating more single nucleotide polymorphisms related to the disease, relative to analyses that use classical methods alone.

KW - Association tests

KW - Data mining

KW - Decision trees

KW - Logistic regression

KW - RC-TDT

UR - http://www.scopus.com/inward/record.url?scp=0034798335&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0034798335&partnerID=8YFLogxK

M3 - Article

VL - 21

JO - Genetic Epidemiology

JF - Genetic Epidemiology

SN - 0741-0395

IS - SUPPL. 1

ER -