Empirical Bayesian LASSO-logistic regression for multiple binary trait locus mapping

Anhui Huang, Shizhong Xu, Xiaodong Cai

Research output: Contribution to journalArticle

9 Citations (Scopus)

Abstract

Background: Complex binary traits are influenced by many factors including the main effects of many quantitative trait loci (QTLs), the epistatic effects involving more than one QTLs, environmental effects and the effects of gene-environment interactions. Although a number of QTL mapping methods for binary traits have been developed, there still lacks an efficient and powerful method that can handle both main and epistatic effects of a relatively large number of possible QTLs. Results: In this paper, we use a Bayesian logistic regression model as the QTL model for binary traits that includes both main and epistatic effects. Our logistic regression model employs hierarchical priors for regression coefficients similar to the ones used in the Bayesian LASSO linear model for multiple QTL mapping for continuous traits. We develop efficient empirical Bayesian algorithms to infer the logistic regression model. Our simulation study shows that our algorithms can easily handle a QTL model with a large number of main and epistatic effects on a personal computer, and outperform five other methods examined including the LASSO, HyperLasso, BhGLM, RVM and the single-QTL mapping method based on logistic regression in terms of power of detection and false positive rate. The utility of our algorithms is also demonstrated through analysis of a real data set. A software package implementing the empirical Bayesian algorithms in this paper is freely available upon request. Conclusions: The EBLASSO logistic regression method can handle a large number of effects possibly including the main and epistatic QTL effects, environmental effects and the effects of gene-environment interactions. It will be a very useful tool for multiple QTLs mapping for complex binary traits.

Original languageEnglish
Article number5
JournalBMC Genetics
Volume14
DOIs
StatePublished - Feb 15 2013

Fingerprint

Quantitative Trait Loci
Logistic Models
Gene-Environment Interaction
Microcomputers
Linear Models
Software

Keywords

  • Bayesian shrinkage
  • Binary traits
  • Epistatic effects
  • Logistic regression
  • QTL mapping

ASJC Scopus subject areas

  • Genetics
  • Genetics(clinical)

Cite this

Empirical Bayesian LASSO-logistic regression for multiple binary trait locus mapping. / Huang, Anhui; Xu, Shizhong; Cai, Xiaodong.

In: BMC Genetics, Vol. 14, 5, 15.02.2013.

Research output: Contribution to journalArticle

@article{618d145726b84ee783a892549ac705ac,
title = "Empirical Bayesian LASSO-logistic regression for multiple binary trait locus mapping",
abstract = "Background: Complex binary traits are influenced by many factors including the main effects of many quantitative trait loci (QTLs), the epistatic effects involving more than one QTLs, environmental effects and the effects of gene-environment interactions. Although a number of QTL mapping methods for binary traits have been developed, there still lacks an efficient and powerful method that can handle both main and epistatic effects of a relatively large number of possible QTLs. Results: In this paper, we use a Bayesian logistic regression model as the QTL model for binary traits that includes both main and epistatic effects. Our logistic regression model employs hierarchical priors for regression coefficients similar to the ones used in the Bayesian LASSO linear model for multiple QTL mapping for continuous traits. We develop efficient empirical Bayesian algorithms to infer the logistic regression model. Our simulation study shows that our algorithms can easily handle a QTL model with a large number of main and epistatic effects on a personal computer, and outperform five other methods examined including the LASSO, HyperLasso, BhGLM, RVM and the single-QTL mapping method based on logistic regression in terms of power of detection and false positive rate. The utility of our algorithms is also demonstrated through analysis of a real data set. A software package implementing the empirical Bayesian algorithms in this paper is freely available upon request. Conclusions: The EBLASSO logistic regression method can handle a large number of effects possibly including the main and epistatic QTL effects, environmental effects and the effects of gene-environment interactions. It will be a very useful tool for multiple QTLs mapping for complex binary traits.",
keywords = "Bayesian shrinkage, Binary traits, Epistatic effects, Logistic regression, QTL mapping",
author = "Anhui Huang and Shizhong Xu and Xiaodong Cai",
year = "2013",
month = "2",
day = "15",
doi = "10.1186/1471-2156-14-5",
language = "English",
volume = "14",
journal = "BMC Genetics",
issn = "1471-2156",
publisher = "BioMed Central",

}

TY - JOUR

T1 - Empirical Bayesian LASSO-logistic regression for multiple binary trait locus mapping

AU - Huang, Anhui

AU - Xu, Shizhong

AU - Cai, Xiaodong

PY - 2013/2/15

Y1 - 2013/2/15

N2 - Background: Complex binary traits are influenced by many factors including the main effects of many quantitative trait loci (QTLs), the epistatic effects involving more than one QTLs, environmental effects and the effects of gene-environment interactions. Although a number of QTL mapping methods for binary traits have been developed, there still lacks an efficient and powerful method that can handle both main and epistatic effects of a relatively large number of possible QTLs. Results: In this paper, we use a Bayesian logistic regression model as the QTL model for binary traits that includes both main and epistatic effects. Our logistic regression model employs hierarchical priors for regression coefficients similar to the ones used in the Bayesian LASSO linear model for multiple QTL mapping for continuous traits. We develop efficient empirical Bayesian algorithms to infer the logistic regression model. Our simulation study shows that our algorithms can easily handle a QTL model with a large number of main and epistatic effects on a personal computer, and outperform five other methods examined including the LASSO, HyperLasso, BhGLM, RVM and the single-QTL mapping method based on logistic regression in terms of power of detection and false positive rate. The utility of our algorithms is also demonstrated through analysis of a real data set. A software package implementing the empirical Bayesian algorithms in this paper is freely available upon request. Conclusions: The EBLASSO logistic regression method can handle a large number of effects possibly including the main and epistatic QTL effects, environmental effects and the effects of gene-environment interactions. It will be a very useful tool for multiple QTLs mapping for complex binary traits.

AB - Background: Complex binary traits are influenced by many factors including the main effects of many quantitative trait loci (QTLs), the epistatic effects involving more than one QTLs, environmental effects and the effects of gene-environment interactions. Although a number of QTL mapping methods for binary traits have been developed, there still lacks an efficient and powerful method that can handle both main and epistatic effects of a relatively large number of possible QTLs. Results: In this paper, we use a Bayesian logistic regression model as the QTL model for binary traits that includes both main and epistatic effects. Our logistic regression model employs hierarchical priors for regression coefficients similar to the ones used in the Bayesian LASSO linear model for multiple QTL mapping for continuous traits. We develop efficient empirical Bayesian algorithms to infer the logistic regression model. Our simulation study shows that our algorithms can easily handle a QTL model with a large number of main and epistatic effects on a personal computer, and outperform five other methods examined including the LASSO, HyperLasso, BhGLM, RVM and the single-QTL mapping method based on logistic regression in terms of power of detection and false positive rate. The utility of our algorithms is also demonstrated through analysis of a real data set. A software package implementing the empirical Bayesian algorithms in this paper is freely available upon request. Conclusions: The EBLASSO logistic regression method can handle a large number of effects possibly including the main and epistatic QTL effects, environmental effects and the effects of gene-environment interactions. It will be a very useful tool for multiple QTLs mapping for complex binary traits.

KW - Bayesian shrinkage

KW - Binary traits

KW - Epistatic effects

KW - Logistic regression

KW - QTL mapping

UR - http://www.scopus.com/inward/record.url?scp=84873739129&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84873739129&partnerID=8YFLogxK

U2 - 10.1186/1471-2156-14-5

DO - 10.1186/1471-2156-14-5

M3 - Article

C2 - 23410082

AN - SCOPUS:84873739129

VL - 14

JO - BMC Genetics

JF - BMC Genetics

SN - 1471-2156

M1 - 5

ER -