Journal of Zhejiang University SCIENCE B

ISSN 1673-1581(Print), 1862-1783(Online), Monthly

An ensemble-based likelihood ratio approach for family-based genomic risk prediction

Abstract: Objective: Family-based design is one of the most popular designs in genetic research and is well recognized for its advantages, such as robustness against population stratification and admixture. With vast amounts of genetic data collected from family-based studies, there is great interest in studying the role of genetic markers in risk prediction. This study aims to develop a new statistical approach for family-based risk prediction analysis that improves prediction accuracy over existing methods based on family history. Methods: We propose an ensemble-based likelihood ratio (ELR) approach, Fam-ELR, for family-based genomic risk prediction. Fam-ELR incorporates a clustered receiver operating characteristic (ROC) curve method to account for correlations among family samples, and uses a computationally efficient tree-assembling procedure for variable selection and model building. Results: In simulations, Fam-ELR is robust across various underlying disease models and pedigree structures, and outperforms two existing family-based risk prediction methods. In a real-data application to a family-based genome-wide dataset of conduct disorder, Fam-ELR demonstrates its ability to integrate potential risk predictors and interactions into the model for improved accuracy, especially at the genome-wide level. Conclusions: Compared with existing approaches, such as the genetic risk-score approach, Fam-ELR can incorporate genetic variants with small or moderate marginal effects, and their interactions, into an improved risk prediction model. Therefore, it is a robust and useful approach for high-dimensional family-based risk prediction, especially for complex diseases with unknown or poorly understood etiology.

Key words: Family-based study; Genetic risk prediction; High-dimensional data
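
The likelihood ratio idea named in the abstract can be illustrated with a minimal sketch. It is not the Fam-ELR implementation: it scores a single marker, uses made-up toy data, and treats all individuals as independent, so it omits both the clustered ROC adjustment for family correlation and the tree-assembling step. The function names `likelihood_ratios` and `auc`, the pseudocount, and the data are assumptions introduced for illustration only.

```python
from collections import Counter


def likelihood_ratios(genotypes, labels, pseudo=0.5):
    """Estimate LR(g) = P(g | case) / P(g | control) for each genotype level.

    A pseudocount (an assumption of this sketch) avoids division by zero
    when a genotype is absent among cases or controls.
    """
    cases = Counter(g for g, y in zip(genotypes, labels) if y == 1)
    controls = Counter(g for g, y in zip(genotypes, labels) if y == 0)
    levels = sorted(set(genotypes))
    n_case = sum(cases.values()) + pseudo * len(levels)
    n_ctrl = sum(controls.values()) + pseudo * len(levels)
    return {
        g: ((cases[g] + pseudo) / n_case) / ((controls[g] + pseudo) / n_ctrl)
        for g in levels
    }


def auc(scores, labels):
    """Area under the ROC curve as the Mann-Whitney statistic (ties count 0.5).

    This is the ordinary AUC for independent samples, not a clustered AUC.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: minor-allele counts (0, 1, 2) and disease status.
genotypes = [0, 0, 0, 1, 1, 2, 0, 1, 2, 2]
labels = [0, 0, 0, 0, 1, 1, 1, 0, 1, 1]

lr = likelihood_ratios(genotypes, labels)
scores = [lr[g] for g in genotypes]  # each person is ranked by their LR
toy_auc = auc(scores, labels)
print(lr, toy_auc)
```

Ranking individuals by an estimated likelihood ratio is the standard way to turn genotype frequencies into a risk score that can be judged by its ROC curve. With related individuals the scores are correlated within families, which is the situation the clustered ROC method in the abstract is meant to address.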

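Chinese Summary: A study of genomic risk prediction for disease using an ensemble-based likelihood ratio algorithm for family-based data

Key words (Chinese summary): Family-based study; Genetic risk prediction; High-dimensional data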


DOI: 10.1631/jzus.B1800162

CLC number: Q39

On-line Access: 2018-12-03

Received: 2018-03-14

Revision Accepted: 2018-07-12

Crosschecked: 2018-11-08

Journal of Zhejiang University-SCIENCE, 38 Zheda Road, Hangzhou 310027, China
Tel: +86-571-87952276; Fax: +86-571-87952331; E-mail: jzus@zju.edu.cn
Copyright © 2000~ Journal of Zhejiang University-SCIENCE