首页 | 本学科首页   官方微博 | 高级检索  
检索        


An enhanced machine learning tool for cis-eQTL mapping with regularization and confounder adjustments
Authors:Kang K Yan  Hongyu Zhao  Joseph T Wu  Herbert Pang
Institution:1. School of Public Health, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, China;2. Department of Biostatistics, Yale University, New Haven, Connecticut
Abstract:Many expression quantitative trait loci (eQTL) studies have been conducted to investigate the biological effects of variants in gene regulation. However, these eQTL studies may suffer from low or moderate statistical power and overly conservative false-discovery rate. In practice, most algorithms for eQTL identification do not model the joint effects of multiple genetic variants with weak or moderate influence. Here we present a novel machine-learning algorithm, lasso least-squares kernel machine (LSKM-LASSO) that model the association between multiple genetic variants and phenotypic traits simultaneously with the existence of nongenetic and genetic confounding. With a more general and flexible framework for the estimation of genetic confounding, LSKM-LASSO is able to provide a more accurate evaluation of the joint effects of multiple genetic variants. Our simulations demonstrate that our approach outperforms three state-of-the-art alternatives in terms of eQTL identification and phenotype prediction. We then apply our method to genotype and gene expression data of 11 tissues obtained from the Genotype-Tissue Expression project. Our algorithm was able to identify more genes with eQTL than other algorithms. By incorporating a regularization term and combining it with least-squares kernel machine, LSKM-LASSO provides a powerful tool for eQTL mapping and phenotype prediction.
Keywords:cis-eQTL mapping  gene expression  least-squares kernel machine  multiple variants  penalized  population structure
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号