首页 | 本学科首页   官方微博 | 高级检索  
检索        


Prioritizing individual genetic variants after kernel machine testing using variable selection
Authors:Qianchuan He  Tianxi Cai  Yang Liu  Ni Zhao  Quaker E Harmon  Lynn M Almli  Elisabeth B Binder  Stephanie M Engel  Kerry J Ressler  Karen N Conneely  Xihong Lin  Michael C Wu
Institution:1. Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America;2. Department of Biostatistics, Harvard School of Public Health, Boston, Massachusetts, United States of America;3. Epidemiology Branch, NIEHS, Research Triangle Park, United States of America;4. Department of Psychiatry and Behavioral Sciences, Emory University School of Medicine, Atlanta, Georgia, United States of America;5. Department of Translational Research in Psychiatry, Max‐Planck Institute of Psychiatry, Munich, Germany;6. Department of Epidemiology, University of North Carolina, Chapel Hill, North Carolina, United States of America;7. Division of Depression & Anxiety Disorders, McLean Hospital, Belmont, Massachusetts, United States of America;8. Department of Human Genetics, Emory University School of Medicine, Atlanta, Georgia, United States of America
Abstract:Kernel machine learning methods, such as the SNP‐set kernel association test (SKAT), have been widely used to test associations between traits and genetic polymorphisms. In contrast to traditional single‐SNP analysis methods, these methods are designed to examine the joint effect of a set of related SNPs (such as a group of SNPs within a gene or a pathway) and are able to identify sets of SNPs that are associated with the trait of interest. However, as with many multi‐SNP testing approaches, kernel machine testing can draw conclusion only at the SNP‐set level, and does not directly inform on which one(s) of the identified SNP set is actually driving the associations. A recently proposed procedure, KerNel Iterative Feature Extraction (KNIFE), provides a general framework for incorporating variable selection into kernel machine methods. In this article, we focus on quantitative traits and relatively common SNPs, and adapt the KNIFE procedure to genetic association studies and propose an approach to identify driver SNPs after the application of SKAT to gene set analysis. Our approach accommodates several kernels that are widely used in SNP analysis, such as the linear kernel and the Identity by State (IBS) kernel. The proposed approach provides practically useful utilities to prioritize SNPs, and fills the gap between SNP set analysis and biological functional studies. Both simulation studies and real data application are used to demonstrate the proposed approach.
Keywords:genetic association studies  kernel machine methods  KNIFE  set‐based  variable selection
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号