首页 | 本学科首页   官方微博 | 高级检索  
检索        


Detecting genetic association through shortest paths in a bidirected graph
Authors:Masao Ueki  Yoshinori Kawasaki  Gen Tamiya  for Alzheimer's Disease Neuroimaging Initiative
Institution:1. Biostatistics Center, Kurume University, Fukuoka, Japan;2. Department of Statistical Modeling, The Institute of Statistical Mathematics, The Graduate University for Advanced Studies, Tachikawa, Tokyo, Japan;3. Statistical Genetics and Genomics, Tohoku Medical Megabank Organization, Tohoku University, Aoba‐Ku, Sendai, Japan
Abstract:Genome‐wide association studies (GWASs) commonly use marginal association tests for each single‐nucleotide polymorphism (SNP). Because these tests treat SNPs as independent, their power will be suboptimal for detecting SNPs hidden by linkage disequilibrium (LD). One way to improve power is to use a multiple regression model. However, the large number of SNPs preclude simultaneous fitting with multiple regression, and subset regression is infeasible because of an exorbitant number of candidate subsets. We therefore propose a new method for detecting hidden SNPs having significant yet weak marginal association in a multiple regression model. Our method begins by constructing a bidirected graph locally around each SNP that demonstrates a moderately sized marginal association signal, the focal SNPs. Vertexes correspond to SNPs, and adjacency between vertexes is defined by an LD measure. Subsequently, the method collects from each graph all shortest paths to the focal SNP. Finally, for each shortest path the method fits a multiple regression model to all the SNPs lying in the path and tests the significance of the regression coefficient corresponding to the terminal SNP in the path. Simulation studies show that the proposed method can detect susceptibility SNPs hidden by LD that go undetected with marginal association testing or with existing multivariate methods. When applied to real GWAS data from the Alzheimer's Disease Neuroimaging Initiative (ADNI), our method detected two groups of SNPs: one in a region containing the apolipoprotein E (APOE) gene, and another in a region close to the semaphorin 5A (SEMA5A) gene.
Keywords:bidirected graph  conservative multiple test  hidden association  linkage disequilibrium  shortest path
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号