Using biological knowledge to discover higher order interactions in genetic association studies |
| |
Authors: | Chen Gary K Thomas Duncan C |
| |
Institution: | Division of Biostatics, Department of Preventive Medicine, University of Southern California, Los Angeles, California 90089-9601, USA. gary.k.chen@usc.edu |
| |
Abstract: | The recent successes of genome-wide association studies (GWAS) have revealed that many of the replicated findings have explained only a small fraction of the heritability of common diseases. One hypothesis that investigators have suggested is that higher order interactions between SNPs or SNPs and environmental risk factors may account for some of this missing heritability. Searching for these interactions poses great statistical and computational challenges. In this article, we propose a novel method that addresses these challenges by incorporating external biological knowledge into a fully Bayesian analysis. The method is designed to be scalable for high-dimensional search spaces (where it supports interactions of any order) because priors that use such knowledge focus the search in regions that are more biologically plausible and avoid having to enumerate all possible interactions. We provide several examples based on simulated data demonstrating how external information can enhance power, specificity, and effect estimates in comparison to conventional approaches based on maximum likelihood estimates. We also apply the method to data from a GWAS for breast cancer, revealing a set of interactions enriched for the Gene Ontology terms growth, metabolic process, and biological regulation. |
| |
Keywords: | Bayesian analysis variable selection pathway MCMC |
本文献已被 PubMed 等数据库收录! |
|