首页 | 本学科首页   官方微博 | 高级检索  
检索        


A powerful statistical framework for generalization testing in GWAS,with application to the HCHS/SOL
Authors:Tamar Sofer  Ruth Heller  Marina Bogomolov  Christy L Avery  Mariaelisa Graff  Kari E North  Alex P Reiner  Timothy A Thornton  Kenneth Rice  Yoav Benjamini  Cathy C Laurie  Kathleen F Kerr
Institution:1. Department of Biostatistics, University of Washington, Seattle, WA, USA;2. Department of Statistics and Operations Research, Tel‐Aviv University, Tel‐Aviv, Israel;3. Faculty of Industrial Engineering and Management, Technion‐Israel Institute of Technology, Haifa, Israel;4. Department of Epidemiology, University of North Carolina, Chapel Hill, NC, USA;5. Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
Abstract:In genome‐wide association studies (GWAS), “generalization” is the replication of genotype‐phenotype association in a population with different ancestry than the population in which it was first identified. Current practices for declaring generalizations rely on testing associations while controlling the family‐wise error rate (FWER) in the discovery study, then separately controlling error measures in the follow‐up study. This approach does not guarantee control over the FWER or false discovery rate (FDR) of the generalization null hypotheses. It also fails to leverage the two‐stage design to increase power for detecting generalized associations. We provide a formal statistical framework for quantifying the evidence of generalization that accounts for the (in)consistency between the directions of associations in the discovery and follow‐up studies. We develop the directional generalization FWER (FWERg) and FDR (FDRg) controlling r‐values, which are used to declare associations as generalized. This framework extends to generalization testing when applied to a published list of Single Nucleotide Polymorphism‐(SNP)‐trait associations. Our methods control FWERg or FDRg under various SNP selection rules based on P‐values in the discovery study. We find that it is often beneficial to use a more lenient P‐value threshold than the genome‐wide significance threshold. In a GWAS of total cholesterol in the Hispanic Community Health Study/Study of Latinos (HCHS/SOL), when testing all SNPs with P‐values urn:x-wiley:07410395:media:gepi22029:gepi22029-math-0001 (15 genomic regions) for generalization in a large GWAS of whites, we generalized SNPs from 15 regions. But when testing all SNPs with P‐values urn:x-wiley:07410395:media:gepi22029:gepi22029-math-0002 (89 regions), we generalized SNPs from 27 regions.
Keywords:multiple testing  one‐sided P‐values  shared genetics
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号