首页 | 本学科首页   官方微博 | 高级检索  
检索        


Sequencing and imputation in GWAS: Cost-effective strategies to increase power and genomic coverage across diverse populations
Authors:Corbin Quick  Pramod Anugu  Solomon Musani  Scott T Weiss  Esteban G Burchard  Marquitta J White  Kevin L Keys  Francesco Cucca  Carlo Sidore  Michael Boehnke  Christian Fuchsberger
Institution:1. Department of Biostatistics and Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, Michigan;2. University of Mississippi Medical Center, Jackson, Mississippi;3. Harvard Medical School, Boston, Massachusetts

Channing Department of Network Medicine, Brigham and Women's Hospital, Boston, California

Partners HealthCare Personalized Medicine, Boston, Massachusetts;4. Department of Medicine, University of California San Francisco, San Francisco, California

Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, California;5. Department of Medicine, University of California San Francisco, San Francisco, California;6. Istituto di Ricerca Genetica e Biomedica (IRGB), CNR, Monserrato, Italy

Dipartimento di Scienze Biomediche, Università di Sassari, Sassari, Italy;7. Istituto di Ricerca Genetica e Biomedica (IRGB), CNR, Monserrato, Italy

Abstract:A key aim for current genome-wide association studies (GWAS) is to interrogate the full spectrum of genetic variation underlying human traits, including rare variants, across populations. Deep whole-genome sequencing is the gold standard to fully capture genetic variation, but remains prohibitively expensive for large sample sizes. Array genotyping interrogates a sparser set of variants, which can be used as a scaffold for genotype imputation to capture a wider set of variants. However, imputation quality depends crucially on reference panel size and genetic distance from the target population. Here, we consider sequencing a subset of GWAS participants and imputing the rest using a reference panel that includes both sequenced GWAS participants and an external reference panel. We investigate how imputation quality and GWAS power are affected by the number of participants sequenced for admixed populations (African and Latino Americans) and European population isolates (Sardinians and Finns), and identify powerful, cost-effective GWAS designs given current sequencing and array costs. For populations that are well-represented in existing reference panels, we find that array genotyping alone is cost-effective and well-powered to detect common- and rare-variant associations. For poorly represented populations, sequencing a subset of participants is often most cost-effective, and can substantially increase imputation quality and GWAS power.
Keywords:genotype imputation  genotyping  GWAS  rare variants  sequencing  study design  WGS
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号