首页 | 官方网站   微博 | 高级检索  
     


Population Genetics Identifies Challenges in Analyzing Rare Variants
Authors:Henry Richard Johnston  Yijuan Hu  David J Cutler
Affiliation:1. Department of Biostatistics and Bioinformatics, Emory University, Atlanta, Georgia, United States of America;2. Department of Human Genetics, Emory University School of Medicine, Atlanta, Georgia, United States of America
Abstract:Geneticists have, for years, understood the nature of genome‐wide association studies using common genomic variants. Recently, however, focus has shifted to the analysis of rare variants. This presents potential problems for researchers, as rare variants do not always behave in the same way common variants do, sometimes rendering decades of solid intuition moot. In this paper, we present examples of the differences between common and rare variants. We show why one must be significantly more careful about the origin of rare variants, and how failing to do so can lead to highly inflated type I error. We then explain how to best avoid such concerns with careful understanding and study design. Additionally, we demonstrate that a seemingly low error rate in next‐generation sequencing can dramatically impact the false‐positive rate for rare variants. This is due to the fact that rare variants are, by definition, seen infrequently, making it hard to distinguish between errors and real variants. Compounding this problem is the fact that the proportion of errors is likely to get worse, not better, with increasing sample size. One cannot simply scale their way up in order to solve this problem. Understanding these potential pitfalls is a key step in successfully identifying true associations between rare variants and diseases.
Keywords:rare variants  whole genome sequencing  error rates
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号