首页 | 本学科首页   官方微博 | 高级检索  
检索        


Ordered multinomial regression for genetic association analysis of ordinal phenotypes at Biobank scale
Authors:Christopher A German  Janet S Sinsheimer  Yann C Klimentidis  Hua Zhou  Jin J Zhou
Institution:1. Department of Biostatistics, UCLA Fielding School of Public Health, Los Angeles, California;2. Department of Biostatistics, UCLA Fielding School of Public Health, Los Angeles, California

Department of Human Genetics, David Geffen School of Medicine at UCLA, Los Angeles, California

Department of Computational Medicine, David Geffen School of Medicine at UCLA, Los Angeles, California;3. Department of Epidemiology and Biostatistics, Mel and Enid Zuckerman College of Public Health, University of Arizona, Tucson, Arizona

Abstract:Logistic regression is the primary analysis tool for binary traits in genome-wide association studies (GWAS). Multinomial regression extends logistic regression to multiple categories. However, many phenotypes more naturally take ordered, discrete values. Examples include (a) subtypes defined from multiple sources of clinical information and (b) derived phenotypes generated by specific phenotyping algorithms for electronic health records (EHR). GWAS of ordinal traits have been problematic. Dichotomizing can lead to a range of arbitrary cutoff values, generating inconsistent, hard to interpret results. Using multinomial regression ignores trait value hierarchy and potentially loses power. Treating ordinal data as quantitative can lead to misleading inference. To address these issues, we analyze ordinal traits with an ordered, multinomial model. This approach increases power and leads to more interpretable results. We derive efficient algorithms for computing test statistics, making ordinal trait GWAS computationally practical for Biobank scale data. Our method is available as a Julia package OrdinalGWAS.jl. Application to a COPDGene study confirms previously found signals based on binary case–control status, but with more significance. Additionally, we demonstrate the capability of our package to run on UK Biobank data by analyzing hypertension as an ordinal trait.
Keywords:electronic health record  genome-wide association study  ordered multinomial regression
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号