Correcting for binomial measurement error in predictors in regression with application to analysis of DNA methylation rates by bisulfite sequencing |
| |
Authors: | John Buonaccorsi Agnieszka Prochenka Magne Thoresen Rafal Ploski |
| |
Affiliation: | 1. Department of Mathematics and Statistics, University of Massachusetts‐ Amherst, Amherst, MA, U.S.A.;2. Institute of Computer Science, Polish Academy of Sciences, Warsaw, Poland;3. Oslo Centre for Biostatistics and Epidemiology, Department of Biostatistics, University of Oslo, Oslo, Norway;4. Department of Medical Genetics, Warsaw Medical University, Warsaw, Poland |
| |
Abstract: | Motivated by a genetic application, this paper addresses the problem of fitting regression models when the predictor is a proportion measured with error. While the problem of dealing with additive measurement error in fitting regression models has been extensively studied, the problem where the additive error is of a binomial nature has not been addressed. The measurement errors here are heteroscedastic for two reasons; dependence on the underlying true value and changing sampling effort over observations. While some of the previously developed methods for treating additive measurement error with heteroscedasticity can be used in this setting, other methods need modification. A new version of simulation extrapolation is developed, and we also explore a variation on the standard regression calibration method that uses a beta‐binomial model based on the fact that the true value is a proportion. Although most of the methods introduced here can be used for fitting non‐linear models, this paper will focus primarily on their use in fitting a linear model. While previous work has focused mainly on estimation of the coefficients, we will, with motivation from our example, also examine estimation of the variance around the regression line. In addressing these problems, we also discuss the appropriate manner in which to bootstrap for both inferences and bias assessment. The various methods are compared via simulation, and the results are illustrated using our motivating data, for which the goal is to relate the methylation rate of a blood sample to the age of the individual providing the sample. Copyright © 2016 John Wiley & Sons, Ltd. |
| |
Keywords: | binomial bootstrap measurement error methylation regression calibration SIMEX |
|
|