Categorizing variables in the statistical analysis of data: consequences for interpreting the results] |
| |
Authors: | F Cumsille S I Bangdiwala |
| |
Affiliation: | División de Bioestadística, Escuela de Salud Pública y Centro de Epidemiología Clínica, Universidad de Chile, Direccion. |
| |
Abstract: | Frequently during the process of data analysis in epidemiological studies, the scale of one or more continuous variables is changed. The objective of this paper was to assess the consequences of categorizing variables during data analysis. We studied three situations with different scenarios for statistical analysis with regression models. The results show that dichotomizing continuous variables can substantially modify the relationships between dependent and independent variables. Thus, for example, in epidemiological studies trying to evaluate the effect of an exposure on a response, the magnitude and/or the direction of this effect can be biased by dichotomizing a variable. We therefore recommend avoiding, as much as possible, the categorization of variables when doing analyses. |
| |
Keywords: | |
|
|