首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 62 毫秒
1.
Logistic regression is the primary analysis tool for binary traits in genome-wide association studies (GWAS). Multinomial regression extends logistic regression to multiple categories. However, many phenotypes more naturally take ordered, discrete values. Examples include (a) subtypes defined from multiple sources of clinical information and (b) derived phenotypes generated by specific phenotyping algorithms for electronic health records (EHR). GWAS of ordinal traits have been problematic. Dichotomizing can lead to a range of arbitrary cutoff values, generating inconsistent, hard to interpret results. Using multinomial regression ignores trait value hierarchy and potentially loses power. Treating ordinal data as quantitative can lead to misleading inference. To address these issues, we analyze ordinal traits with an ordered, multinomial model. This approach increases power and leads to more interpretable results. We derive efficient algorithms for computing test statistics, making ordinal trait GWAS computationally practical for Biobank scale data. Our method is available as a Julia package OrdinalGWAS.jl. Application to a COPDGene study confirms previously found signals based on binary case–control status, but with more significance. Additionally, we demonstrate the capability of our package to run on UK Biobank data by analyzing hypertension as an ordinal trait.  相似文献   

2.
In health services research studies, a score for satisfaction with care is often determined for a patient by summing several items, each measured on a Likert scale with ordered response options indicating satisfaction with some aspect of medical care (e.g., five categories ordered 1-poor to 5-excellent). A common goal is to determine patient and physician level predictors of patient satisfaction using regression analysis. The large number of categories in the ordinal summed response variable may present obstacles to traditional analytic methods for ordinal data such as the proportional odds model for cumulative logits. Further, linear regression is generally known to be inappropriate. Quasi-likelihood methods provide a flexible and tractable alternative modelling procedure. Weak assumptions about the measurement scale may be made by estimating parameters that define a family of link functions. A quasi-likelihood analysis of data from a study of elderly patients' satisfaction with communication with their primary care physician is presented. Although several factors are significantly related to satisfaction, a diagnostic plot based upon cumulative deviances reveals inadequacy of fit for the patients with the lowest observed satisfaction.  相似文献   

3.
In many medical studies, researchers widely use composite or long ordinal scores, that is, scores that have a large number of categories and a natural ordering often resulting from the sum of a number of short ordinal scores, to assess function or quality of life. Typically, we analyse these using unjustified assumptions of normality for the outcome measure, which are unlikely to be even approximately true. Scores of this type are better analysed using methods reserved for more conventional (short) ordinal scores, such as the proportional‐odds model. We can avoid the need for a large number of cut‐point parameters that define the divisions between the score categories for long ordinal scores in the proportional‐odds model by the inclusion of orthogonal polynomial contrasts. We introduce the repeated measures proportional‐odds logistic regression model and describe for long ordinal outcomes modifications to the generalized estimating equation methodology used for parameter estimation. We introduce data from a trial assessing two surgical interventions, briefly describe and re‐analyse these using the new model and compare inferences from the new analysis with previously published results for the primary outcome measure (hip function at 12 months postoperatively). We use a simulation study to illustrate how this model also has more general application for conventional short ordinal scores, to select amongst competing models of varying complexity for the cut‐point parameters. Copyright © 2013 John Wiley & Sons, Ltd.  相似文献   

4.
A mixed-effects multinomial logistic regression model is described for analysis of clustered or longitudinal nominal or ordinal response data. The model is parameterized to allow flexibility in the choice of contrasts used to represent comparisons across the response categories. Estimation is achieved using a maximum marginal likelihood (MML) solution that uses quadrature to numerically integrate over the distribution of random effects. An analysis of a psychiatric data set, in which homeless adults with serious mental illness are repeatedly classified in terms of their living arrangement, is used to illustrate features of the model.  相似文献   

5.
A method is proposed for classification to ordinal categories by applying the search partition analysis (SPAN) approach. It is suggested that SPAN be repeatedly applied to binary outcomes formed by collapsing adjacent categories of the ordinal scale. By a simple device, whereby successive binary partitions are constrained to be nested, a partition for classification to the ordinal states is obtained. The approach is applied to ordinal categories of glucose tolerance to discriminate between diabetes, impaired glucose tolerance and normal states. The results are compared with analysis by ordinal logistic regression and by classification trees.  相似文献   

6.
For many clinical trials and epidemiologic investigations in the field of ophthalmology, paired ordinal data are often collected through the detailed grading of retinal photographs. One method for analysis of these data is the extension of the generalized estimating equation (GEE) methodology to multinomial data with cumulative link functions. Prior to the development of this advanced technique, however, ophthalmologists developed a method of combining the ordinal responses of both eyes of a patient into a single person-level response on a new ordinal scale. A relationship between the regression coefficients of these two methods is derived as a function of the correlation between eyes. We investigate the applicability of this result and the relationship of the standard errors in simulation experiments and in an example from the Wisconsin Epidemiologic Study of Diabetic Retinopathy.  相似文献   

7.
This paper presents a new goodness‐of‐fit test for an ordered stereotype model used for an ordinal response variable. The proposed test is based on the well‐known Hosmer–Lemeshow test and its version for the proportional odds regression model. The latter test statistic is calculated from a grouping scheme assuming that the levels of the ordinal response are equally spaced which might be not true. One of the main advantages of the ordered stereotype model is that it allows us to determine a new uneven spacing of the ordinal response categories, dictated by the data. The proposed test takes the use of this new adjusted spacing to partition data. A simulation study shows good performance of the proposed test under a variety of scenarios. Finally, the results of the application in two examples are presented. Copyright © 2016 John Wiley & Sons, Ltd.  相似文献   

8.
A non-parametric multi-dimensional isotonic regression estimator is developed for use in estimating a set of target quantiles from an ordinal toxicity scale. We compare this estimator to the standard parametric maximum likelihood estimator from a proportional odds model for extremely small data sets. A motivating example is from phase I oncology clinical trials, where various non-parametric designs have been proposed that lead to very small data sets, often with ordinal toxicity response data. Our comparison of estimators is performed in conjunction with three of these non-parametric sequential designs for ordinal response data, two from the literature and a new design based on a random walk rule. We also compare with a non-parametric design for binary response trials, by keeping track of ordinal data for estimation purposes, but dichotomizing the data in the design phase. We find that a multidimensional isotonic regression-based estimator far exceeds the others in terms of accuracy and efficiency. A rule by Simon et al. (J. Natl. Cancer Inst. 1997; 89:1138-1147) yields particularly efficient estimators, more so than the random walk rule, but has higher numbers of dose-limiting toxicity. A small data set from a leukemia clinical trial is analysed using our multidimensional isotonic regression-based estimator.  相似文献   

9.
Health surveys often include a general question on self-assessed health (SAH), usually measured on an ordinal scale with three to five response categories, from 'very poor' or 'poor' to 'very good' or 'excellent'. This paper assesses the scaling of responses on the SAH question. It compares alternative procedures designed to impose cardinality on the ordinal responses. These include OLS, ordered probit and interval regression approaches. The cardinal measures of health are used to compute and decompose concentration indices for income-related inequality in health. Results are provided using Finnish data on 15D and the SAH questions. Further evidence emerges for the internal validity of a method used in a pioneering study by van Doorslaer and Jones which was based on Canadian data on the McMaster Health Utility Index Mark III (HUI) and SAH. The study validates the conclusions drawn by van Doorslaer and Jones. It confirms that the interval regression approach is superior to OLS and ordered probit regression in assessing health inequality. However, regarding the choice of scaling instrument, it is concluded that the scaling of SAH categories and, consequently, the measured degree of inequality, are sensitive to characteristics of the chosen scaling instrument.  相似文献   

10.
Many investigators conducting translational research are performing high‐throughput genomic experiments and then developing multigenic classifiers using the resulting high‐dimensional data set. In a large number of applications, the class to be predicted may be inherently ordinal. Examples of ordinal outcomes include tumor‐node‐metastasis (TNM) stage (I, II, III, IV); drug toxicity evaluated as none, mild, moderate, or severe; and response to treatment classified as complete response, partial response, stable disease, or progressive disease. While one can apply nominal response classification methods to ordinal response data, in doing so some information is lost that may improve the predictive performance of the classifier. This study examined the effectiveness of alternative ordinal splitting functions combined with bootstrap aggregation for classifying an ordinal response. We demonstrate that the ordinal impurity and ordered twoing methods have desirable properties for classifying ordinal response data and both perform well in comparison to other previously described methods. Developing a multigenic classifier is a common goal for microarray studies, and therefore application of the ordinal ensemble methods is demonstrated on a high‐throughput methylation data set. Copyright © 2009 John Wiley & Sons, Ltd.  相似文献   

11.
Adolescent alcohol use is a serious public health concern. Despite advances in the theoretical conceptualization of pathways to alcohol use, researchers are limited by the statistical techniques currently available. Researchers often fit linear models and restrictive categorical models (e.g., proportional odds models) to ordinal data with many response categories defined by collapsed count data (0 drinking days, 1–2days, 3–6days, etc.). Consequently, existing models ignore the underlying count process, resulting in disjoint between the construct of interest and the models being fitted. Our proposed ordinal modeling approach overcomes this limitation by explicitly linking ordinal responses to a suitable underlying count distribution. In doing so, researchers can use maximum likelihood estimation to fit count models to ordinal data as if they had directly observed the underlying discrete counts. The usefulness of our ordinal negative binomial and ordinal zero‐inflated negative binomial models is verified by simulation studies. We also demonstrate our approach using real empirical data from the 2010 National Survey of Drug Use and Health. Results show the benefit of the proposed ordinal modeling framework compared with existing methods. Copyright © 2015 John Wiley & Sons, Ltd.  相似文献   

12.
Ordinal regression models for epidemiologic data   总被引:7,自引:0,他引:7  
Health status is often measured in epidemiologic studies on an ordinal scale, but data of this type are generally reduced for analysis to a single dichotomy. Several statistical models have been developed to make full use of information in ordinal response data, but have not been much used in analyzing epidemiologic studies. The authors discuss two of these statistical models--the cumulative odds model and the continuation ratio model. They may be interpreted in terms of odds ratios, can account for confounding variables, have clear and testable assumptions, and have parameters that may be estimated and hypotheses that may be tested using available statistical packages. However, calculations of asymptotic relative efficiency and results of simulations showed that simple logistic regression applied to dichotomized responses can in some realistic situations have more than 75% of the efficiency of ordinal regression models, but only if the ordinal scale is collapsed into a dichotomy close to the optimal point. The application of the proposed models to data from a study of chest x-rays of workers exposed to mineral fibers confirmed that they are easy to use and interpret, but gave results quite similar to those obtained using simple logistic regression after dichotomizing outcome in the conventional way.  相似文献   

13.
In agreement studies, when objects are rated independently by two raters (or twice by the same rater), an association between their ratings on two categories arises, reflecting the distinguishability of these two categories for these raters. When ratings are performed on an ordinal scale, this association between ratings on two categories increases when the distance between these categories increases on the ordinal scale. Goodman's log-linear models derived for the analysis of agreement between two raters on an ordinal scale assume that distinguishabilities between adjacent categories are either constant, or a priori fixed. Log-non-linear models that allow variations of the distinguishabilities between adjacent categories along the scale, may lead to difficulties in parameter estimation.This paper describes a new class of log-linear non-uniform association models. These models extend the log-linear uniform association model by allowing variations of distinguishability between adjacent categories (along the scale). These new models are used to analyse ordinal agreement between dermatologists when assessing the severity of different cutaneous signs of ageing on women faces.  相似文献   

14.
Response misclassification of counted data biases and understates the uncertainty of parameter estimators in Poisson regression models. To correct these problems, researchers have devised classical procedures that rely on asymptotic distribution results and supplemental validation data in order to estimate unknown misclassification parameters. We derive a new Bayesian Poisson regression procedure that accounts and corrects for misclassification for a count variable with two categories. Under the Bayesian paradigm, one can use validation data, expert opinion, or a combination of these two approaches to correct for the consequences of misclassification. The Bayesian procedure proposed here yields an operationally effective way to correct and account for misclassification effects in Poisson count regression models. We demonstrate the performance of the model in a simulation study. Additionally, we analyze two real-data examples and compare our new Bayesian inference method that adjusts for misclassification with a similar analysis that ignores misclassification.  相似文献   

15.
This paper describes the methodologies used to develop a prediction model to assist health workers in developing countries in facing one of the most difficult health problems in all parts of the world: the presentation of an acutely ill young infant. Statistical approaches for developing the clinical prediction model faced at least two major difficulties. First, the number of predictor variables, especially clinical signs and symptoms, is very large, necessitating the use of data reduction techniques that are blinded to the outcome. Second, there is no uniquely accepted continuous outcome measure or final binary diagnostic criterion. For example, the diagnosis of neonatal sepsis is ill-defined. Clinical decision makers must identify infants likely to have positive cultures as well as to grade the severity of illness. In the WHO/ARI Young Infant Multicentre Study we have found an ordinal outcome scale made up of a mixture of laboratory and diagnostic markers to have several clinical advantages as well as to increase the power of tests for risk factors. Such a mixed ordinal scale does present statistical challenges because it may violate constant slope assumptions of ordinal regression models. In this paper we develop and validate an ordinal predictive model after choosing a data reduction technique. We show how ordinality of the outcome is checked against each predictor. We describe new but simple techniques for graphically examining residuals from ordinal logistic models to detect problems with variable transformations as well as to detect non-proportional odds and other lack of fit. We examine an alternative type of ordinal logistic model, the continuation ratio model, to determine if it provides a better fit. We find that it does not but that this model is easily modified to allow the regression coefficients to vary with cut-offs of the response variable. Complex terms in this extended model are penalized to allow only as much complexity as the data will support. We approximate the extended continuation ratio model with a model with fewer terms to allow us to draw a nomogram for obtaining various predictions. The model is validated for calibration and discrimination using the bootstrap. We apply much of the modelling strategy described in Harrell, Lee and Mark (Statist. Med. 15 , 361–387 (1998)) for survival analysis, adapting it to ordinal logistic regression and further emphasizing penalized maximum likelihood estimation and data reduction. © 1998 John Wiley & Sons, Ltd.  相似文献   

16.
Health-Related Quality of Life (HRQoL) measures are becoming increasingly used in clinical trials as primary outcome measures. Investigators are now asking statisticians for advice on how to analyse studies that have used HRQoL outcomes.HRQoL outcomes, like the SF-36, are usually measured on an ordinal scale. However, most investigators assume that there exists an underlying continuous latent variable that measures HRQoL, and that the actual measured outcomes (the ordered categories), reflect contiguous intervals along this continuum.The ordinal scaling of HRQoL measures means they tend to generate data that have discrete, bounded and skewed distributions. Thus, standard methods of analysis such as the t-test and linear regression that assume Normality and constant variance may not be appropriate. For this reason, conventional statistical advice would suggest that non-parametric methods be used to analyse HRQoL data. The bootstrap is one such computer intensive non-parametric method for analysing data.We used the bootstrap for hypothesis testing and the estimation of standard errors and confidence intervals for parameters, in four datasets (which illustrate the different aspects of study design). We then compared and contrasted the bootstrap with standard methods of analysing HRQoL outcomes. The standard methods included t-tests, linear regression, summary measures and General Linear Models.Overall, in the datasets we studied, using the SF-36 outcome, bootstrap methods produce results similar to conventional statistical methods. This is likely because the t-test and linear regression are robust to the violations of assumptions that HRQoL data are likely to cause (i.e. non-Normality). While particular to our datasets, these findings are likely to generalise to other HRQoL outcomes, which have discrete, bounded and skewed distributions. Future research with other HRQoL outcome measures, interventions and populations, is required to confirm this conclusion.  相似文献   

17.
We consider the case of a community intervention trial evaluated with a series of cross-sectional surveys and having outcomes measured on an ordinal scale. We propose a modelling procedure that combines ridit analysis and linear regression methods. We use the multinomial distribution as the basis for variance estimation of the mean ridits and then use simple regression models to estimate differences (for example, between intervention and comparison areas) among the ridits. We illustrate this procedure with data from a community intervention trial promoting condom use, with the adoption of consistent condom use measured on a 5-point ordinal scale.  相似文献   

18.
Correlated ordinal response data often arise in public health studies. Sample-size (power) calculations are a crucial step in designing such studies to ensure an adequate sample to detect a significant effect. Here we extend Rochon's method of sample-size estimation with a repeated binary response to the ordinal case. The proposed sample-size calculations are based on an analysis with generalized estimating equations (GEE) and inference with the Wald test. Simulation results demonstrate the merit of the proposed power calculations. Analysis of an arthritis clinical trial is used for illustration.  相似文献   

19.
Ordinal responses are very common in longitudinal data collected from substance abuse research or other behavioral research. This study develops a new statistical model with free SAS macros that can be applied to characterize time‐varying effects on ordinal responses. Our simulation study shows that the ordinal‐scale time‐varying effects model has very low estimation bias and sometimes offers considerably better performance when fitting data with ordinal responses than a model that treats the response as continuous. Contrary to a common assumption that an ordinal scale with several levels can be treated as continuous, our results indicate that it is not so much the number of levels on the ordinal scale but rather the skewness of the distribution that makes a difference on relative performance of linear versus ordinal models. We use longitudinal data from a well‐known study on youth at high risk for substance abuse as a motivating example to demonstrate that the proposed model can characterize the time‐varying effect of negative peer influences on alcohol use in a way that is more consistent with the developmental theory and existing literature, in comparison with the linear time‐varying effect model. Copyright © 2014 John Wiley & Sons, Ltd.  相似文献   

20.
Classifying a measurable clinical outcome as a dichotomous variable often involves difficulty with borderline cases that could fairly be assigned either of the two binary class memberships. In such situations the indicated class membership is often highly subjective and subject to, for instance, a measurement error. In other situations the intermediate level of a three-level ordinal factor may sometimes be explicitly reserved for cases which could likely belong to either of the two binary classes. Such indefinite readings are often eliminated from the statistical analysis. In this article we review conceptual and methodological aspects of employing proportional odds logistic regression for a three level ordinal factor as a suitable alternative to ordinary logistic regression when dealing with limited uncertainty in classifying clinical outcome as a binary variable.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号