期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Meta‐analysis with missing study‐level sample variance data

Amit K. Chowdhry Robert H. Dworkin Michael P. McDermott 《Statistics in medicine》2016,35(17):3021-3032

We consider a study‐level meta‐analysis with a normally distributed outcome variable and possibly unequal study‐level variances, where the object of inference is the difference in means between a treatment and control group. A common complication in such an analysis is missing sample variances for some studies. A frequently used approach is to impute the weighted (by sample size) mean of the observed variances (mean imputation). Another approach is to include only those studies with variances reported (complete case analysis). Both mean imputation and complete case analysis are only valid under the missing‐completely‐at‐random assumption, and even then the inverse variance weights produced are not necessarily optimal. We propose a multiple imputation method employing gamma meta‐regression to impute the missing sample variances. Our method takes advantage of study‐level covariates that may be used to provide information about the missing data. Through simulation studies, we show that multiple imputation, when the imputation model is correctly specified, is superior to competing methods in terms of confidence interval coverage probability and type I error probability when testing a specified group difference. Finally, we describe a similar approach to handling missing variances in cross‐over studies. Copyright © 2016 John Wiley & Sons, Ltd. 相似文献

2.

Imputation of systematically missing predictors in an individual participant data meta‐analysis: a generalized approach using MICE

下载免费PDF全文

Shahab Jolani Thomas P. A. Debray Hendrik Koffijberg Stef van Buuren Karel G. M. Moons 《Statistics in medicine》2015,34(11):1841-1863

Individual participant data meta‐analyses (IPD‐MA) are increasingly used for developing and validating multivariable (diagnostic or prognostic) risk prediction models. Unfortunately, some predictors or even outcomes may not have been measured in each study and are thus systematically missing in some individual studies of the IPD‐MA. As a consequence, it is no longer possible to evaluate between‐study heterogeneity and to estimate study‐specific predictor effects, or to include all individual studies, which severely hampers the development and validation of prediction models. Here, we describe a novel approach for imputing systematically missing data and adopt a generalized linear mixed model to allow for between‐study heterogeneity. This approach can be viewed as an extension of Resche‐Rigon's method (Stat Med 2013), relaxing their assumptions regarding variance components and allowing imputation of linear and nonlinear predictors. We illustrate our approach using a case study with IPD‐MA of 13 studies to develop and validate a diagnostic prediction model for the presence of deep venous thrombosis. We compare the results after applying four methods for dealing with systematically missing predictors in one or more individual studies: complete case analysis where studies with systematically missing predictors are removed, traditional multiple imputation ignoring heterogeneity across studies, stratified multiple imputation accounting for heterogeneity in predictor prevalence, and multilevel multiple imputation (MLMI) fully accounting for between‐study heterogeneity. We conclude that MLMI may substantially improve the estimation of between‐study heterogeneity parameters and allow for imputation of systematically missing predictors in IPD‐MA aimed at the development and validation of prediction models. Copyright © 2015 John Wiley & Sons, Ltd. 相似文献

3.

Multiple imputation for handling systematically missing confounders in meta‐analysis of individual participant data

Matthieu Resche‐Rigon Ian R. White Jonathan&#x;W. &#x;Bartlett Sanne A.E. Peters Simon G. Thompson 《Statistics in medicine》2013,32(28):4890-4905

A variable is ‘systematically missing’ if it is missing for all individuals within particular studies in an individual participant data meta‐analysis. When a systematically missing variable is a potential confounder in observational epidemiology, standard methods either fail to adjust the exposure–disease association for the potential confounder or exclude studies where it is missing. We propose a new approach to adjust for systematically missing confounders based on multiple imputation by chained equations. Systematically missing data are imputed via multilevel regression models that allow for heterogeneity between studies. A simulation study compares various choices of imputation model. An illustration is given using data from eight studies estimating the association between carotid intima media thickness and subsequent risk of cardiovascular events. Results are compared with standard methods and also with an extension of a published method that exploits the relationship between fully adjusted and partially adjusted estimated effects through a multivariate random effects meta‐analysis model. We conclude that multiple imputation provides a practicable approach that can handle arbitrary patterns of systematic missingness. Bias is reduced by including sufficient between‐study random effects in the imputation model. Copyright © 2013 John Wiley & Sons, Ltd. 相似文献

4.

Multiple imputation for IPD meta‐analysis: allowing for heterogeneity and studies with missing covariates

下载免费PDF全文

M. Quartagno J. R. Carpenter 《Statistics in medicine》2016,35(17):2938-2954

Recently, multiple imputation has been proposed as a tool for individual patient data meta‐analysis with sporadically missing observations, and it has been suggested that within‐study imputation is usually preferable. However, such within study imputation cannot handle variables that are completely missing within studies. Further, if some of the contributing studies are relatively small, it may be appropriate to share information across studies when imputing. In this paper, we develop and evaluate a joint modelling approach to multiple imputation of individual patient data in meta‐analysis, with an across‐study probability distribution for the study specific covariance matrices. This retains the flexibility to allow for between‐study heterogeneity when imputing while allowing (i) sharing information on the covariance matrix across studies when this is appropriate, and (ii) imputing variables that are wholly missing from studies. Simulation results show both equivalent performance to the within‐study imputation approach where this is valid, and good results in more general, practically relevant, scenarios with studies of very different sizes, non‐negligible between‐study heterogeneity and wholly missing variables. We illustrate our approach using data from an individual patient data meta‐analysis of hypertension trials. © 2015 The Authors. Statistics in Medicine Published by John Wiley & Sons Ltd. 相似文献

5.

Combining fractional polynomial model building with multiple imputation

下载免费PDF全文

Tim P. Morris Ian R. White James R. Carpenter Simon J. Stanworth Patrick Royston 《Statistics in medicine》2015,34(25):3298-3317

Multivariable fractional polynomial (MFP) models are commonly used in medical research. The datasets in which MFP models are applied often contain covariates with missing values. To handle the missing values, we describe methods for combining multiple imputation with MFP modelling, considering in turn three issues: first, how to impute so that the imputation model does not favour certain fractional polynomial (FP) models over others; second, how to estimate the FP exponents in multiply imputed data; and third, how to choose between models of differing complexity. Two imputation methods are outlined for different settings. For model selection, methods based on Wald‐type statistics and weighted likelihood‐ratio tests are proposed and evaluated in simulation studies. The Wald‐based method is very slightly better at estimating FP exponents. Type I error rates are very similar for both methods, although slightly less well controlled than analysis of complete records; however, there is potential for substantial gains in power over the analysis of complete records. We illustrate the two methods in a dataset from five trauma registries for which a prognostic model has previously been published, contrasting the selected models with that obtained by analysing the complete records only. © 2015 The Authors. Statistics in Medicine Published by John Wiley & Sons Ltd. 相似文献

6.

Multiple imputation in the presence of non‐normal data

下载免费PDF全文

Katherine J. Lee John B. Carlin 《Statistics in medicine》2017,36(4):606-617

Multiple imputation (MI) is becoming increasingly popular for handling missing data. Standard approaches for MI assume normality for continuous variables (conditionally on the other variables in the imputation model). However, it is unclear how to impute non‐normally distributed continuous variables. Using simulation and a case study, we compared various transformations applied prior to imputation, including a novel non‐parametric transformation, to imputation on the raw scale and using predictive mean matching (PMM) when imputing non‐normal data. We generated data from a range of non‐normal distributions, and set 50% to missing completely at random or missing at random. We then imputed missing values on the raw scale, following a zero‐skewness log, Box–Cox or non‐parametric transformation and using PMM with both type 1 and 2 matching. We compared inferences regarding the marginal mean of the incomplete variable and the association with a fully observed outcome. We also compared results from these approaches in the analysis of depression and anxiety symptoms in parents of very preterm compared with term‐born infants. The results provide novel empirical evidence that the decision regarding how to impute a non‐normal variable should be based on the nature of the relationship between the variables of interest. If the relationship is linear in the untransformed scale, transformation can introduce bias irrespective of the transformation used. However, if the relationship is non‐linear, it may be important to transform the variable to accurately capture this relationship. A useful alternative is to impute the variable using PMM with type 1 matching. Copyright © 2016 John Wiley & Sons, Ltd. 相似文献

7.

An analytic method for the placebo‐based pattern‐mixture model

Kaifeng Lu 《Statistics in medicine》2014,33(7):1134-1145

Pattern‐mixture models provide a general and flexible framework for sensitivity analyses of nonignorable missing data. The placebo‐based pattern‐mixture model (Little and Yau, Biometrics 1996; 52 :1324–1333) treats missing data in a transparent and clinically interpretable manner and has been used as sensitivity analysis for monotone missing data in longitudinal studies. The standard multiple imputation approach (Rubin, Multiple Imputation for Nonresponse in Surveys, 1987) is often used to implement the placebo‐based pattern‐mixture model. We show that Rubin's variance estimate of the multiple imputation estimator of treatment effect can be overly conservative in this setting. As an alternative to multiple imputation, we derive an analytic expression of the treatment effect for the placebo‐based pattern‐mixture model and propose a posterior simulation or delta method for the inference about the treatment effect. Simulation studies demonstrate that the proposed methods provide consistent variance estimates and outperform the imputation methods in terms of power for the placebo‐based pattern‐mixture model. We illustrate the methods using data from a clinical study of major depressive disorders. Copyright © 2013 John Wiley & Sons, Ltd. 相似文献

8.

Multiple imputation for harmonizing longitudinal non‐commensurate measures in individual participant data meta‐analysis

下载免费PDF全文

Juned Siddique Jerome P. Reiter Ahnalee Brincks Robert D. Gibbons Catherine M. Crespi C. Hendricks Brown 《Statistics in medicine》2015,34(26):3399-3414

There are many advantages to individual participant data meta‐analysis for combining data from multiple studies. These advantages include greater power to detect effects, increased sample heterogeneity, and the ability to perform more sophisticated analyses than meta‐analyses that rely on published results. However, a fundamental challenge is that it is unlikely that variables of interest are measured the same way in all of the studies to be combined. We propose that this situation can be viewed as a missing data problem in which some outcomes are entirely missing within some trials and use multiple imputation to fill in missing measurements. We apply our method to five longitudinal adolescent depression trials where four studies used one depression measure and the fifth study used a different depression measure. None of the five studies contained both depression measures. We describe a multiple imputation approach for filling in missing depression measures that makes use of external calibration studies in which both depression measures were used. We discuss some practical issues in developing the imputation model including taking into account treatment group and study. We present diagnostics for checking the fit of the imputation model and investigate whether external information is appropriately incorporated into the imputed values. Copyright © 2015 John Wiley & Sons, Ltd. 相似文献

9.

Accounting for uncertainty due to ‘last observation carried forward’ outcome imputation in a meta‐analysis model

下载免费PDF全文

Vasiliki Dimitrakopoulou Orestis Efthimiou Stefan Leucht Georgia Salanti 《Statistics in medicine》2015,34(5):742-752

Missing outcome data are a problem commonly observed in randomized control trials that occurs as a result of participants leaving the study before its end. Missing such important information can bias the study estimates of the relative treatment effect and consequently affect the meta‐analytic results. Therefore, methods on manipulating data sets with missing participants, with regard to incorporating the missing information in the analysis so as to avoid the loss of power and minimize the bias, are of interest. We propose a meta‐analytic model that accounts for possible error in the effect sizes estimated in studies with last observation carried forward (LOCF) imputed patients. Assuming a dichotomous outcome, we decompose the probability of a successful unobserved outcome taking into account the sensitivity and specificity of the LOCF imputation process for the missing participants. We fit the proposed model within a Bayesian framework, exploring different prior formulations for sensitivity and specificity. We illustrate our methods by performing a meta‐analysis of five studies comparing the efficacy of amisulpride versus conventional drugs (flupenthixol and haloperidol) on patients diagnosed with schizophrenia. Our meta‐analytic models yield estimates similar to meta‐analysis with LOCF‐imputed patients. Allowing for uncertainty in the imputation process, precision is decreased depending on the priors used for sensitivity and specificity. Results on the significance of amisulpride versus conventional drugs differ between the standard LOCF approach and our model depending on prior beliefs on the imputation process. Our method can be regarded as a useful sensitivity analysis that can be used in the presence of concerns about the LOCF process. Copyright © 2014 JohnWiley & Sons, Ltd. 相似文献

10.

Nonlinear multiple imputation for continuous covariate within semiparametric Cox model: application to HIV data in Senegal

Jules Brice Tchatchueng Mbougua Christian Laurent Ibra Ndoye Eric Delaporte Henri Gwet Nicolas Molinari 《Statistics in medicine》2013,32(26):4651-4665

Multiple imputation is commonly used to impute missing covariate in Cox semiparametric regression setting. It is to fill each missing data with more plausible values, via a Gibbs sampling procedure, specifying an imputation model for each missing variable. This imputation method is implemented in several softwares that offer imputation models steered by the shape of the variable to be imputed, but all these imputation models make an assumption of linearity on covariates effect. However, this assumption is not often verified in practice as the covariates can have a nonlinear effect. Such a linear assumption can lead to a misleading conclusion because imputation model should be constructed to reflect the true distributional relationship between the missing values and the observed values. To estimate nonlinear effects of continuous time invariant covariates in imputation model, we propose a method based on B‐splines function. To assess the performance of this method, we conducted a simulation study, where we compared the multiple imputation method using Bayesian splines imputation model with multiple imputation using Bayesian linear imputation model in survival analysis setting. We evaluated the proposed method on the motivated data set collected in HIV‐infected patients enrolled in an observational cohort study in Senegal, which contains several incomplete variables. We found that our method performs well to estimate hazard ratio compared with the linear imputation methods, when data are missing completely at random, or missing at random. Copyright © 2013 John Wiley & Sons, Ltd. 相似文献

11.

A comparison of existing methods for multiple imputation in individual participant data meta‐analysis

下载免费PDF全文

Deborah Kunkel Eloise E. Kaizar 《Statistics in medicine》2017,36(22):3507-3532

Multiple imputation is a popular method for addressing missing data, but its implementation is difficult when data have a multilevel structure and one or more variables are systematically missing. This systematic missing data pattern may commonly occur in meta‐analysis of individual participant data, where some variables are never observed in some studies, but are present in other hierarchical data settings. In these cases, valid imputation must account for both relationships between variables and correlation within studies. Proposed methods for multilevel imputation include specifying a full joint model and multiple imputation with chained equations (MICE). While MICE is attractive for its ease of implementation, there is little existing work describing conditions under which this is a valid alternative to specifying the full joint model. We present results showing that for multilevel normal models, MICE is rarely exactly equivalent to joint model imputation. Through a simulation study and an example using data from a traumatic brain injury study, we found that in spite of theoretical differences, MICE imputations often produce results similar to those obtained using the joint model. We also assess the influence of prior distributions in MICE imputation methods and find that when missingness is high, prior choices in MICE models tend to affect estimation of across‐study variability more than compatibility of conditional likelihoods. Copyright © 2017 John Wiley & Sons, Ltd. 相似文献

12.

Multiple imputation for an incomplete covariate that is a ratio

Tim P. Morris Ian R. White Patrick Royston Shaun R. Seaman Angela M. Wood 《Statistics in medicine》2014,33(1):88-104

We are concerned with multiple imputation of the ratio of two variables, which is to be used as a covariate in a regression analysis. If the numerator and denominator are not missing simultaneously, it seems sensible to make use of the observed variable in the imputation model. One such strategy is to impute missing values for the numerator and denominator, or the log‐transformed numerator and denominator, and then calculate the ratio of interest; we call this ‘passive’ imputation. Alternatively, missing ratio values might be imputed directly, with or without the numerator and/or the denominator in the imputation model; we call this ‘active’ imputation. In two motivating datasets, one involving body mass index as a covariate and the other involving the ratio of total to high‐density lipoprotein cholesterol, we assess the sensitivity of results to the choice of imputation model and, as an alternative, explore fully Bayesian joint models for the outcome and incomplete ratio. Fully Bayesian approaches using Winbugs were unusable in both datasets because of computational problems. In our first dataset, multiple imputation results are similar regardless of the imputation model; in the second, results are sensitive to the choice of imputation model. Sensitivity depends strongly on the coefficient of variation of the ratio's denominator. A simulation study demonstrates that passive imputation without transformation is risky because it can lead to downward bias when the coefficient of variation of the ratio's denominator is larger than about 0.1. Active imputation or passive imputation after log‐transformation is preferable. © 2013 The Authors. Statistics in Medicine published by John Wiley & Sons, Ltd. 相似文献

13.

The Role of Environmental Heterogeneity in Meta‐Analysis of Gene–Environment Interactions With Quantitative Traits

下载免费PDF全文

Shi Li Bhramar Mukherjee Jeremy M. G. Taylor Kenneth M. Rice Xiaoquan Wen John D. Rice Heather M. Stringham Michael Boehnke 《Genetic epidemiology》2014,38(5):416-429

With challenges in data harmonization and environmental heterogeneity across various data sources, meta‐analysis of gene–environment interaction studies can often involve subtle statistical issues. In this paper, we study the effect of environmental covariate heterogeneity (within and between cohorts) on two approaches for fixed‐effect meta‐analysis: the standard inverse‐variance weighted meta‐analysis and a meta‐regression approach. Akin to the results in Simmonds and Higgins ( 2007 ), we obtain analytic efficiency results for both methods under certain assumptions. The relative efficiency of the two methods depends on the ratio of within versus between cohort variability of the environmental covariate. We propose to use an adaptively weighted estimator (AWE), between meta‐analysis and meta‐regression, for the interaction parameter. The AWE retains full efficiency of the joint analysis using individual level data under certain natural assumptions. Lin and Zeng (2010a, b) showed that a multivariate inverse‐variance weighted estimator retains full efficiency as joint analysis using individual level data, if the estimates with full covariance matrices for all the common parameters are pooled across all studies. We show consistency of our work with Lin and Zeng (2010a, b). Without sacrificing much efficiency, the AWE uses only univariate summary statistics from each study, and bypasses issues with sharing individual level data or full covariance matrices across studies. We compare the performance of the methods both analytically and numerically. The methods are illustrated through meta‐analysis of interaction between Single Nucleotide Polymorphisms in FTO gene and body mass index on high‐density lipoprotein cholesterol data from a set of eight studies of type 2 diabetes. 相似文献

14.

Multiple imputation using chained equations: Issues and guidance for practice 总被引：1，自引：0，他引：1

White IR Royston P Wood AM 《Statistics in medicine》2011,30(4):377-399

Multiple imputation by chained equations is a flexible and practical approach to handling missing data. We describe the principles of the method and show how to impute categorical and quantitative variables, including skewed variables. We give guidance on how to specify the imputation model and how many imputations are needed. We describe the practical analysis of multiply imputed data, including model building and model checking. We stress the limitations of the method and discuss the possible pitfalls. We illustrate the ideas using a data set in mental health, giving Stata code fragments. 相似文献

15.

数据缺失机制识别及处理的标准化流程及集成系统

下载免费PDF全文

岳廷妍张昱勤李晓松马越张韬《现代预防医学》2019,(21):3928-3932

目的提出数据缺失机制识别及处理的标准化操作流程,并开发相应集成系统,为非统计专业背景的医学工作者处理缺失数据提供恰当、专业且简便的实现工具。方法系统集成了完成者数据集法、K最近邻分类算法和链式方程多元插值法等缺失数据处理方法,并将其归纳到缺失机制识别及处理的统一框架下,为缺失数据处理提供了从缺失统计,缺失机制识别到缺失处理的标准化流程。结果将归纳的标准化流程分步骤开发为缺失统计、缺失识别、缺失处理等功能模块并进行了集成化,构建了缺失机制识别及处理集成系统。结论标准化操作流程及集成系统实现了缺失机制识别加缺失数据处理全过程,操作方式简单便捷,结果展示直观易懂,为缺失数据的处理提供了更为简便可行的选择,便于医学工作者实际应用。相似文献

16.

Dealing with missing covariates in epidemiologic studies: a comparison between multiple imputation and a full Bayesian approach

下载免费PDF全文

Nicole S. Erler Dimitris Rizopoulos Joost van Rosmalen Vincent W. V. Jaddoe Oscar H. Franco Emmanuel M. E. H. Lesaffre 《Statistics in medicine》2016,35(17):2955-2974

Incomplete data are generally a challenge to the analysis of most large studies. The current gold standard to account for missing data is multiple imputation, and more specifically multiple imputation with chained equations (MICE). Numerous studies have been conducted to illustrate the performance of MICE for missing covariate data. The results show that the method works well in various situations. However, less is known about its performance in more complex models, specifically when the outcome is multivariate as in longitudinal studies. In current practice, the multivariate nature of the longitudinal outcome is often neglected in the imputation procedure, or only the baseline outcome is used to impute missing covariates. In this work, we evaluate the performance of MICE using different strategies to include a longitudinal outcome into the imputation models and compare it with a fully Bayesian approach that jointly imputes missing values and estimates the parameters of the longitudinal model. Results from simulation and a real data example show that MICE requires the analyst to correctly specify which components of the longitudinal process need to be included in the imputation models in order to obtain unbiased results. The full Bayesian approach, on the other hand, does not require the analyst to explicitly specify how the longitudinal outcome enters the imputation models. It performed well under different scenarios. Copyright © 2016 John Wiley & Sons, Ltd. 相似文献

17.

多重填补在随机干预试验研究中的应用

张熙林燧恒《中国卫生统计》2011,28(5)

目的利用多重填补方法实现对含缺失值的随机干预试验进行分析.方法结合心理干预试验研究数据,利用SAS程序PROC MI和PROC MIANALYZE实现缺失数据的填补,应用稳健协方差分析评价心理健康干预效果.结果填补与未填补分析结果一致,心理健康指标在干预组和对照组差别均无统计学意义,但CBO结局与干预有交互作用.结论干预对学生心理健康起到一定的作用,但差别无统计学意义. 相似文献

18.

Penalized regression procedures for variable selection in the potential outcomes framework

下载免费PDF全文

Debashis Ghosh Yeying Zhu Donna L. Coffman 《Statistics in medicine》2015,34(10):1645-1658

A recent topic of much interest in causal inference is model selection. In this article, we describe a framework in which to consider penalized regression approaches to variable selection for causal effects. The framework leads to a simple ‘impute, then select’ class of procedures that is agnostic to the type of imputation algorithm as well as penalized regression used. It also clarifies how model selection involves a multivariate regression model for causal inference problems and that these methods can be applied for identifying subgroups in which treatment effects are homogeneous. Analogies and links with the literature on machine learning methods, missing data, and imputation are drawn. A difference least absolute shrinkage and selection operator algorithm is defined, along with its multiple imputation analogs. The procedures are illustrated using a well‐known right‐heart catheterization dataset. Copyright © 2015 John Wiley & Sons, Ltd. 相似文献

19.

Use of multiple imputation in the epidemiologic literature

Klebanoff MA Cole SR 《American journal of epidemiology》2008,168(4):355-357

相似文献

20.

Imputing variance estimates do not alter the conclusions of a meta-analysis with continuous outcomes: a case study of changes in renal function after living kidney donation

Thiessen Philbrook H Barrowman N Garg AX 《Journal of clinical epidemiology》2007,60(3):228-240

OBJECTIVE: To assess how different imputation methods used to account for missing variance data in primary studies influence tests of heterogeneity and pooled results from a meta-analysis with continuous outcomes. STUDY DESIGN AND SETTING: Point and variance estimates for changes in serum creatinine, glomerular filtration rate, systolic blood pressure, and diastolic blood pressure were variably reported among 48 primary longitudinal studies of living kidney donors (71%-78% of point estimates were reported, 8%-13% of variance data were reported). We compared the results of meta-analysis, which either were restricted to available data or used four methods to impute missing variance data. These methods used reported P-values, reported nonparametric summaries, results from other similar studies using multiple imputation, or results from estimated correlation coefficients. RESULTS: Significant heterogeneity was present in all four outcomes regardless of the imputation methods applied. The random effects point estimates and 95% confidence intervals varied little across imputation methods, and the differences were not clinically significant. CONCLUSIONS: Different methods to impute the variance data in the primary studies did not alter the conclusions from this meta-analysis of continuous outcomes. Such reproducibility increases confidence in the results. However, as with most meta-analyses, there was no gold standard of truth, and results must be interpreted judiciously. The generalization of these findings to other meta-analyses, which differ in outcomes, missing data, or between-study heterogeneity, requires further consideration. 相似文献