共查询到19条相似文献,搜索用时 93 毫秒
1.
2.
3.
无金标准情况下诊断试验的评价方法 总被引:6,自引:5,他引:1
目的 探讨无金标准情况下诊断试验的评价方法及患病率的估计。方法 从Beyesian参数估计原理出发,用Gibbs抽样法完成参数后验密度的估计。结果 将有金标准的资料作为无金标准处理,所得参数估计值前后吻合,效果良好,并可得到参数的后验密度分布图。结论 上述方法可有效地估计无金标准情况下诊断试验的评价参数。 相似文献
4.
5.
[目的]探讨无金标准条件下诊断试验准确性评价的潜分类方法。[方法]介绍潜分类模型在无金标准诊断试验评价中的原理、试验设计和评价方法,用两人群两试验实例说明潜分类方法的应用。[结果]对于二分类反应变量,假设条件独立和试验准确性稳定,至少需要两个人群两种试验方法或一个人群三种试验方法才能满足模型可识别性并用于频率学派统计评价;贝叶斯统计不需满足模型的可识别性,但需引入先验分布,且存在先验依赖性。[结论]潜分类方法可用于无金标准时的诊断试验评价,但要选择适合的试验设计和评价方法。 相似文献
6.
7.
8.
目的:分析广州市新型冠状病毒肺炎(COVID-19)疫情防控不同阶段中,密切接触者新型冠状病毒(新冠病毒)核酸检测的灵敏度和特异度,为优化疫情防控策略提供科学依据。方法:2020年2月21日至9月22日广州市COVID-19病例的密切接触者20 348例,均已接受新冠病毒核酸检测。针对疫情防控的不同阶段,比较核酸检测的... 相似文献
9.
10.
目的建立一种基于灵敏度(SEN)和特异度(SPE)任意赋权的广义标准化诊断符合率统计方法。方法广义标准化诊断符合率(e’)的构造满足以下两个原则:(1)灵敏度和特异度的权重(w)之和为1;(2)满足特殊性:当灵敏度和特异度等权时,广义标准化诊断符合率等于标准化诊断符合率(e),即有e’=e。所构造广义标准化诊断符合率为:e1’=[(n1+m1)·2w·SEN1+(m2+n2)·2(1-w)·SPE1]/(n+m),e2’=[(n1+m1)·2w·SEN2+(m2+n2)·2(1-w)·SPE]/(2n+m),(0≤w≤1)。根据中心极限定理,推导出e’的标准误和两个e’比较的Z统计推断方法,进一步推导出权重w的变化对检验统计量Z的影响。结果所构造的广义标准化诊断符合率满足上述构造两个原则。结论本研究所建立的广义标准化诊断符合率方法解决了应用中对灵敏度和特异度有不同赋权要求的问题,为诊断试验评价提供了新的手段。 相似文献
11.
Ransohoff DF 《Journal of clinical epidemiology》2002,55(12):172-1182
The field of clinical research conducted to evaluate diagnostic tests has evolved substantially over the last 25 years. This article discusses three current challenges and opportunities in conducting such research: the limitations of the “traditional focus” of clinical epidemiology in evaluating tests; how different decision makers use data produced by clinical epidemiology research, and what those uses mean for future research; and special challenges and opportunities in evaluating molecular-based tests. 相似文献
12.
Biases in the assessment of diagnostic tests 总被引:15,自引:0,他引:15
C B Begg 《Statistics in medicine》1987,6(4):411-423
Diagnostic tests are traditionally characterized by simple measures of efficacy such as the sensitivity and the specificity. These measures, though widely recognized and easy to understand, are subject to definitional arbitrariness. Moreover, studies constructed to estimate the sensitivity and specificity are susceptible to a variety of biases. In this paper the various potential problems are described with reference to examples from the diagnostic literature. These difficulties have implications for the design of diagnostic test evaluations, and the choice of suitable measures of test efficacy. 相似文献
13.
Intermediate test results often occur with diagnostic tests. When assessing diagnostic accuracy, it is important to properly report and account for these results. In the literature, these results are commonly discarded prior to analysis or treated as either a positive or a negative result. Although such adjustments allow sensitivity and specificity to be computed in the standard way, these forced decisions limit the interpretability and usefulness of the results. Estimation of diagnostic accuracy is further complicated when tests are evaluated without a gold standard. Although traditional latent class modeling can be readily applied to analyze these data and account for intermediate results, these models assume that tests are independent conditional on the true disease status, which is rarely valid in practice. We extend both the log‐linear latent class model and the probit latent class model to accommodate the conditional dependence among tests while taking the intermediate results into consideration. We illustrate our methods using a simulation study and a published medical study on the detection of epileptiform activity in the brain. Copyright © 2012 John Wiley & Sons, Ltd. 相似文献
14.
Sensitivity and specificity have clear definitions when there is a single test for one disease, and the test is either positive or negative. This paper presents a unified appraoch for obtaining posterior probabilities (predictive values) when there are more than two test outcomes and/or more than one disease state. In these cases, sensitivity and specificity do not have clear definitions. Three examples from the literature demonstrate how this approach simplifies the presentation of Bayesian revision of prior probabilities. Use of proper care in data collection for the purpose of estimating conditional probabilities can avoid assumptions of statistical independence. 相似文献
15.
Seyed Reza Jafarzadeh Wesley O. Johnson Jessica M. Utts Ian A. Gardner 《Statistics in medicine》2010,29(20):2090-2106
The receiver operating characteristic (ROC) curve is commonly used for evaluating the discriminatory ability of a biomarker. Measurements for a diagnostic test may be subject to an analytic limit of detection leading to immeasurable or unreportable test results. Ignoring the scores that are beyond the limit of detection of a test leads to a biased assessment of its discriminatory ability, as reflected by indices such as the associated area under the curve (AUC). We propose a Bayesian approach for the estimation of the ROC curve and its AUC for a test with a limit of detection in the absence of gold standard based on assumptions of normally and gamma‐distributed data. The methods are evaluated in simulation studies, and a truncated gamma model with a point mass is used to evaluate quantitative real‐time polymerase chain reaction data for bovine Johne's disease (paratuberculosis). Simulations indicated that estimates of diagnostic accuracy and AUC were good even for relatively small sample sizes (n=200). Exceptions were when there was a high per cent of unquantifiable results (60 per cent) or when AUC was ?0.6, which indicated a marked overlap between the outcomes in infected and non‐infected populations. Copyright © 2010 John Wiley & Sons, Ltd. 相似文献
16.
Receiver operating characteristic (ROC) curves are commonly used to summarize the classification accuracy of diagnostic tests. It is not uncommon in medical practice that multiple diagnostic tests are routinely performed or multiple disease markers are available for the same individuals. When the true disease status is verified by a gold standard (GS) test, a variety of methods have been proposed to combine such potential correlated tests to increase the accuracy of disease diagnosis. In this article, we propose a method of combining multiple diagnostic tests in the absence of a GS. We assume that the test values and their classification accuracies are dependent on covariates. Simulation studies are performed to examine the performance of the combination method. The proposed method is applied to data from a population-based aging study to compare the accuracy of three screening tests for kidney function and to estimate the prevalence of moderate kidney impairment. 相似文献
17.
We explore the estimation of sensitivity and specificity of diagnostic tests when the true disease state is unknown. Instrumental variables which subdivide the patient population are used. A logistic model, relating these instrumental variables to the (unknown) true disease state is proposed. It is shown that this procedure allows the goodness-of-fit to the resulting model to be tested. 相似文献
18.
Estimating disease prevalence in the absence of a gold standard 总被引:6,自引:0,他引:6
When estimating disease prevalence, it is not uncommon to have data from conditionally dependent diagnostic tests. In such a situation, the estimation of prevalence is difficult if none of the tests is considered to be a gold standard. In this paper we develop a Bayesian approach to estimating disease prevalence based on the results of two diagnostic tests, allowing for the possibility that the tests are conditionally dependent, but not conditioning on any particular dependence structure. This involves the construction of four models with various forms of conditional dependence and uses Bayesian model averaging, enabled by reversible jump MCMC, to obtain an overall estimate of the prevalence. This methodology is demonstrated using a study on the prevalence of Strongyloides infection. 相似文献
19.
《Annals of epidemiology》2014,24(7):527-531
PurposeWe evaluated the extent to which use of a hypothesized imperfect gold standard, the Composite International Diagnostic Interview (CIDI), biases the estimates of diagnostic accuracy of the Patient Health Questionnaire-9 (PHQ-9). We also evaluate how statistical correction can be used to address this bias.MethodsThe study was conducted among 926 adults where structured interviews were conducted to collect information about participants' current major depressive disorder using PHQ-9 and CIDI instruments. First, we evaluated the relative psychometric properties of PHQ-9 using CIDI as a gold standard. Next, we used a Bayesian latent class model to correct for the bias.ResultsIn comparison with CIDI, the relative sensitivity and specificity of the PHQ-9 for detecting major depressive disorder at a cut point of 10 or more were 53.1% (95% confidence interval: 45.4%–60.8%) and 77.5% (95% confidence interval, 74.5%–80.5%), respectively. Using a Bayesian latent class model to correct for the bias arising from the use of an imperfect gold standard increased the sensitivity and specificity of PHQ-9 to 79.8% (95% Bayesian credible interval, 64.9%–90.8%) and 79.1% (95% Bayesian credible interval, 74.7%–83.7%), respectively.ConclusionsOur results provided evidence that assessing diagnostic validity of mental health screening instrument, where application of a gold standard might not be available, can be accomplished by using appropriate statistical methods. 相似文献