首页 | 本学科首页   官方微博 | 高级检索  
检索        

机器学习算法在体检人群糖尿病风险预测中的应用
引用本文:欧阳平,李小溪,冷芬,赖晓英,张慧明,严传杰,王楚琼,白雨,邢志强,刘旭涛,缪苗,邓侃,李文源.机器学习算法在体检人群糖尿病风险预测中的应用[J].中华疾病控制杂志,2021,25(7):849-853,868.
作者姓名:欧阳平  李小溪  冷芬  赖晓英  张慧明  严传杰  王楚琼  白雨  邢志强  刘旭涛  缪苗  邓侃  李文源
作者单位:510515广州,南方医科大学南方医院健康管理科;100192北京,北京大数医达科技有限公司;510515广州,南方医科大学南方医院院办
摘    要:目的 探索Logistic回归分析模型和LightGBM(light gradient boosting machine)算法对体检人群未来罹患糖尿病的预测效果及影响因素.方法 选取2003年8月—2019年4月在南方医院健康管理中心多次进行团体参检的36 292例非糖尿病人员,分层随机选取70%样本,以首次体检的性别...

关 键 词:糖尿病  体检  Logistic回归分析模型  LightGBM模型
收稿时间:2020-07-16

Application of machine learning algorithm in diabetes risk prediction of physical examination population
Institution:1.Department of Health Management Section, Nanfang Hospital, Southern Medical University, Guangzhou 510515, China2.Beijing Dudu Yida Technology Co. Ltd, Beijing 100192, China3.Hospital Office, Nanfang Hospital, Southern Medical University, Guangzhou 510515, China
Abstract:  Objective  To explore the predictive effect and influencing factors of Logistic regression analysis model and Light GBM algorithm on the development of diabetes in the physical examination population.  Methods  A total of 36 292 subjects without diabetes were selected from the Health Management Center of Nanfang Hospital from August 2003 to April 2019. We ramdomly selected 70% samples by stratification to construct trainingset. The independent variables were 34 indicators including gender, age, body mass index (BMI), waist circumference, heart rate, systolic blood pressure, diastolic blood pressure, and fasting blood glucose in the first physical examination. We defined the dependent variable as developing diabetes within 5 years from the first physical examination.Logistic regression analysis model and LightGBM (light gradient boosting machine) algorithm was uesd to establish diabetes prediction models, respectively. The prediction model was applied to the remaining 30% samples and the area under the receiver operating characteristic (ROC) curve (AUC) was used to evaluate the prediction effect.  Results  The AUC of the Logistic regression algorithm model was 0.906, while the AUC of the LightGBM analysis model was 0.910. At the optimal critical point, the sensitivity and specificity of the Logistic regression analysis model were 81.5% and 84.3%, respectively. And the sensitivity and specificity of the LightGBM analysis model were 81.6% and 85.2%, respectively.  Conclusion  The Logistic regression algorithm model and LightGBM algorithm model have good prediction effect on the development of diabetes in the physical examination population.
Keywords:
本文献已被 万方数据 等数据库收录!
点击此处可从《中华疾病控制杂志》浏览原始摘要信息
点击此处可从《中华疾病控制杂志》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号