首页 | 本学科首页   官方微博 | 高级检索  
检索        

冠心病风险因素识别及其预测模型构建
引用本文:李婕,向菲.冠心病风险因素识别及其预测模型构建[J].中华医学图书情报杂志,2020,29(6):7-13.
作者姓名:李婕  向菲
作者单位:华中科技大学同济医学院医药卫生管理学院,湖北 武汉 430030
摘    要:目的:利用逻辑回归分析识别冠心病发作的危险因素,使用常见机器学习算法构建冠心病风险预测模型,为冠心病的早期预防与筛查提供理论参考。方法:通过对Kaggle发布的冠心病数据进行预处理和特征筛选后进行逻辑回归分析识别主要危险因素,选用逻辑回归、支持向量机、线性判别分析、决策树和随机森林5种常见机器学习算法进行冠心病发病预测。结果:性别、年龄、平均每日吸烟量、总胆固醇水平、收缩压和血糖水平是10年内冠心病发作的主要危险因素。选用的5种机器学习算法准确率与稳定性良好。与基于统计的线性判别分析相比,决策树与随机森林并未表现出明显的优越性。结论:机器学习技术适用于冠心病发作风险的预测,能够为冠心病的防控提供参考依据。

关 键 词:冠心病  风险预测模型  多因素逻辑回归分析  机器学习  随机森林
收稿时间:2020/5/2 0:00:00

Identification of risk factors for coronary heart disease and establishment of their prediction model
LI Jie,XIANG Fei.Identification of risk factors for coronary heart disease and establishment of their prediction model[J].Chinese Journal of Medical Library and Information Science,2020,29(6):7-13.
Authors:LI Jie  XIANG Fei
Institution:Central China University of Science and Technology Tongji Medical College Medical and Health Management School, Wuhan 430030, Hubei Province, China
Abstract:Objective To identify the risk factors for coronary heart disease by logistic regression analysis and to establish their prediction model using the common machine learning algorithms in order to provide reference for the early prevention and diagnosis of coronary heart disease. Methods The risk factors for coronary heart disease were identified by pre-processing the Kaggle-covered data on coronary heart disease and analyzed by logistic regression analysis. The onset of coronary heart disease was predicted using the 5 common machine learning algorithms respectively (logistic regression analysis, support vector machine, linear discrimination analysis, decision tree and random forest).Results Gender, age, average number of daily smoked cigarettes, total cholesterol level, systolic blood pressure and blood glucose level were the risk factors for coronary heart disease. The accuracy and stability of the 5 common machine learning algorithms were good. Decision tree and random forest were significantly advantageous over the linear discrimination analysis in identifying the risk factors for coronary heart disease. Conclusion Machine learning can predict the onset of coronary heart disease and provide reference for its prevention and control.
Keywords:Coronary heart disease  Risk prediction model  multivariate logistic regression analysis  Machine learning  Random forest
点击此处可从《中华医学图书情报杂志》浏览原始摘要信息
点击此处可从《中华医学图书情报杂志》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号