首页 | 本学科首页   官方微博 | 高级检索  
     

基于极值随机森林的慢性胃炎中医证候分类
引用本文:颜建军,胡宗杰,刘国萍,王忆勤,付晶晶,郭睿,钱鹏. 基于极值随机森林的慢性胃炎中医证候分类[J]. 医学教育探索, 2017, 43(5): 698-703
作者姓名:颜建军  胡宗杰  刘国萍  王忆勤  付晶晶  郭睿  钱鹏
作者单位:华东理工大学机械与动力工程学院, 上海 200237,华东理工大学机械与动力工程学院, 上海 200237,上海中医药大学四诊信息综合实验室, 上海 201203,上海中医药大学四诊信息综合实验室, 上海 201203,上海中医药大学四诊信息综合实验室, 上海 201203,上海中医药大学四诊信息综合实验室, 上海 201203;上海中医药大学交叉科学研究院, 上海 201203,上海中医药大学四诊信息综合实验室, 上海 201203
基金项目:国家自然科学基金(81270050,81302913,30901897,81173199)
摘    要:大多数机器学习算法能得到较好的分类效果,但模型却无法解释;而随机森林等模型有良好的可解释性,却无法处理中医数据中兼证的情况。本文利用极值随机森林算法对慢性胃炎中医数据进行证候分类研究,其中决策树的叶节点能输出多个标签,通过加权机制综合分量来处理兼证问题。与已有多标记学习算法和C4.5、CART等基于决策树的算法进行比较,实验结果表明,极值随机森林算法无论在6个证型的分类准确率上,还是在多标记评价指标上都具有更好的效果,而且模型中得到的规则基本符合中医理论。

关 键 词:证候分类  极值随机森林  可解释性  慢性胃炎  决策树
收稿时间:2016-12-30

Syndrome Classification of Chronic Gastritis Based on Extremely Randomized Forest Algorithm
YAN Jian-jun,HU Zong-jie,LIU Guo-ping,WANG Yi-qin,FU Jing-jing,GUO Rui and QIAN Peng. Syndrome Classification of Chronic Gastritis Based on Extremely Randomized Forest Algorithm[J]. Researches in Medical Education, 2017, 43(5): 698-703
Authors:YAN Jian-jun  HU Zong-jie  LIU Guo-ping  WANG Yi-qin  FU Jing-jing  GUO Rui  QIAN Peng
Affiliation:School of Mechanical and Power Engineering, East China University of Science and Technology, Shanghai 200237, China,School of Mechanical and Power Engineering, East China University of Science and Technology, Shanghai 200237, China,Laboratory of Information Access and Synthesis of Traditional Chinese Medicine Four Diagnosis, Shanghai University of Traditional Chinese Medicine, Shanghai 201203, China,Laboratory of Information Access and Synthesis of Traditional Chinese Medicine Four Diagnosis, Shanghai University of Traditional Chinese Medicine, Shanghai 201203, China,Laboratory of Information Access and Synthesis of Traditional Chinese Medicine Four Diagnosis, Shanghai University of Traditional Chinese Medicine, Shanghai 201203, China,Laboratory of Information Access and Synthesis of Traditional Chinese Medicine Four Diagnosis, Shanghai University of Traditional Chinese Medicine, Shanghai 201203, China;Institute of Interdisciplinary Research Complex, Shanghai University of Traditional Chinese Medicine, Shanghai 201203, China and Laboratory of Information Access and Synthesis of Traditional Chinese Medicine Four Diagnosis, Shanghai University of Traditional Chinese Medicine, Shanghai 201203, China
Abstract:Syndrome differentiation and treatment,which is the essence of traditional Chinese medicine (TCM),contain abundant rules.The majority of machine learning algorithms can obtain good classification accuracy,but these models are difficult to be explained.The models established by random forests have great interpretability,while these models cannot deal with multi-syndrome that patients may simultaneously have more than one syndrome in TCM.In this paper,syndrome classification for Chronic Gastritis (CG) is researched by using extremely randomized forest (ERF) algorithm,and compared with state-of-the-art multi-label algorithms and the tree-based algorithms (such as C4.5,CART).The experimental results show that ERF algorithm has better performance than other algorithms in the classification accuracy of every label and the six evaluation metrics of multi-label learning.The rules obtained in the model are basically in accord with TCM theory.
Keywords:syndrome classification  extremely randomized forest  interpretability  chronic gastritis  decision tree
点击此处可从《医学教育探索》浏览原始摘要信息
点击此处可从《医学教育探索》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号