首页 | 本学科首页   官方微博 | 高级检索  
     

儿童先天性心脏病超声心动图报告与个体风险的相关性分析
引用本文:施雅慧李作峰常才张晓艳. 儿童先天性心脏病超声心动图报告与个体风险的相关性分析[J]. 复旦学报(医学版), 2018, 45(2): 151. DOI: 10.3969/j.issn.1672-8467.2018.02.002
作者姓名:施雅慧李作峰常才张晓艳
作者单位:1同济大学生命科学与技术学院 上海 200092; 2飞利浦中国研究院 上海 200233;3复旦大学附属肿瘤医院超声医学科 上海 200032
基金项目:国家自然科学基金(81573023)
摘    要: 目的 分析儿童先天性心脏病超声心动图检查报告中文字描述信息与临床风险评估结果的相关性,并验证文本挖掘方法在此类分析中的可行性和应用价值。方法 回顾性分析1 042例先天性心脏病患儿的彩色超声心动图报告,通过自然语言处理(natural language processing, NLP)技术进行特征提取与筛选,以患儿的风险等级为预测目标,借助机器学习算法构建决策树,推测出临床医师解读心脏超声报告时可能的决策路径。通过50次基于分层抽样的10折交叉验证评价模型的风险等级预测能力,进而评估报告在临床决策中的作用和价值。结果 使用自动生成的全部三元语法(3-gram)或基于领域知识筛选后的特征,所训练的风险等级预测模型分别达到32.82%和48.57%的分类准确率,平均绝对误差(normalized mean absolute error,NMAE)分别为0.33和0.25。结论 超声心动图报告中的文字部分,尤其是描述疾病征象的常用术语,能够在约75%的水平上反映先天性心脏病患儿的严重程度,为临床医师诊疗决策提供重要依据。

关 键 词:超声心动图  先天性心脏病  自然语言处理  机器学习  儿童
收稿时间:2017-01-03

Correlation between echocardiography report narratives and the risk level of congenital heart disease in children
SHI Ya-hui,LI Zuo-feng,CHANG Cai,ZHANG Xiao-yan. Correlation between echocardiography report narratives and the risk level of congenital heart disease in children[J]. Fudan University Journal of Medical Sciences, 2018, 45(2): 151. DOI: 10.3969/j.issn.1672-8467.2018.02.002
Authors:SHI Ya-hui  LI Zuo-feng  CHANG Cai  ZHANG Xiao-yan
Affiliation:1 School of Life Science and Technology, Tongji University, Shanghai 200092, China; 2 Philips Research China, Shanghai 200233, China; 3 Department of Ultrasound, Shanghai Cancer Center, Fudan University, Shanghai 200032, China
Abstract:Objective To analyze the correlation between echocardiography report narratives and the risk level of congenital heart disease in children, and to validate the feasibility and value of employing text mining technique in such task. Methods Echocardiography reports were retrospectively analyzed for 1 042 children with congenital heart disease. We adopted natural language processing (NLP) technique to generate features from the clinical narratives for machine learning algorithms. Decision trees were trained to predict the risk level of patients. Model performance was evaluated by means of classification accuracy and normalized mean absolute error (NMAE), which were averaged among 50 rounds of stratified 10-fold cross validation. By analyzing branches of the decision tree, we formulated the possible decision path of a clinician and identifyied the key information in the clinical narratives. Results Compared with the auto-generated 3-grams,the selected features yielded a better performance. After feature selection, the predict accuracy was improved from 32.82% to 48.57%, while the NMAE reduced from 0.33 to 0.25.ConclusionsBased on echocardiography report narratives, the risk levels of congenital heart disease in children can be evaluated by our model with an accuracy level of 75%.Echocardiographic terms that describe the lesion provide significant information to support the clinical decision making.
Keywords:echocardiography  congenital heart disease  natural language processing  machine learning  children
点击此处可从《复旦学报(医学版)》浏览原始摘要信息
点击此处可从《复旦学报(医学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号