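Study on Semantic Relations of Traditional Chinese Medicine Clinical Terms Based on Clustering
Cite this article: Chen Jing, Liu Liangliang, Zhang Xiaoru, Cao Xinyu. Study on Semantic Relations of Traditional Chinese Medicine Clinical Terms Based on Clustering [J]. World Science and Technology-Modernization of Traditional Chinese Medicine, 2017, 19(12): 1949-1953.
Authors: Chen Jing  Liu Liangliang  Zhang Xiaoru  Cao Xinyu
Affiliations: 1. School of Computer Science, Jiangsu University of Science and Technology, Zhenjiang 212000; 2. School of Statistics and Information, Shanghai University of International Business and Economics, Shanghai 201620; 1. School of Computer Science, Jiangsu University of Science and Technology, Zhenjiang 212000; 3. Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing 100190
Funding: National Natural Science Foundation of China, Young Scientists Fund: Research on the Construction of an Ontology-Based TCM Diagnosis and Treatment Information Model (81403281), principal investigator: Cao Xinyu; National Key Technology R&D Program of the Ministry of Science and Technology: Research on Real-World Clinical Research Methodology for "Disease-Symptom Combination" in TCM (2013BAI02B10), principal investigator: Xie Qi; Fundamental Research Funds of the China Academy of Chinese Medical Sciences: Research on the Development Strategy and Construction Plan of the National TCM Data Center (ZZ060815), principal investigator: Wang Bin; National Key R&D Program project: Research on Multi-dimensional, Multi-layer, Multi-state TCM Knowledge Graphs and Spatio-temporal Evolution Models (2017YFB1002302), principal investigator: Li Zongyou.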
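Abstract: Building a clinical ontology for traditional Chinese medicine (TCM) is an important part of the internationalization of TCM. Research on clinical term entities has already produced substantial results, but research on the semantic relations between them remains insufficient. This paper proposes a method that combines clustering with syntactic patterns to study the semantic relations between TCM clinical concept entities. Feature words surrounding the entities are extracted, and K-means is used as the clustering algorithm for a first round of clustering over the whole corpus. Building on the first-round result, the longest common subsequence is extracted within each cluster and generalized into a syntactic pattern ("pattern" for short). Using the manually adjusted patterns, the method automatically determines, for every sentence in the corpus, the pattern that best expresses its semantic relation. A second round of clustering then uses these patterns as features, and its output is the final clustering result. Experiments show that the method classifies the semantic relations present in the corpus with an accuracy of 88.23%.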
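The first clustering round described in the abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption for illustration and is not taken from the paper: the toy English tokens, the window size, the function names (`context_features`, `kmeans`, and so on), and the deterministic farthest-point initialization used in place of random seeding.

```python
from collections import Counter

def context_features(tokens, e1, e2, window=2):
    """Count words within `window` tokens of either entity (entities excluded)."""
    feats = Counter()
    for i, tok in enumerate(tokens):
        if tok in (e1, e2):
            lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
            for w in tokens[lo:hi]:
                if w not in (e1, e2):
                    feats[w] += 1
    return feats

def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def init_centroids(vectors, k):
    """Assumed deterministic seeding: start at the first vector, then
    repeatedly add the vector farthest from all chosen centroids."""
    centroids = [list(vectors[0])]
    while len(centroids) < k:
        far = max(vectors, key=lambda v: min(sq_dist(v, c) for c in centroids))
        centroids.append(list(far))
    return centroids

def kmeans(vectors, k, iters=20):
    """Plain Lloyd-style K-means; returns one cluster label per vector."""
    centroids = init_centroids(vectors, k)
    labels = [0] * len(vectors)
    for _ in range(iters):
        # Assignment step: nearest centroid for every vector.
        labels = [min(range(k), key=lambda c: sq_dist(v, centroids[c]))
                  for v in vectors]
        # Update step: each centroid becomes the mean of its members.
        for c in range(k):
            members = [v for v, l in zip(vectors, labels) if l == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels

# Hypothetical toy corpus: (tokens, entity 1, entity 2).
sentences = [
    (["ginseng", "can", "treat", "fatigue"], "ginseng", "fatigue"),
    (["ephedra", "can", "treat", "cough"], "ephedra", "cough"),
    (["cough", "is", "caused", "by", "wind-cold"], "cough", "wind-cold"),
    (["fatigue", "is", "caused", "by", "qi-deficiency"], "fatigue", "qi-deficiency"),
]
feature_bags = [context_features(t, a, b) for t, a, b in sentences]
vocab = sorted(set().union(*feature_bags))
vectors = [[bag.get(w, 0) for w in vocab] for bag in feature_bags]
labels = kmeans(vectors, k=2)
print(labels)
```

On this toy input the two "treat" sentences fall into one cluster and the two "caused by" sentences into the other. A real corpus would need Chinese word segmentation and a choice of `k`, neither of which this record specifies.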

Keywords: Chinese medicine  semantic relation  clustering  syntactic pattern
Received: 2017/9/19
Revised: 2017/11/20

Study on Clinical Term Semantic Relationship of Traditional Chinese Medicine Based on Clustering
Chen Jing, Liu Liangliang, Zhang Xiaoru and Cao Xinyu. Study on Clinical Term Semantic Relationship of Traditional Chinese Medicine Based on Clustering [J]. World Science and Technology-Modernization of Traditional Chinese Medicine, 2017, 19(12): 1949-1953.
Authors:Chen Jing  Liu Liangliang  Zhang Xiaoru and Cao Xinyu
Institution: 1. School of Computer Science and Engineering, Jiangsu University of Science and Technology, Zhenjiang 212000, China; 2. School of Statistics and Information, Shanghai University of International Business and Economics, Shanghai 201620, China; 1. School of Computer Science and Engineering, Jiangsu University of Science and Technology, Zhenjiang 212000, China; 3. Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing 100190, China
Abstract: The construction of a clinical ontology of traditional Chinese medicine (TCM) is an important component of TCM internationalization. The study of clinical term entities has been quite successful, but research on the semantic relations between them is still lacking. This paper presents a method that combines clustering with syntax patterns to study the semantic relations between TCM conceptual entities. Feature words around the entities were extracted, and K-means was used as the clustering algorithm to perform a first round of clustering over the whole corpus. Based on the results of the first round, the longest common subsequence was extracted within each cluster and generalized as a syntax pattern. Using the patterns after manual adjustment, the syntax pattern that best expresses the semantic relation of each sentence in the corpus was determined automatically, and a second round of clustering used these syntax patterns as features. The result of the second round was the final clustering result. The experimental results showed that the accuracy of this method was 88.23% for the classification of semantic relations in the corpus.
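The pattern-extraction step in the abstract can be sketched as follows, under stated assumptions. The entity placeholders `<E1>` and `<E2>`, the toy English sentences, the function names, and the rule "pick the longest matching pattern" are all hypothetical choices made for illustration. The abstract only says that the longest common subsequence is generalized into a pattern and that the best-fitting pattern is chosen for each sentence.

```python
def generalize(tokens, e1, e2):
    """Replace the two entity mentions with placeholders (assumed scheme)."""
    return ["<E1>" if t == e1 else "<E2>" if t == e2 else t for t in tokens]

def lcs(a, b):
    """Longest common subsequence of two token lists (dynamic programming)."""
    m, n = len(a), len(b)
    # dp[i][j] = LCS length of a[i:] and b[j:]
    dp = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(m - 1, -1, -1):
        for j in range(n - 1, -1, -1):
            if a[i] == b[j]:
                dp[i][j] = dp[i + 1][j + 1] + 1
            else:
                dp[i][j] = max(dp[i + 1][j], dp[i][j + 1])
    # Walk the table forward to recover one LCS.
    out, i, j = [], 0, 0
    while i < m and j < n:
        if a[i] == b[j]:
            out.append(a[i])
            i += 1
            j += 1
        elif dp[i + 1][j] >= dp[i][j + 1]:
            i += 1
        else:
            j += 1
    return out

def is_subsequence(pattern, tokens):
    """True if `pattern` occurs in `tokens` in order, gaps allowed."""
    it = iter(tokens)
    return all(p in it for p in pattern)

def best_pattern(tokens, patterns):
    """Assumed selection rule: the longest pattern the sentence matches."""
    matches = [p for p in patterns if is_subsequence(p, tokens)]
    return max(matches, key=len) if matches else None

# Two hypothetical sentences from the same first-round cluster.
s1 = generalize(["ginseng", "can", "effectively", "treat", "fatigue"],
                "ginseng", "fatigue")
s2 = generalize(["ephedra", "can", "treat", "chronic", "cough"],
                "ephedra", "cough")
pattern = lcs(s1, s2)
print(pattern)

# Assign a pattern to a new sentence; the pattern becomes its feature
# for the second clustering round.
new_sentence = generalize(["licorice", "can", "also", "treat", "cough"],
                          "licorice", "cough")
chosen = best_pattern(new_sentence, [pattern, ["<E1>", "<E2>"]])
print(chosen)
```

In the method described above the patterns are adjusted by hand before the second round. This sketch skips that step and passes the raw extracted pattern straight to `best_pattern`.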
Keywords:Chinese medicine  semantic relation  clustering  syntax pattern