首页 | 本学科首页   官方微博 | 高级检索  
检索        

国外生物医学文本语料库分类及特点研究
引用本文:晏归来,安新颖,范少萍,周永称.国外生物医学文本语料库分类及特点研究[J].医学信息学杂志,2018,39(10):74-80.
作者姓名:晏归来  安新颖  范少萍  周永称
作者单位:中国医学科学院/北京协和医学院医学信息研究所 北京 100020,中国医学科学院/北京协和医学院医学信息研究所 北京 100020,中国医学科学院/北京协和医学院医学信息研究所 北京 100020,中国医学科学院/北京协和医学院医学信息研究所 北京 100020
基金项目:国家重点研发计划“精准医学文本知识网络构建”子课题“精准医学文本语料库构建”(项目编号:2016YFC0901902-2)。
摘    要:通过梳理国外31个生物医学文本语料库标注内容,根据语料库标注实体类型,参照UMLS语义类型将其划分为6大类。总结语料库在语义类型、数据源等方面特点,阐述生物医学文本语料库构建流程及关键步骤,以期为我国生物医学文本语料库相关研究奠定基础。

关 键 词:生物医学文本语料库  语义类型  语义关系
收稿时间:2018/9/13 0:00:00

Study on the Categories and Characteristics of Overseas Biomedical Text Corpuses
YAN Guilai,AN Xinying,FAN Shaoping and ZHOU Yongcheng.Study on the Categories and Characteristics of Overseas Biomedical Text Corpuses[J].Journal of Medical Informatics,2018,39(10):74-80.
Authors:YAN Guilai  AN Xinying  FAN Shaoping and ZHOU Yongcheng
Institution:Institute of Medical Information, Chinese Academy of Medical Sciences/Peking Union Medical College, Beijing 100020, China,Institute of Medical Information, Chinese Academy of Medical Sciences/Peking Union Medical College, Beijing 100020, China,Institute of Medical Information, Chinese Academy of Medical Sciences/Peking Union Medical College, Beijing 100020, China and Institute of Medical Information, Chinese Academy of Medical Sciences/Peking Union Medical College, Beijing 100020, China
Abstract:The paper divides the corpus into six categories by analyzing annotated contents of the 31 overseas biomedical text corpuses and referring to UMLS semantic type according to the annotated entity types of the corpuses. It summarizes characteristics of the corpus in the aspects like semantic type and data source,expatiates on the building process and major steps of biomedical text corpus in the hope of laying down the foundation based on which related studies on China''s biomedical text corpuses will be carried out.
Keywords:biomedical text corpus  semantic type  semantic relations
点击此处可从《医学信息学杂志》浏览原始摘要信息
点击此处可从《医学信息学杂志》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号