首页 | 本学科首页   官方微博 | 高级检索  
检索        


Enhancing HMM-based biomedical named entity recognition by studying special phenomena
Authors:Zhang Jie  Shen Dan  Zhou Guodong  Su Jian  Tan Chew-Lim
Institution:Institute for Infocomm Research, 21 Heng Mui Keng Terrace, Singapore 119613, Singapore. zhangjie@i2r.a-star.edu.sg
Abstract:The purpose of this research is to enhance an HMM-based named entity recognizer in the biomedical domain. First, we analyze the characteristics of biomedical named entities. Then, we propose a rich set of features, including orthographic, morphological, part-of-speech, and semantic trigger features. All these features are integrated via a Hidden Markov Model with back-off modeling. Furthermore, we propose a method for biomedical abbreviation recognition and two methods for cascaded named entity recognition. Evaluation on the GENIA V3.02 and V1.1 shows that our system achieves 66.5 and 62.5 F-measure, respectively, and outperforms the previous best published system by 8.1 F-measure on the same experimental setting. The major contribution of this paper lies in its rich feature set specially designed for biomedical domain and the effective methods for abbreviation and cascaded named entity recognition. To our best knowledge, our system is the first one that copes with the cascaded phenomena.
Keywords:Biomedical named entity recognition  Cascaded named entity recognition  Abbreviation recognition  HMM
本文献已被 ScienceDirect PubMed 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号