DeepADEMiner: a deep learning pharmacovigilance pipeline for extraction and normalization of adverse drug event mentions on Twitter

Authors:	Arjun Magge, Elena Tutubalina, Zulfat Miftahutdinov, Ilseyar Alimova, Anne Dirkson, Suzan Verberne, Davy Weissenbacher, Graciela Gonzalez-Hernandez

Affiliation:	1. DBEI, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA; 2. Kazan Federal University, Kazan, Russia; 3. LIACS, Leiden University, Leiden, Netherlands

Abstract:	Objective: Research on pharmacovigilance from social media data has focused on mining adverse drug events (ADEs) using annotated datasets, with publications generally focusing on 1 of 3 tasks: ADE classification, named entity recognition for identifying the span of ADE mentions, and ADE mention normalization to standardized terminologies. While the common goal of such systems is to detect ADE signals that can be used to inform public policy, progress toward this goal has been impeded largely by the scarcity of end-to-end solutions for large-scale analysis of social media reports for different drugs.
Materials and Methods: We present a dataset for training and evaluation of ADE pipelines where the ADE distribution is closer to the average ‘natural balance’, with ADEs present in about 7% of the tweets. The deep learning architecture involves an ADE extraction pipeline with individual components for all 3 tasks.
Results: The system presented achieved state-of-the-art performance on comparable datasets and scored a classification performance of F₁ = 0.63, a span extraction performance of F₁ = 0.44, and an end-to-end entity resolution performance of F₁ = 0.34 on the presented dataset.
Discussion: The performance of the models continues to highlight multiple challenges when deploying pharmacovigilance systems that use social media data. We discuss the implications of such models in the downstream tasks of signal detection and suggest future enhancements.
Conclusion: Mining ADEs from Twitter posts using a pipeline architecture requires the different components to be trained and tuned based on input data imbalance in order to ensure optimal performance on the end-to-end resolution task.
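
The three-stage structure described in the abstract (classification, span extraction, normalization) can be pictured with a minimal sketch. This is not the authors' implementation: the function names, the toy lexicon, and the concept identifiers below are hypothetical placeholders, and simple string matching stands in for the deep learning components. It only illustrates how each stage consumes the output of the previous one, which is why errors compound toward the end-to-end score.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List, Tuple

Span = Tuple[int, int]  # character offsets, end exclusive


@dataclass
class ResolvedADE:
    text: str        # surface form of the mention
    span: Span       # where it occurs in the tweet
    concept_id: str  # identifier in some standardized terminology


def run_pipeline(
    tweets: Iterable[str],
    classify: Callable[[str], bool],
    extract: Callable[[str], List[Span]],
    normalize: Callable[[str], str],
) -> List[ResolvedADE]:
    """Chain the three tasks: filter tweets, find spans, map spans to concepts."""
    results = []
    for tweet in tweets:
        if not classify(tweet):          # stage 1: ADE classification
            continue
        for start, end in extract(tweet):  # stage 2: span extraction
            mention = tweet[start:end]
            # stage 3: normalization of the extracted mention
            results.append(ResolvedADE(mention, (start, end), normalize(mention)))
    return results


# Hypothetical toy stand-ins for the trained models (placeholder concept IDs).
TOY_LEXICON = {"headache": "CONCEPT_HEADACHE", "nausea": "CONCEPT_NAUSEA"}


def toy_classify(tweet: str) -> bool:
    return any(term in tweet.lower() for term in TOY_LEXICON)


def toy_extract(tweet: str) -> List[Span]:
    lowered = tweet.lower()
    spans = []
    for term in TOY_LEXICON:
        idx = lowered.find(term)
        if idx != -1:
            spans.append((idx, idx + len(term)))
    return sorted(spans)


def toy_normalize(mention: str) -> str:
    return TOY_LEXICON.get(mention.lower(), "UNKNOWN")


tweets = [
    "This drug gave me a terrible headache",
    "Picked up my prescription today",
]
resolved = run_pipeline(tweets, toy_classify, toy_extract, toy_normalize)
```

Because a mention is only resolved correctly when all three stages succeed, the end-to-end score in such a design cannot exceed what the earlier stages let through, consistent with the decreasing F₁ values reported in the Results.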

Keywords:	social media mining; natural language processing; information extraction; pharmacovigilance; drug safety
