Eventual situations for timeline extraction from clinical reports |
| |
Authors: | Cyril Grouin Natalia Grabar Thierry Hamon Sophie Rosset Xavier Tannier Pierre Zweigenbaum |
| |
Institution: | 1.LIMSI–CNRS, Orsay, France;2.INSERM, UMR_S 872, Eq 20 & UPMC, Paris, France;3.STL CNRS UMR 8163, Université Lille 1 et 3, Villeneuve-d''Ascq, France;4.LIM&Bio (EA3969), Université Paris 13, Bobigny, France;5.Université Paris-Sud 11, Orsay, France |
| |
Abstract: | ObjectiveTo identify the temporal relations between clinical events and temporal expressions in clinical reports, as defined in the i2b2/VA 2012 challenge.DesignTo detect clinical events, we used rules and Conditional Random Fields. We built Random Forest models to identify event modality and polarity. To identify temporal expressions we built on the HeidelTime system. To detect temporal relations, we systematically studied their breakdown into distinct situations; we designed an oracle method to determine the most prominent situations and the most suitable associated classifiers, and combined their results.ResultsWe achieved F-measures of 0.8307 for event identification, based on rules, and 0.8385 for temporal expression identification. In the temporal relation task, we identified nine main situations in three groups, experimentally confirming shared intuitions: within-sentence relations, section-related time, and across-sentence relations. Logistic regression and Naïve Bayes performed best on the first and third groups, and decision trees on the second. We reached a 0.6231 global F-measure, improving by 7.5 points our official submission.ConclusionsCarefully hand-crafted rules obtained good results for the detection of events and temporal expressions, while a combination of classifiers improved temporal link prediction. The characterization of the oracle recall of situations allowed us to point at directions where further work would be most useful for temporal relation detection: within-sentence relations and linking History of Present Illness events to the admission date. We suggest that the systematic situation breakdown proposed in this paper could also help improve other systems addressing this task. |
| |
Keywords: | Natural Language Processing Information Extraction Medical Records Chronology as Topic Text Mining |
|
|