De-identification of unstructured paper-based health records for privacy-preserving secondary use
Authors: Stefan Fenz, Thomas Neubauer, Antonio Rella
Institution: Vienna University of Technology, Institute of Software Technology and Interactive Systems, Favoritenstrasse 9-11, 1040 Vienna, Austria
Abstract: Privacy is a serious concern whenever personal data is processed. In the document-centric e-health domain in particular, patients’ privacy must be preserved to prevent negative repercussions for the patient. Clinical research, for example, requires structured health records to carry out clinical trials efficiently, whereas legislation (e.g., HIPAA) stipulates that only de-identified health records may be used for research. However, unstructured and often paper-based data dominates information technology, especially in the healthcare sector. Existing approaches are geared towards English-language documents only and were not designed to recognize erroneous personal data that results from the OCR-based digitization of paper-based health records.
Keywords: computer security, health records, named entity recognition, natural language processing, privacy