首页 | 本学科首页   官方微博 | 高级检索  
检索        

基于社交媒体中文本信息的早期抑郁症检测
引用本文:张梦娜,王君岩,龙洋,张浩峰,胡勇.基于社交媒体中文本信息的早期抑郁症检测[J].中国生物医学工程学报,2022,41(1):21-30.
作者姓名:张梦娜  王君岩  龙洋  张浩峰  胡勇
作者单位:1(南京理工大学医院预防保健科,南京 210094)2(新南威尔士大学计算机科学与工程学院, 悉尼 2052)3(杜伦大学计算机科学系,杜伦 DH1 3LE)4(南京理工大学计算机科学与工程学院,南京 210094)
基金项目:国家自然科学基金(61872187,62072246);;英国医学研究委员会创新基金No.MR/S003916/1~~;
摘    要:诊断抑郁症的传统方法是通过面对面的评估和交谈.但是,许多患有抑郁症的患者不愿意在早期阶段就医,从而使病情恶化.为了在早期判断抑郁症患者的情况,提出一种利用社交媒体文本信息的时间序列特征和多示例学习的检测模型,考虑到抑郁症状不会立即出现,所以时序样本的使用显得非常重要,因此使用无监督LSTM提取时间序列特征,训练分类器实...

关 键 词:抑郁症检测  长短时记忆  时间序列特征  社交媒体  多示例学习
收稿时间:2021-03-08

Early Detection of Depression Based on Textual Information in Social Media
Zhang Mengna,Wang Junyan,Long Yang,Zhang Haofeng,Hu Yong.Early Detection of Depression Based on Textual Information in Social Media[J].Chinese Journal of Biomedical Engineering,2022,41(1):21-30.
Authors:Zhang Mengna  Wang Junyan  Long Yang  Zhang Haofeng  Hu Yong
Institution:(Department of Preventive Health,Hospital of Nanjing University of Science and Technology, Nanjing 210094, China)(School of Computer Science and Engineering, University of New South Wales, Sydney 2052, Australia)(Department of Computer Science, Durham University, Durham DH13LE, UK)(School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing 210094, China)
Abstract:The traditional method of diagnosing depression is through face-to-face assessment and conversation. However, many patients with depression are reluctant to seek medical attention at an early stage, which makes their condition worse. In order to judge the situation of patients with depression in the early stage, a detection model using time series features of social media textual information and multi-instance learning was proposed in this work. Considering that depressive symptoms will not appear immediately, the use of time series samples will be very important. Therefore, the unsupervised LSTM was used to extract time series features, binary classification was implemented by training a classifier, and a multi-instance learning model was exploited to solve the problem of unbalanced samples. Naive Bayes classifiers, random forests, multivariate social network learning and multimodal depression dictionary learning were used as benchmark methods firstly, and then the multi-instance learning with unsupervised LSTM time series functions was employed to detect depression more accurately. On the basis of the MDDL dataset, 200 survey subjects totally 7946 tweets were selected, and the training-test ratio was set as 8:2. Experimental results were following: the accuracy, precision, recall and F1 score reached 75.0%, 76.0%, 73.0%, and 74.5%, respectively, which demonstrated that it was feasible to use machine learning for early depression detection through text data in social media. In addition, a large number of ablation studies also verified that the method using time series features could achieve better performance than the traditional benchmark methods.
Keywords:depression detection  long short-term memory (LSTM)  time series feature  social media  multi-instance learning  
点击此处可从《中国生物医学工程学报》浏览原始摘要信息
点击此处可从《中国生物医学工程学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号