Paper title
Symptom extraction from the narratives of personal experiences with COVID-19 on Reddit
Authors
Abstract
Social media discussion of COVID-19 provides a rich source of information on how the virus affects people's lives that is qualitatively different from traditional public health datasets. In particular, when individuals self-report their experiences over the course of the virus on social media, it can allow identification of the emotions that each stage of symptoms engenders in the patient. Posts to the Reddit forum r/COVID19Positive contain first-hand accounts from COVID-19 positive patients, giving insight into personal struggles with the virus. These posts often feature a temporal structure indicating the number of days after symptom onset that the text refers to. Using topic modelling and sentiment analysis, we quantify the change in discussion of COVID-19 throughout individuals' experiences over the first 14 days after symptom onset. Discourse on early symptoms such as fever, cough, and sore throat was concentrated towards the beginning of the posts, while language indicating breathing issues peaked around ten days. Some conversation around critical cases was also identified and appeared at a roughly constant rate. We identified two clear clusters of positive and negative emotions associated with the evolution of these symptoms and mapped their relationships. Our results provide a perspective on the patient experience of COVID-19 that complements other medical data streams and can potentially reveal when mental health issues might appear.
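The temporal structure mentioned in the abstract can be illustrated with a small sketch. It assumes posts mark time with phrases such as "Day 3:", which the abstract does not specify, and it uses plain keyword matching as a stand-in for the topic modelling and sentiment analysis the authors describe. The pattern `DAY_MARKER` and the functions `split_by_day` and `keyword_rate_by_day` are hypothetical names introduced here for illustration only, not the paper's method.

```python
import re
from collections import defaultdict

# Assumed marker format (not taken from the paper): "Day 3:", "day 10 -", etc.
DAY_MARKER = re.compile(r"\bday\s+(\d{1,2})\b\s*[:\-.]?\s*", re.IGNORECASE)


def split_by_day(post, max_day=14):
    """Map each day number in 1..max_day to the text following its marker."""
    segments = defaultdict(list)
    matches = list(DAY_MARKER.finditer(post))
    for i, match in enumerate(matches):
        day = int(match.group(1))
        # A segment runs until the next day marker or the end of the post.
        end = matches[i + 1].start() if i + 1 < len(matches) else len(post)
        text = post[match.end():end].strip()
        if 1 <= day <= max_day and text:
            segments[day].append(text)
    return {day: " ".join(parts) for day, parts in sorted(segments.items())}


def keyword_rate_by_day(posts, keywords, max_day=14):
    """Per day, the fraction of day segments that mention any keyword."""
    hits = defaultdict(int)
    totals = defaultdict(int)
    for post in posts:
        for day, text in split_by_day(post, max_day).items():
            totals[day] += 1
            lowered = text.lower()
            if any(keyword in lowered for keyword in keywords):
                hits[day] += 1
    return {day: hits[day] / totals[day] for day in sorted(totals)}


if __name__ == "__main__":
    # Invented example posts, not data from the study.
    posts = [
        "Day 1: fever and a sore throat. Day 2: cough started. "
        "Day 10: short of breath.",
        "Day 1 - mild cough. Day 9: breathing is hard.",
    ]
    print(keyword_rate_by_day(posts, ["breath"]))
```

The marker pattern is a heuristic: it also matches "day 3" in running prose, so a real pipeline would need to check how posts in the forum actually label days before relying on it.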