论文标题
COVID-19在线数据集的策划集合
A curated collection of COVID-19 online datasets
论文作者
论文摘要
2020年的决定性时刻之一是冠状病毒病(Covid-19)的爆发,这是一种致命的病毒,影响了人体的呼吸系统,以至于需要通过呼吸机获得呼吸辅助。截至2020年6月21日,有12,929,306例确认的病例和569,738例确认死亡的216个国家,地区或地区。大流行的扩散和影响的规模使许多国家都用预防和治疗方法挣扎。引入的臭名昭著的锁定措施减轻了病毒差异,这改变了我们社会习惯的许多方面,在这种情况下,对基于在线服务的需求飞涨。随着病毒的传播,通过在线社交媒体周围的错误信息和虚假新闻也是如此,这似乎比真实性更喜欢病毒。长期以来,大部分民众都局限于他们的房屋,因此对在线错误信息的有毒影响的脆弱性很高。一个典型的例子是与Covid-19相关的各种神话和虚假信息,如果不受组织,这可能会导致灾难性的结果并妨碍对抗病毒的斗争。 尽管科学界正在积极参与识别病毒治疗,但越来越有兴趣打击相关的有害疾病。为此,研究人员一直在策划和记录有关Covid-19的各种数据集。根据现有研究,我们提供了广泛的策划数据集的集合,以支持与大流行的斗争,尤其是关于错误信息。该集合包括3个类别的Twitter数据,有关可靠来源的标准实践的信息以及全球状况报告。我们描述了如何检索数据的水合版本并提供一些可以使用数据解决的研究问题。
One of the defining moments of the year 2020 is the outbreak of Coronavirus Disease (Covid-19), a deadly virus affecting the body's respiratory system to the point of needing a breathing aid via ventilators. As of June 21, 2020 there are 12,929,306 confirmed cases and 569,738 confirmed deaths across 216 countries, areas or territories. The scale of spread and impact of the pandemic left many nations grappling with preventive and curative approaches. The infamous lockdown measure introduced to mitigate the virus spread has altered many aspects of our social routines in which demand for online-based services skyrocketed. As the virus propagate, so does misinformation and fake news around it via online social media, which seems to favour virality over veracity. With a majority of the populace confined to their homes for a long period, vulnerability to the toxic impact of online misinformation is high. A case in point is the various myths and disinformation associated with the Covid-19, which, if left unchecked, could lead to a catastrophic outcome and hamper the fight against the virus. While the scientific community is actively engaged in identifying the virus treatment, there is a growing interest in combating the associated harmful infodemic. To this end, researchers have been curating and documenting various datasets about Covid-19. In line with existing studies, we provide an expansive collection of curated datasets to support the fight against the pandemic, especially concerning misinformation. The collection consists of 3 categories of Twitter data, information about standard practices from credible sources and a chronicle of global situation reports. We describe how to retrieve the hydrated version of the data and proffer some research problems that could be addressed using the data.