基于Web数据挖掘的COVID-19流行病学特征分析

郑文, 赵偲, 李泽堃, 武啸泽, 胡涛

郑文, 赵偲, 李泽堃, 武啸泽, 胡涛. 基于Web数据挖掘的COVID-19流行病学特征分析[J]. 电子科技大学学报, 2020, 49(3): 408-414. doi: 10.12178/1001-0548.2020079
引用本文: 郑文, 赵偲, 李泽堃, 武啸泽, 胡涛. 基于Web数据挖掘的COVID-19流行病学特征分析[J]. 电子科技大学学报, 2020, 49(3): 408-414. doi: 10.12178/1001-0548.2020079

ZHENG Wen, ZHAO Cai, LI Ze-kun, WU Xiao-ze, HU Tao. Epidemiological Characteristics of Novel Coronavirus COVID-19 Based on Web Data Mining[J]. Journal of University of Electronic Science and Technology of China, 2020, 49(3): 408-414. doi: 10.12178/1001-0548.2020079
Citation: ZHENG Wen, ZHAO Cai, LI Ze-kun, WU Xiao-ze, HU Tao. Epidemiological Characteristics of Novel Coronavirus COVID-19 Based on Web Data Mining[J]. Journal of University of Electronic Science and Technology of China, 2020, 49(3): 408-414. doi: 10.12178/1001-0548.2020079

基于Web数据挖掘的COVID-19流行病学特征分析

doi: 10.12178/1001-0548.2020079
  • 郑文1, ,
  • 赵偲1,
  • 李泽堃1,
  • 武啸泽1,
  • 胡涛2
  • 1.

    太原理工大学大数据学院公共安全大数据研究所 太原 030060

  • 2.

    华中科技大学同济医学院 机构 附属武汉中心医院肿瘤科 武汉 430014

基金项目: 国家自然科学基金(11702289 基金 );冲击与安全工程教育部重点实验室开放基金(cj201902)

详细信息

    作者简介:

    郑文(1985-),男,博士,研究员,主要从事公共安全大数据、数字政府建设等方面的研究. E-mail:zhengwen@tyut.edu.cn

  • 中图分类号: TP391

Epidemiological Characteristics of Novel Coronavirus COVID-19 Based on Web Data Mining

  • 1.

    Institute of Public-Safety and Big Data, College of Data Science, Taiyuan University of Technology Taiyuan 030060

  • 2.

    The Central Hospital of Wuhan, Tongji Medical College, Huazhong University of Science and Technology Wuhan 430014

摘要

  • 摘要
  • HTML全文
  • 图(7)表(3)
  • 参考文献(21)
  • 相关文章
  • 施引文献(2)
  • 资源附件(0)
  • 访问统计
  • 摘要:基于Selenium数据挖掘技术,通过对2020年2月4日−22日新浪微博“肺炎患者求助超话”中690例有效病例的分析,获得了新浪微博中真实求助病例的流行病学特征。研究发现,求助患者97.6%来自于武汉,重点集中在武昌、硚口、汉阳等中心城区,与当地的医疗资源和人口密度成正比。微博求助病例主要分布在2020年2月4日−7日,随着医疗资源紧张程度的缓解,通过微博求助的病例明显减少。求助患者确诊日期主要分布在2020年1月16日−2月6日,与中国疾控中心发布的病例分布情况基本一致。求助患者年龄分布中位数为60岁,明显高于中国疾控中心发布的数据,但与武汉市中心医院的数据基本吻合。该文研究结果说明,针对重大突发性传染病,微博等社交媒体除了在舆论传播上发挥作用,在流行病学分析上也具有重要意义。基于社交媒体的实时性和广泛性,结合数据挖掘和大数据分析等方法,有助于决策层快速掌握一线真实情况。
      关键词:

    • 新冠肺炎 /
    • 数据挖掘 /
    • 流行病学 /
    • 社交媒体
    Abstract:Based on the Selenium data mining technology, the epidemiological characteristics of real help cases in Sina Weibo were obtained by the analysis of 690 valid cases posted in the Sina Weibo “Pneumonia Patients Asking for Help” topic from February 4 to February 22, 2020. The research showed that 97.6% of the patients seeking for help came from Wuhan, mainly centralized in Wuchang, Tongkou, Hanyang etc. urban areas, and the proportion is directly proportional to the local medical resources and population density. The cases of Weibo help were mainly distributed from February 4 to February 7, 2020. With the relief of medical resources, the number of cases seeking help through social media decreased significantly. The distribution of patients, whose diagnosed date was mainly from January 16 to February 6, 2020, was basically consistent with the case information released by the Chinese Center for Disease Control and Prevention (CCDC). The median age of patients seeking for help was 60 years old, which was much higher than the data released by the CCDC but was roughly coincident with the data of the central hospital of Wuhan. The results of this study indicate that when dealing with major outbreaks of infectious diseases, social media are equally important in epidemiological analysis as well as the role in the dissemination of public opinion. Based on the wide adoption and timeliness nature of social media, it will be helpful for decision-makers to quickly grasp the real-world situation as it is combined with data mining or big data analysis.
      Key words:

    • COVID-19 /
    • data mining /
    • epidemiological /
    • social media
(0)

相关推荐