Application of Big Data Technology for COVID-19 Prevention and Control in China: Lessons and Recommendations

Background In the prevention and control of infectious diseases, previous research on the application of big data technology has mainly focused on the early warning and early monitoring of infectious diseases. Although the application of big data technology for COVID-19 warning and monitoring remain important tasks, prevention of the disease’s rapid spread and reduction of its impact on society are currently the most pressing challenges for the application of big data technology during the COVID-19 pandemic. After the outbreak of COVID-19 in Wuhan, the Chinese government and nongovernmental organizations actively used big data technology to prevent, contain, and control the spread of COVID-19. Objective The aim of this study is to discuss the application of big data technology to prevent, contain, and control COVID-19 in China; draw lessons; and make recommendations. Methods We discuss the data collection methods and key data information that existed in China before the outbreak of COVID-19 and how these data contributed to the prevention and control of COVID-19. Next, we discuss China’s new data collection methods and new information assembled after the outbreak of COVID-19. Based on the data and information collected in China, we analyzed the application of big data technology from the perspectives of data sources, data application logic, data application level, and application results. In addition, we analyzed the issues, challenges, and responses encountered by China in the application of big data technology from four perspectives: data access, data use, data sharing, and data protection. Suggestions for improvements are made for data collection, data circulation, data innovation, and data security to help understand China’s response to the epidemic and to provide lessons for other countries’ prevention and control of COVID-19. Results In the process of the prevention and control of COVID-19 in China, big data technology has played an important role in personal tracking, surveillance and early warning, tracking of the virus’s sources, drug screening, medical treatment, resource allocation, and production recovery. The data used included location and travel data, medical and health data, news media data, government data, online consumption data, data collected by intelligent equipment, and epidemic prevention data. We identified a number of big data problems including low efficiency of data collection, difficulty in guaranteeing data quality, low efficiency of data use, lack of timely data sharing, and data privacy protection issues. To address these problems, we suggest unified data collection standards, innovative use of data, accelerated exchange and circulation of data, and a detailed and rigorous data protection system. Conclusions China has used big data technology to prevent and control COVID-19 in a timely manner. To prevent and control infectious diseases, countries must collect, clean, and integrate data from a wide range of sources; use big data technology to analyze a wide range of big data; create platforms for data analyses and sharing; and address privacy issues in the collection and use of big data.

[1]  M. Javaid,et al.  Significant Applications of Big Data in COVID-19 Pandemic , 2020, Indian Journal of Orthopaedics.

[2]  Lei Dong,et al.  Migration patterns in China extracted from mobile positioning data , 2019, Habitat International.

[3]  Zhiwei Xu,et al.  Monitoring Pertussis Infections Using Internet Search Queries , 2017, Scientific Reports.

[4]  Qingwu Wu,et al.  Using the internet search data to investigate symptom characteristics of COVID-19: A big data study , 2020, World Journal of Otorhinolaryngology - Head and Neck Surgery.

[5]  Bin Wang,et al.  Time Series Analyses of Hand, Foot and Mouth Disease Integrating Weather Variables , 2015, PloS one.

[6]  Yong Huang,et al.  Dynamic Forecasting of Zika Epidemics Using Google Trends , 2016, bioRxiv.

[7]  Gerardo Chowell,et al.  Big Data for Infectious Disease Surveillance and Modeling. , 2016, The Journal of infectious diseases.

[8]  I. Bogoch,et al.  A Platform for Monitoring Regional Antimicrobial Resistance, Using Online Data Sources: ResistanceOpen. , 2016, The Journal of infectious diseases.

[9]  Andrew Janowczyk,et al.  Deep learning for digital pathology image analysis: A comprehensive tutorial with selected use cases , 2016, Journal of pathology informatics.

[10]  Kwok-Leung Tsui,et al.  Forecasting influenza in Hong Kong with Google search queries and statistical model fusion , 2017, PloS one.

[11]  Mesut Toğaçar,et al.  COVID-19 detection using deep learning models to exploit Social Mimic Optimization and structured chest X-ray images using fuzzy color and stacking approaches , 2020, Computers in Biology and Medicine.

[12]  Cécile Viboud,et al.  Infectious Disease Surveillance in the Big Data Era: Towards Faster and Locally Relevant Systems. , 2016, The Journal of infectious diseases.

[13]  Y. Liao,et al.  COVID-19: Challenges to GIS with Big Data , 2020, Geography and Sustainability.

[14]  Leesa Lin,et al.  Combat COVID-19 with artificial intelligence and big data , 2020, Journal of travel medicine.

[15]  M. Salathé Digital Pharmacovigilance and Disease Surveillance: Combining Traditional and Big-Data Systems for Better Public Health , 2016, The Journal of infectious diseases.

[16]  John P A Ioannidis,et al.  Informed Consent, Big Data, and the Oxymoron of Research That Is Not Research , 2013, The American journal of bioethics : AJOB.

[17]  C. Viboud,et al.  Elucidating Transmission Patterns From Internet Reports: Ebola and Middle East Respiratory Syndrome as Case Studies , 2016, The Journal of infectious diseases.

[18]  Y. Gel,et al.  Influenza Forecasting with Google Flu Trends , 2013, PloS one.

[19]  Ben Armstrong,et al.  Host, Weather and Virological Factors Drive Norovirus Epidemiology: Time-Series Analysis of Laboratory Surveillance Data in England and Wales , 2009, PloS one.

[20]  Chiara Garattini,et al.  Big Data Analytics, Infectious Diseases and Associated Ethical Impacts , 2017, Philosophy & Technology.

[21]  Kim,et al.  Big Data in Healthcare Hype and Hope , 2013 .

[22]  W. Liang,et al.  Modified SEIR and AI prediction of the epidemics trend of COVID-19 in China under public health interventions , 2020, Journal of thoracic disease.

[23]  L. Yang,et al.  The association between domestic train transportation and novel coronavirus (2019-nCoV) outbreak in China from 2019 to 2020: A data-driven correlational report , 2020, Travel Medicine and Infectious Disease.

[24]  Jianhong Wu,et al.  How Big Data and Artificial Intelligence Can Help Better Manage the COVID-19 Pandemic , 2020, International journal of environmental research and public health.

[25]  Y. Moreno,et al.  Participatory Syndromic Surveillance of Influenza in Europe. , 2016, The Journal of infectious diseases.

[26]  Taghi M. Khoshgoftaar,et al.  Deep learning applications and challenges in big data analytics , 2015, Journal of Big Data.

[27]  Shilu Tong,et al.  Disease surveillance based on Internet-based linear models: an Australian case study of previously unmodeled infection diseases , 2016, Scientific reports.

[28]  Qiang Sun,et al.  Prediction of Number of Cases of 2019 Novel Coronavirus (COVID-19) Using Social Media Search Index , 2020, International journal of environmental research and public health.

[29]  Bin Peng,et al.  In silico screening of Chinese herbal medicines with the potential to directly inhibit 2019 novel coronavirus , 2020, Journal of Integrative Medicine.

[30]  Sharareh R Niakan Kalhori,et al.  Predicting COVID-19 Incidence Through Analysis of Google Trends Data in Iran: Data Mining and Deep Learning Pilot Study , 2020, JMIR Public Health and Surveillance.

[31]  Bonnie Feldman,et al.  Big Data in Healthcare Hype and Hope , 2012 .

[32]  Veenu Mangat,et al.  Technology and Trends to Handle Big Data: Survey , 2015, 2015 Fifth International Conference on Advanced Computing & Communication Technologies.

[33]  P. Fournier,et al.  Clinical efficacy of chloroquine derivatives in COVID-19 infection: comparative meta-analysis between the big data and the real world , 2020, New Microbes and New Infections.

[34]  Sangwon Chae,et al.  Predicting Infectious Disease Using Deep Learning and Big Data , 2018, International journal of environmental research and public health.

[35]  Ming-Hsiang Tsou,et al.  Applying GIS and Machine Learning Methods to Twitter Data for Multiscale Surveillance of Influenza , 2016, PloS one.

[36]  Viju Raghupathi,et al.  Big data analytics in healthcare: promise and potential , 2014, Health Information Science and Systems.

[37]  J. Rocklöv,et al.  Short Term Effects of Weather on Hand, Foot and Mouth Disease , 2011, PloS one.

[38]  Yan Bai,et al.  A fully automatic deep learning system for COVID-19 diagnostic and prognostic analysis , 2020, European Respiratory Journal.

[39]  Dmitrii Bychkov,et al.  Deep learning based tissue analysis predicts outcome in colorectal cancer , 2018, Scientific Reports.

[40]  Sebastian Thrun,et al.  Dermatologist-level classification of skin cancer with deep neural networks , 2017, Nature.

[41]  Jeremy Ginsberg,et al.  Detecting influenza epidemics using search engine query data , 2009, Nature.

[42]  Jae Ho Lee,et al.  Correlation between National Influenza Surveillance Data and Google Trends in South Korea , 2013, PloS one.

[43]  Andrew C. Miller,et al.  Advances in nowcasting influenza-like illness rates using search query logs , 2015, Scientific Reports.

[44]  Gail M. Williams,et al.  Imported Dengue Cases, Weather Variation and Autochthonous Dengue Incidence in Cairns, Australia , 2013, PloS one.

[45]  S. Guraya Transforming laparoendoscopic surgical protocols during the COVID-19 pandemic; big data analytics, resource allocation and operational considerations , 2020, International Journal of Surgery.