Data analysis of COVID-2019 epidemic using machine learning methods: a case study of India

At this time, COVID-2019 is spreading its foot in the form of a huge epidemic for the world. This epidemic is spreading its foot very fast in India too. One of the World Health Organization states that COVID-2019 is a serious disease that spreads from one person to another at very fast speed through contact routes and respiratory drops. On this day, India and the world should rise to an effective step to analyze this disease and eliminate the effects of this epidemic. In this paper presented, the growing database of COVID-2019 has been analyzed from March 1, 2020, to April 11, 2020, and the next one is predicted for the number of patients suffering from the rising COVID-2019. Different regression analysis models have been utilized for data analysis of COVID-2019 of India based on data stored by Kaggle in between 1 March 2020 to 11 April 2020. In this study, we have been utilized six regression analysis based models namely quadratic, third degree, fourth degree, fifth degree, sixth degree, and exponential polynomial respectively for the COVID-2019 dataset. We have calculated the root mean square of these six regression analysis models. In these six models, the root mean square error of sixth degree polynomial is very less in compared other like quadratic, third degree, fourth degree, fifth degree, and exponential polynomial. Therefore the sixth degree polynomial regression model is very good models for forecasting the next 6 days for COVID-2019 data analysis in India. In this study, we have found that the sixth degree polynomial regression models will help Indian doctors and the Government in preparing their plans in the next 7 days. Based on further regression analysis study, this model can be tuned for forecasting over long term intervals.

[1]  Zunyou Wu,et al.  Characteristics of and Important Lessons From the Coronavirus Disease 2019 (COVID-19) Outbreak in China: Summary of a Report of 72 314 Cases From the Chinese Center for Disease Control and Prevention. , 2020, JAMA.

[2]  Lawrence Carin,et al.  Digital technology and COVID-19 , 2020, Nature Medicine.

[3]  W. Wieland-Alter,et al.  Volatile fingerprinting of human respiratory viruses from cell culture , 2018, Journal of breath research.

[4]  S. Deb,et al.  A time series method to analyze incidence pattern and estimate reproduction number of COVID-19 , 2020, 2003.10655.

[5]  N. Arinaminpathy,et al.  Prudent public health intervention strategies to control the coronavirus disease 2019 transmission in India: A mathematical model-based approach , 2020, The Indian journal of medical research.

[6]  Rajat Kumar Behera,et al.  Predicting Malarial Outbreak using Machine Learning and Deep Learning Approach: A Review and Analysis , 2018, 2018 International Conference on Information Technology (ICIT).

[7]  M. Dunowska,et al.  A serological survey of canine respiratory coronavirus in New Zealand , 2019, New Zealand veterinary journal.

[8]  C. Cheung,et al.  Review of the Clinical Characteristics of Coronavirus Disease 2019 (COVID-19) , 2020, Journal of General Internal Medicine.

[9]  Xin Zhou,et al.  Risk Factors Associated With Acute Respiratory Distress Syndrome and Death in Patients With Coronavirus Disease 2019 Pneumonia in Wuhan, China , 2020, The Journal of Emergency Medicine.

[10]  Michael Krauthammer,et al.  Controlling testing volume for respiratory viruses using machine learning and text mining , 2016, AMIA.

[11]  E. Nsoesie,et al.  A systematic review of studies on forecasting the dynamics of influenza outbreaks , 2013, Influenza and other respiratory viruses.

[12]  C. Campèse,et al.  First cases of coronavirus disease 2019 (COVID-19) in France: surveillance, investigations and control measures, January 2020 , 2020, Euro surveillance : bulletin Europeen sur les maladies transmissibles = European communicable disease bulletin.

[13]  X. Wang,et al.  Predicting hepatitis B virus–positive metastatic hepatocellular carcinomas using gene expression profiling and supervised machine learning , 2003, Nature Medicine.

[14]  Rajan Gupta,et al.  Trend Analysis and Forecasting of COVID-19 outbreak in India , 2020, medRxiv.

[15]  E. Dong,et al.  An interactive web-based dashboard to track COVID-19 in real time , 2020, The Lancet Infectious Diseases.

[16]  P. Piro,et al.  Investigating a Serious Challenge in the Sustainable Development Process: Analysis of Confirmed cases of COVID-19 (New Type of Coronavirus) Through a Binary Classification Using Artificial Intelligence and Regression Analysis , 2020, Sustainability.

[17]  Marta Giovanetti,et al.  Application of the ARIMA model on the COVID-2019 epidemic dataset , 2020, Data in Brief.