Prediction of the disease controllability in a complex network using machine learning algorithms

The application of machine learning (ML) techniques span a vast spectrum ranging from speech, face and character recognition, medical diagnosis, anomaly detection in data to the general classification, prediction, and regression problems. In the present work, we solve the problem of predicting R_0 for disease spreading on complex networks using the regression-based state-of-art ML techniques. R_0 is a metric that determines whether the disease-free epidemic or an endemic state is asymptotically stable and hence indicates the controllability of the disease spread. We predict R_0 , based on training the ML models with structural properties of complex networks, irrespective of the network type. The prediction is possible because: (a) The structure of complex networks plays an essential role in the spreading processes on networks (b) The regression techniques such as Support Vector Regression and Artificial Neural Network Model can be very efficiently used for prediction problems, even for non-linear data. We obtained good accuracy in the prediction of R_0 for the simulated networks as well as real-world networks using these techniques. Moreover, the ML model training is a one-time investment cost in terms of training time and memory, and the trained model can be used for predicting R_0 on unseen/new examples of networks.

[1]  Alexander Grey,et al.  The Mathematical Theory of Infectious Diseases and Its Applications , 1977 .

[2]  Y. Moreno,et al.  Epidemic outbreaks in complex heterogeneous networks , 2001, cond-mat/0107267.

[3]  Herbert W. Hethcote,et al.  The Mathematics of Infectious Diseases , 2000, SIAM Rev..

[4]  Aric Hagberg,et al.  Exploring Network Structure, Dynamics, and Function using NetworkX , 2008, Proceedings of the Python in Science Conference.

[5]  Andrew D Stewart,et al.  AN EMPIRICAL STUDY OF THE EVOLUTION OF VIRULENCE UNDER BOTH HORIZONTAL AND VERTICAL TRANSMISSION , 2005, Evolution; international journal of organic evolution.

[6]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[7]  Qi Tian,et al.  Feature selection using principal feature analysis , 2007, ACM Multimedia.

[8]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[9]  D. Selvathi,et al.  Brain MRI Slices Classification Using Least Squares Support Vector Machine , 2007 .

[10]  A Díaz-Guilera,et al.  Self-similar community structure in a network of human interactions. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[11]  Walter F. Stenning,et al.  AN EMPIRICAL STUDY , 2003 .

[12]  L. Monteiro,et al.  On the basic reproduction number and the topological properties of the contact network: An epidemiological study in mainly locally connected cellular automata , 2009, Ecological Modelling.

[13]  A. T. C. Goh,et al.  Back-propagation neural networks for modeling complex systems , 1995, Artif. Intell. Eng..

[14]  J. Botella de Maglia,et al.  [Prevention of malaria]. , 1999, Revista clinica espanola.

[15]  J. Borge-Holthoefer,et al.  Discrete-time Markov chain approach to contact-based disease spreading in complex networks , 2009, 0907.1313.

[16]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..

[17]  Alessandro Vespignani,et al.  Immunization of complex networks. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[18]  Mark D. F. Shirley,et al.  The impacts of network topology on disease spread , 2005 .

[19]  Sergey N. Dorogovtsev,et al.  Localization and Spreading of Diseases in Complex Networks , 2012, Physical review letters.

[20]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[21]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[22]  W. O. Kermack,et al.  A contribution to the mathematical theory of epidemics , 1927 .

[23]  A. Hammer Project title: , 1999 .

[24]  Lev Muchnik,et al.  Identifying influential spreaders in complex networks , 2010, 1001.5285.

[25]  Proceedings of the Fourteenth Annual ACM-SIAM Symposium on Discrete Algorithms, January 12-14, 2003, Baltimore, Maryland, USA , 2003, SODA.

[26]  Jiang Zhang,et al.  Complex Network Classification with Convolutional Neural Network , 2018, Tsinghua Science and Technology.

[27]  Y. Takane,et al.  Generalized Inverse Matrices , 2011 .

[28]  N. Ling The Mathematical Theory of Infectious Diseases and its applications , 1978 .

[29]  Fábio Henrique Pereira,et al.  Disease spreading in complex networks: A numerical study with Principal Component Analysis , 2017, Expert Systems with Applications.

[30]  Ryan A. Rossi,et al.  The Network Data Repository with Interactive Graph Analytics and Visualization , 2015, AAAI.

[31]  A. J. Lotka Elements of mathematical biology , 1956 .

[32]  Peter M. A. Sloot,et al.  International Journal of Computer Mathematics Stochastic Simulation of Hiv Population Dynamics through Complex Network Modelling Stochastic Simulation of Hiv Population Dynamics through Complex Network Modelling , 2022 .

[33]  Lalit M. Patnaik,et al.  Classification of magnetic resonance brain images using wavelets as input to support vector machine and neural network , 2006, Biomed. Signal Process. Control..

[34]  Ciro Cattuto,et al.  What's in a crowd? Analysis of face-to-face behavioral networks , 2010, Journal of theoretical biology.

[35]  Yasser Yasami,et al.  A novel unsupervised classification approach for network anomaly detection by k-Means clustering and ID3 decision tree learning methods , 2010, The Journal of Supercomputing.

[36]  Alan M. Frieze,et al.  Random graphs , 2006, SODA '06.

[37]  P. Kaye Infectious diseases of humans: Dynamics and control , 1993 .

[38]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .