Locating the Source of Large-scale Diffusion of Foodborne Contamination

We study the problem of identifying the source of emerging large-scale outbreaks of foodborne disease. To solve the source identification problem we formulate a probabilistic model of the contamination diffusion process as a random walk on a network and derive the maximum likelihood estimator for the source location. By modeling the transmission process as a random walk, we are able to develop a novel, computationally tractable solution that accounts for all possible paths of travel through the network. This is in contrast to existing approaches to network source identification, which assume that the contamination travels along either the shortest or highest probability paths. We demonstrate the benefits of the multiple-paths approach through application to different network topologies, including stylized models of food supply network structure and real data from the 2011 EHEC outbreak in Germany. We show significant improvements in accuracy and reliability compared with the relevant state-of-the-art approach to source identification. Beyond foodborne disease, these methods should find application in identifying the source of spread in network-based diffusion processes more generally, including in networks not well approximated by tree-like structure.

[1]  Klaus Stark,et al.  Epidemic profile of Shiga-toxin-producing Escherichia coli O104:H4 outbreak in Germany. , 2011, The New England journal of medicine.

[2]  H. Friedrich Simulation of logistics in food retailing for freight transportation analysis , 2010 .

[3]  Kun Hu,et al.  A Likelihood-Based Approach to Identifying Contaminated Food Products Using Sales Data: Performance and Challenges , 2014, PLoS Comput. Biol..

[4]  Tejas Bhatt,et al.  Pilot projects for improving product tracing along the food supply system. , 2013, Journal of food science.

[5]  Régis Pouillot,et al.  Estimating parametric distributions of storage time and temperature of ready-to-eat foods for U.S. households. , 2010, Journal of food protection.

[6]  Matthias Greiner,et al.  Trace-back and trace-forward tools developed ad hoc and used during the STEC O104:H4 outbreak 2011 in Germany and generic concepts for future outbreak situations. , 2013, Foodborne pathogens and disease.

[7]  Christos Faloutsos,et al.  Efficiently spotting the starting points of an epidemic in a large graph , 2013, Knowledge and Information Systems.

[8]  Bechara Choucair,et al.  Health Department Use of Social Media to Identify Foodborne Illness — Chicago, Illinois, 2013–2014 , 2014, MMWR. Morbidity and mortality weekly report.

[9]  Eunsoo Seo,et al.  Identifying rumors and their sources in social networks , 2012, Defense + Commercial Sensing.

[10]  Elaine O. Nsoesie,et al.  Online reports of foodborne illness capture foods implicated in official foodborne outbreak reports. , 2014, Preventive medicine.

[11]  Luis Gravano,et al.  Using Online Reviews by Restaurant Patrons to Identify Unreported Cases of Foodborne Illness — New York City, 2012–2013 , 2014, MMWR. Morbidity and mortality weekly report.

[12]  L. Gould,et al.  Vital Signs: Multistate Foodborne Outbreaks - United States, 2010-2014. , 2015, MMWR. Morbidity and mortality weekly report.

[13]  Denyse I. LeBlanc,et al.  Comparative simulation of Escherichia coli O157:H7 behaviour in packaged fresh-cut lettuce distributed in a typical Canadian supply chain in the summer and winter , 2014 .

[14]  Alireza Louni,et al.  A two-stage algorithm to estimate the source of information diffusion in social media networks , 2014, 2014 IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS).

[15]  John A. Painter,et al.  Attribution of Foodborne Illnesses, Hospitalizations, and Deaths to Food Commodities by using Outbreak Data, United States, 1998–2008 , 2013, Emerging infectious diseases.

[16]  Leila Hashemi Beni,et al.  A national produce supply chain database for food safety risk analysis , 2015 .

[17]  Dirk Helbing,et al.  Origin Detection During Food-borne Disease Outbreaks - A Case Study of the 2011 EHEC/HUS Outbreak in Germany , 2014, PLoS currents.

[18]  Yue M. Lu,et al.  A fast Monte Carlo algorithm for source localization on graphs , 2013, Optics & Photonics - Optical Engineering + Applications.

[19]  Devavrat Shah,et al.  Rumors in a Network: Who's the Culprit? , 2009, IEEE Transactions on Information Theory.

[20]  Matthias Greiner,et al.  German outbreak of Escherichia coli O104:H4 associated with sprouts. , 2011, The New England journal of medicine.

[21]  Riccardo Zecchina,et al.  Bayesian inference of epidemics on networks via Belief Propagation , 2013, Physical review letters.

[22]  H. Friedrich,et al.  Dynamic freight flow modelling for risk evaluation in food supply , 2015, Transportation Research Part E: Logistics and Transportation Review.

[23]  L. D. Costa,et al.  Identifying the starting point of a spreading process in complex networks. , 2010, Physical review. E, Statistical, nonlinear, and soft matter physics.

[24]  D. Helbing,et al.  The Hidden Geometry of Complex, Network-Driven Contagion Phenomena , 2013, Science.

[25]  Martin Vetterli,et al.  Locating the Source of Diffusion in Large-Scale Networks , 2012, Physical review letters.

[26]  Robert V. Tauxe,et al.  Outbreak of Escherichia coli O104:H4 Infections Associated with Sprout Consumption — Europe and North America, May–July 2011 , 2013, MMWR. Morbidity and mortality weekly report.

[27]  Mark E. J. Newman,et al.  The Structure and Function of Complex Networks , 2003, SIAM Rev..

[28]  Lenka Zdeborová,et al.  Inferring the origin of an epidemy with dynamic message-passing algorithm , 2013, Physical review. E, Statistical, nonlinear, and soft matter physics.

[29]  Vincenzo Fioriti,et al.  Predicting the sources of an outbreak with a spectral technique , 2012, ArXiv.

[30]  Melinda J. Wilkins,et al.  Outbreak Investigations (Epidemiology) , 2015 .