Enabling Cost-Effective Population Health Monitoring By Exploiting Spatiotemporal Correlation

Because of its important role in shaping health policy, population health monitoring (PHM) is considered a fundamental building block of public health services. However, traditional public health data collection approaches, such as clinic-visit-based data integration or health surveys, can be very costly and time-consuming. To address this challenge, this paper proposes a cost-effective approach called Compressive Population Health (CPH): data are collected in the traditional way in only a subset of the regions within a given area, and the inherent spatial correlations between neighboring regions are leveraged to infer data for the remaining regions. By alternating the selected regions longitudinally, the approach can validate and correct previously assessed spatial correlations. To verify whether CPH is feasible, we conduct an in-depth study based on spatiotemporal morbidity rates of chronic diseases in more than 500 regions around London over more than ten years. We introduce our CPH approach and present three extensive analytical studies. The first confirms that significant spatiotemporal correlations do exist. The second deploys multiple state-of-the-art data recovery algorithms and verifies that these spatiotemporal correlations can be leveraged to infer data accurately from only a small number of samples. Finally, we compare different methods for selecting the regions used for traditional data collection and show how such methods can further reduce the overall cost while maintaining high PHM quality.
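The core idea (survey a rotating subset of regions, then infer the rest from correlations) can be illustrated with a minimal sketch. This is not the paper's implementation: the abstract does not name its recovery algorithms, so the example substitutes a simple iterated truncated-SVD imputation, and it runs on a synthetic low-rank matrix instead of the London morbidity data. The function names `complete_low_rank` and `simulate_cph`, the matrix sizes, the rank, and the 50% sampling ratio are all assumptions chosen for illustration.

```python
import numpy as np


def complete_low_rank(observed, mask, rank=2, n_iters=200):
    """Fill unobserved entries by iterated truncated-SVD imputation.

    A stand-in recovery method (assumption), not the paper's algorithm.
    """
    # Initialise the unobserved entries with the mean of the observed ones.
    estimate = np.where(mask, observed, observed[mask].mean())
    for _ in range(n_iters):
        u, s, vt = np.linalg.svd(estimate, full_matrices=False)
        low_rank = (u[:, :rank] * s[:rank]) @ vt[:rank]
        # Keep surveyed values; replace only the unsurveyed ones.
        estimate = np.where(mask, observed, low_rank)
    return estimate


def simulate_cph(n_regions=40, n_periods=24, rank=2, seed=0):
    """Return (RMSE of inferred values, RMSE of a mean-fill baseline)."""
    rng = np.random.default_rng(seed)
    # Synthetic low-rank "morbidity rate" matrix: regions x time periods.
    truth = rng.random((n_regions, rank)) @ rng.random((rank, n_periods))

    # Survey a different random half of the regions in each period.
    mask = np.zeros((n_regions, n_periods), dtype=bool)
    for t in range(n_periods):
        surveyed = rng.choice(n_regions, size=n_regions // 2, replace=False)
        mask[surveyed, t] = True

    observed = np.where(mask, truth, 0.0)
    inferred = complete_low_rank(observed, mask, rank=rank)
    baseline = np.where(mask, observed, observed[mask].mean())

    hidden = ~mask  # entries that were never collected

    def rmse(est):
        return float(np.sqrt(np.mean((est[hidden] - truth[hidden]) ** 2)))

    return rmse(inferred), rmse(baseline)


rmse_inferred, rmse_baseline = simulate_cph()
print(f"inferred RMSE: {rmse_inferred:.4f}, mean-fill RMSE: {rmse_baseline:.4f}")
```

Comparing the two errors on the unsurveyed entries gives a rough sense of how much a correlation-aware recovery can gain over a naive fill. On real data the matrix is only approximately low-rank, so the gap would depend on the strength of the correlations the first study measures.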
