Missing sensor value estimation method for participatory sensing environment

Participatory sensing produces incomplete sensor data. Thus, we have to fill in the gaps of any missing values in the sensor data in order to provide sensor-based services. We propose a method to estimate a missing value of incomplete sensor data. It accurately estimates a missing value by repeating two processes: selecting sensors locally correlated with the sensor that includes the missing value and then updating the training sensor dataset that consist of data from the selected sensors available for multiple regression. This procedure effectively helps to find more suitable neighbor records of a query record from the training sensor dataset and to refine the regression model using the records. It overcomes three problems that other estimation methods have: a decrease in the amount of available training sensor dataset due to missing values, the difficulty in finding similar records of a query due to the “curse of dimensionality,” and the complexity in formalizing the estimation model due to “overfitting.” The main feature of our method is the way it repeatedly prunes inessential sensors while exploiting the anti-monotone property in which the training sensor dataset R' that consist of the sensors V' ⊂ V is larger than the data R that consist of V. Empirical evaluations done using public datasets in which we appended missing values show that our method increases the training sensor dataset for estimation and improves estimation accuracy through repeated sensor selections. Furthermore, we confirmed through a field trial and a life-log enrichment trial, that our method was effective for estimating missing sensor values in a participatory sensing environment.

[1]  Emmanuel J. Candès,et al.  A Singular Value Thresholding Algorithm for Matrix Completion , 2008, SIAM J. Optim..

[2]  M. Hansen,et al.  Participatory Sensing , 2019, Internet of Things.

[3]  Paul A Murtaugh,et al.  Performance of several variable-selection methods applied to real ecological data. , 2009, Ecology letters.

[4]  Andrew Campbell,et al.  The Rise of People-Centric Sensing , 2008, IEEE Internet Computing.

[5]  Wen Hu,et al.  Ear-phone: an end-to-end participatory urban noise mapping system , 2010, IPSN '10.

[6]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[7]  Nicole A. Lazar,et al.  Statistical Analysis With Missing Data , 2003, Technometrics.

[8]  J. Graham,et al.  Missing data analysis: making it work in the real world. , 2009, Annual review of psychology.

[9]  Minho Shin,et al.  Anonysense: privacy-aware people-centric sensing , 2008, MobiSys '08.

[10]  Russ B. Altman,et al.  Missing value estimation methods for DNA microarrays , 2001, Bioinform..

[11]  John K. Dixon,et al.  Pattern Recognition with Partly Missing Data , 1979, IEEE Transactions on Systems, Man, and Cybernetics.

[12]  Suman Nath,et al.  Privacy-aware regression modeling of participatory sensing data , 2010, SenSys '10.

[13]  Siyuan Liu,et al.  Effective routine behavior pattern discovery from sparse mobile phone data via collaborative filtering , 2013, 2013 IEEE International Conference on Pervasive Computing and Communications (PerCom).

[14]  Qinghua Li,et al.  Providing privacy-aware incentives for mobile sensing , 2013, 2013 IEEE International Conference on Pervasive Computing and Communications (PerCom).

[15]  Emmanuel J. Candès,et al.  Exact Matrix Completion via Convex Optimization , 2008, Found. Comput. Math..

[16]  Arthur E. Hoerl,et al.  Ridge Regression: Biased Estimation for Nonorthogonal Problems , 2000, Technometrics.

[17]  R. Tibshirani,et al.  Least angle regression , 2004, math/0406456.

[18]  Tarek F. Abdelzaher,et al.  GreenGPS: a participatory sensing fuel-efficient maps application , 2010, MobiSys '10.

[19]  Andrea Montanari,et al.  Matrix Completion from Noisy Entries , 2009, J. Mach. Learn. Res..

[20]  Vana Kalogeraki,et al.  Privacy preservation for participatory sensing data , 2013, 2013 IEEE International Conference on Pervasive Computing and Communications (PerCom).

[21]  Xi Fang,et al.  Crowdsourcing to smartphones: incentive mechanism design for mobile phone sensing , 2012, Mobicom '12.

[22]  Hisashi Kurasawa,et al.  Top of worlds: method for improving motivation to participate in sensing services , 2012, UbiComp '12.