Slow Start Transition in Participatory Sensing Applications

In this paper, we present the "Slow Start Problem" in participatory sensing applications where a service is provided based on data collected by participants. The slow start problem refers to the initial stage in participatory sensing service deployment, during which service adoption remains sparse and, hence, the collected data does not offer adequate coverage. Predictive models, learned from data, offer a way to generalize from sparse observations, but the models themselves need to be statistically reliable to offer a reliable service. To achieve service reliability, this paper offers a modeling approach, where simpler models are used initially, gradually transitioning to more elaborate models, when enough data is collected. A key challenge and contribution of the work is to time model transitions correctly to provide theoretical guarantees on modeling error. Our technique takes a holistic approach in bounding modeling error as opposed to prior statistical approaches that bound the error of a single model component at a time. This technique is tested in the context of a vehicular participatory sensing application, called GreenGPS, where participant data is used to build models that predict fuel consumption of vehicles on different routes for the purposes of choosing the most fuel-efficient route for each vehicle (as opposed to the shortest or fastest). We show that our approach significantly reduces prediction error in the initial stages of deployment.

[1]  B. Muthén,et al.  How to Use a Monte Carlo Study to Decide on Sample Size and Determine Power , 2002 .

[2]  David A. Cohn,et al.  Active Learning with Statistical Models , 1996, NIPS.

[3]  Mirco Musolesi,et al.  The Rise of People-Centric Sensing , 2008, IEEE Internet Comput..

[4]  Leonidas J. Guibas,et al.  Mobiscopes for Human Spaces , 2007, IEEE Pervasive Computing.

[5]  Fatemeh Saremi,et al.  Combining Map-Based Inference and Crowd-Sensing for Detecting Traffic Regulators , 2015, 2015 IEEE 12th International Conference on Mobile Ad Hoc and Sensor Systems.

[6]  Robert D. Nowak,et al.  Faster Rates in Regression via Active Learning , 2005, NIPS.

[7]  A Alexander Beaujean Sample Size Determination for Regression Models Using Monte Carlo Methods in R , 2014 .

[8]  Mani Srivastava,et al.  Human-centric sensing , 2012, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[9]  J. Kiefer,et al.  Optimum Designs in Regression Problems , 1959 .

[10]  Robert D. Nowak,et al.  Backcasting: adaptive sampling for sensor networks , 2004, Third International Symposium on Information Processing in Sensor Networks, 2004. IPSN 2004.

[11]  Hengchang Liu,et al.  Experiences with GreenGPS—Fuel-Efficient Navigation Using Participatory Sensing , 2016, IEEE Transactions on Mobile Computing.

[12]  Shalabh,et al.  Linear Models and Generalizations: Least Squares and Alternatives , 2007 .

[13]  Emily R. Dickinson,et al.  Exploring the Limitations of Measures of Students' Socioeconomic Status (SES) , 2014 .

[14]  W. N. Venables,et al.  An introduction to R : a programming environment for data analysis and graphics , 2009 .

[15]  Mark H. Hansen,et al.  Participatory sensing - eScholarship , 2006 .

[16]  Ken Kelley,et al.  Sample size for multiple regression: obtaining regression coefficients that are accurate, not simply significant. , 2003, Psychological methods.

[17]  Ken Kelley,et al.  Sample Size Planning with Applications to Multiple Regression: Power and Accuracy for Omnibus and Targeted Effects , 2008 .