Online reconstruction from big data via compressive censoring

This is an era of data deluge with individuals and pervasive sensors acquiring large and ever-increasing amounts of data. Nevertheless, given the inherent redundancy, the costs related to data acquisition, transmission, and storage can be reduced if the per-datum importance is properly exploited. In this context, the present paper investigates sparse linear regression with censored data that appears naturally under diverse data collection setups. A practical censoring rule is proposed here for data reduction purposes. A sparsity-aware censored maximum-likelihood estimator is also developed, which fits well to big data applications. Building on recent advances in online convex optimization, a novel algorithm is finally proposed to enable real-time processing. The online algorithm applies even to the general censoring setup, while its simple closed-form updates enjoy provable convergence. Numerical simulations corroborate its effectiveness in estimating sparse signals from only a subset of exact observations, thus reducing the processing cost in big data applications.

[1]  Dean P. Foster,et al.  The risk inflation criterion for multiple regression , 1994 .

[2]  Georgios B. Giannakis,et al.  Online Adaptive Estimation of Sparse Signals: Where RLS Meets the $\ell_1$ -Norm , 2010, IEEE Transactions on Signal Processing.

[3]  Georgios B. Giannakis,et al.  Distributed Robust Power System State Estimation , 2012, IEEE Transactions on Power Systems.

[4]  H. Poor,et al.  Censoring for Collaborative Spectrum Sensing in Cognitive Radios , 2007, 2007 Conference Record of the Forty-First Asilomar Conference on Signals, Systems and Computers.

[5]  Stephen P. Boyd,et al.  Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers , 2011, Found. Trends Mach. Learn..

[6]  Joel A. Tropp,et al.  Just relax: convex programming methods for identifying sparse signals in noise , 2006, IEEE Transactions on Information Theory.

[7]  Ludger Evers,et al.  Sparse kernel methods for high-dimensional survival data , 2008, Bioinform..

[8]  Georgios B. Giannakis,et al.  Sensor-Centric Data Reduction for Estimation With WSNs via Censoring and Quantization , 2012, IEEE Transactions on Signal Processing.

[9]  David L Donoho,et al.  Compressed sensing , 2006, IEEE Transactions on Information Theory.

[10]  Econo Metrica REGRESSION ANALYSIS WHEN THE DEPENDENT VARIABLE IS TRUNCATED NORMAL , 2016 .

[11]  Lihua Xie,et al.  Asymptotically Optimal Parameter Estimation With Scheduled Measurements , 2013, IEEE Transactions on Signal Processing.

[12]  Ambuj Tewari,et al.  Composite objective mirror descent , 2010, COLT 2010.

[13]  J. Tobin Estimation of Relationships for Limited Dependent Variables , 1958 .

[14]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[15]  Dmitry M. Malioutov,et al.  Sequential Compressed Sensing , 2010, IEEE Journal of Selected Topics in Signal Processing.

[16]  T. Amemiya Tobit models: A survey , 1984 .

[17]  Gang Wang,et al.  Power Scheduling of Kalman Filtering in Wireless Sensor Networks with Data Packet Drops , 2013 .