Efficient Linear Fusion of Distributed MMSE Estimators for Big Data

Many signal processing applications require performing statistical inference on large datasets, where computational and/or memory restrictions become an issue. In this big data setting, computing an exact global centralized estimator is often unfeasible. Furthermore, even when approximate numerical solutions (e.g., based on Monte Carlo methods) working directly on the whole dataset can be computed, they may not provide a satisfactory performance either. Hence, several authors have recently started considering distributed inference approaches, where the data is divided among multiple workers (cores, machines or a combination of both). The computations are then performed in parallel and the resulting distributed or partial estimators are finally combined to approximate the intractable global estimator. In this paper, we focus on the scenario where no communication exists among the workers, deriving efficient linear fusion rules for the combination of the distributed estimators. Both a Bayesian perspective (based on the Bernstein-von Mises theorem and the asymptotic normality of the estimators) and a constrained optimization view are provided for the derivation of the linear fusion rules proposed. We concentrate on minimum mean squared error (MMSE) partial estimators, but the approach is more general and can be used to combine any kind of distributed estimators as long as they are unbiased. Numerical results show the good performance of the algorithms developed, both in simple problems where analytical expressions can be obtained for the distributed MMSE estimators, and in a wireless sensor network localization problem where Monte Carlo methods are used to approximate the partial estimators.

[1]  Frédéric Lavancier,et al.  A general procedure to combine estimators , 2014, Comput. Stat. Data Anal..

[2]  Edward J. Wegman,et al.  Statistical Signal Processing , 1985 .

[3]  Kenneth F. Wallis,et al.  Combining forecasts – forty years later , 2011 .

[4]  Xiaodong Wang,et al.  Monte Carlo methods for signal processing , 2005 .

[5]  A. V. D. Vaart Asymptotic Statistics: Delta Method , 1998 .

[6]  Michael Allen,et al.  Parallel programming: techniques and applications using networked workstations and parallel computers , 1998 .

[7]  L. L. Cam,et al.  Asymptotic Methods In Statistical Decision Theory , 1986 .

[8]  Arnaud Doucet,et al.  On the Utility of Graphics Cards to Perform Massively Parallel Simulation of Advanced Monte Carlo Methods , 2009, Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America.

[9]  G.B. Giannakis,et al.  Distributed compression-estimation using wireless sensor networks , 2006, IEEE Signal Processing Magazine.

[10]  Georgios B. Giannakis,et al.  Signal processing for Big Data , 2014, 2014 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA).

[11]  Mónica F. Bugallo,et al.  Efficient linear combination of partial Monte Carlo estimators , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[12]  Edward I. George,et al.  Bayes and big data: the consensus Monte Carlo algorithm , 2016, Big Data and Information Theory.

[13]  R. Bordley The Combination of Forecasts: a Bayesian Approach , 1982 .

[14]  Qing Zhao,et al.  Distributed Learning in Wireless Sensor Networks , 2007 .

[15]  Don H. Johnson,et al.  Statistical Signal Processing , 2009, Encyclopedia of Biometrics.

[16]  Cliburn Chan,et al.  Understanding GPU Programming for Statistical Computation: Studies in Massively Parallel Massive Mixtures , 2010, Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America.

[17]  Christian P. Robert,et al.  Monte Carlo Statistical Methods , 2005, Springer Texts in Statistics.

[18]  N. Chopin A sequential particle filter method for static models , 2002 .

[19]  Jean Dickinson Gibbons,et al.  Nonparametric Statistical Inference , 1972, International Encyclopedia of Statistical Science.

[20]  Ananthram Swami,et al.  Wireless Sensor Networks: Signal Processing and Communications , 2007 .

[21]  Reza Olfati-Saber,et al.  Consensus and Cooperation in Networked Multi-Agent Systems , 2007, Proceedings of the IEEE.

[22]  H. V. Trees Detection, Estimation, And Modulation Theory , 2001 .

[23]  J. Mendel Lessons in Estimation Theory for Signal Processing, Communications, and Control , 1995 .

[24]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[25]  Darren J. Wilkinson,et al.  Parallel Bayesian Computation , 2005 .

[26]  Andrew Gelman,et al.  Sampling for Bayesian Computation with Large Datasets , 2005 .

[27]  Soummya Kar,et al.  Gossip Algorithms for Distributed Signal Processing , 2010, Proceedings of the IEEE.

[28]  S. Godsill,et al.  Special issue on Monte Carlo methods for statistical signal processing , 2002 .

[29]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[30]  Chong Wang,et al.  Asymptotically Exact, Embarrassingly Parallel MCMC , 2013, UAI.

[31]  Greg M. Allenby Cross-Validation, the Bayes Theorem, and Small-Sample Bias , 1990 .

[32]  Ali H. Sayed,et al.  Diffusion LMS Strategies for Distributed Estimation , 2010, IEEE Transactions on Signal Processing.

[33]  A.S. Willsky,et al.  Distributed fusion in sensor networks , 2006, IEEE Signal Processing Magazine.

[34]  J. Dickinson Some Statistical Results in the Combination of Forecasts , 1973 .

[35]  P. Müller,et al.  Bayesian Nonparametrics: An invitation to Bayesian nonparametrics , 2010 .

[36]  Sailes K. Sengijpta Fundamentals of Statistical Signal Processing: Estimation Theory , 1995 .

[37]  J. M. Bates,et al.  The Combination of Forecasts , 1969 .

[38]  Xiangyu Wang,et al.  Parallelizing MCMC via Weierstrass Sampler , 2013, 1312.4605.

[39]  G. Casella,et al.  Statistical Inference , 2003, Encyclopedia of Social Network Analysis and Mining.

[40]  Mónica F. Bugallo,et al.  Efficient Multiple Importance Sampling Estimators , 2015, IEEE Signal Processing Letters.