Scalable Estimator for Multi-task Gaussian Graphical Models Based in an IoT Network

Recently, the Internet of Things (IoT) receives significant interest due to its rapid development. But IoT applications still face two challenges: heterogeneity and large scale of IoT data. Therefore, how to efficiently integrate and process these complicated data becomes an essential problem. In this article, we focus on the problem that analyzing variable dependencies of data collected from different edge devices in the IoT network. Because data from different devices are heterogeneous and the variable dependencies can be characterized into a graphical model, we can focus on the problem that jointly estimating multiple, high-dimensional, and sparse Gaussian Graphical Models for many related tasks (edge devices). This is an important goal in many fields. Many IoT networks have collected massive multi-task data and require the analysis of heterogeneous data in many scenarios. Past works on the joint estimation are non-distributed and involve computationally expensive and complex non-smooth optimizations. To address these problems, we propose a novel approach: Multi-FST. Multi-FST can be efficiently implemented on a cloud-server-based IoT network. The cloud server has a low computational load and IoT devices use asynchronous communication with the server, leading to efficiency. Multi-FST shows significant improvement, over baselines, when tested on various datasets.

[1]  T. Cai,et al.  A Constrained ℓ1 Minimization Approach to Sparse Precision Matrix Estimation , 2011, 1102.2233.

[2]  Mainak Chatterjee,et al.  Prospect Theoretic Approach for Data Integrity in IoT Networks under Manipulation Attacks , 2018, ArXiv.

[3]  Rajiv Ranjan,et al.  Processing Distributed Internet of Things Data in Clouds , 2015, IEEE Cloud Computing.

[4]  Stephen P. Boyd,et al.  Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers , 2011, Found. Trends Mach. Learn..

[5]  Patrick Danaher,et al.  The joint graphical lasso for inverse covariance estimation across multiple classes , 2011, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[6]  M. West,et al.  Sparse graphical models for exploring gene expression data , 2004 .

[7]  Weixin Xie,et al.  Weighted Fused Pathway Graphical Lasso for Joint Estimation of Multiple Gene Networks , 2019, Front. Genet..

[8]  FetzerChristof,et al.  The Timed Asynchronous Distributed System Model , 1999 .

[9]  Holger Hoefling A Path Algorithm for the Fused Lasso Signal Approximator , 2009, 0910.0526.

[10]  Noah Simon,et al.  A Sparse-Group Lasso , 2013 .

[11]  R. Tibshirani,et al.  Sparse inverse covariance estimation with the graphical lasso. , 2008, Biostatistics.

[12]  Daniel P. Kennedy,et al.  The Autism Brain Imaging Data Exchange: Towards Large-Scale Evaluation of the Intrinsic Brain Architecture in Autism , 2013, Molecular Psychiatry.

[13]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[14]  E. Levina,et al.  Joint estimation of multiple graphical models. , 2011, Biometrika.

[15]  Mo Sha,et al.  Robust Optimal Selection of Radio Type and Transmission Power for Internet of Things , 2019, ACM Trans. Sens. Networks.

[16]  Adam J. Rothman,et al.  Generalized Thresholding of Large Covariance Matrices , 2009 .

[17]  Athanasios V. Vasilakos,et al.  IoT-Based Big Data Storage Systems in Cloud Computing: Perspectives and Challenges , 2017, IEEE Internet of Things Journal.

[18]  Takashi Washio,et al.  Learning a common substructure of multiple graphical Gaussian models , 2012, Neural Networks.

[19]  Henry Markram,et al.  Computing the size and number of neuronal clusters in local circuits , 2013, Front. Neuroanat..

[20]  Yan Wang,et al.  Computation Offloading with Multiple Agents in Edge-Computing–Supported IoT , 2019, ACM Trans. Sens. Networks.

[21]  Jeffrey D. Ullman,et al.  Set Merging Algorithms , 1973, SIAM J. Comput..

[22]  Antonio Iera,et al.  The Internet of Things: A survey , 2010, Comput. Networks.

[23]  Beilun Wang,et al.  A constrained $$\ell $$ℓ1 minimization approach for estimating multiple sparse Gaussian or nonparanormal graphical models , 2016, Machine Learning.

[24]  M. Yuan,et al.  Model selection and estimation in the Gaussian graphical model , 2007 .

[25]  Xiaotong Shen,et al.  Structural Pursuit Over Multiple Undirected Graphs , 2014, Journal of the American Statistical Association.

[26]  H. Zou The Adaptive Lasso and Its Oracle Properties , 2006 .

[27]  Yi Zhang,et al.  Prognostic gene expression signatures can be measured in tissues collected in RNAlater preservative. , 2006, The Journal of molecular diagnostics : JMD.

[28]  Jin Luo,et al.  Behavioral feature recognition of multi-task compressed sensing with fusion relevance in the Internet of Things environment , 2020, Comput. Commun..

[29]  Weidong Liu,et al.  Adaptive Thresholding for Sparse Covariance Matrix Estimation , 2011, 1102.2237.

[30]  Adam J. Rothman,et al.  Sparse permutation invariant covariance estimation , 2008, 0801.4837.

[31]  Christophe Ambroise,et al.  Inferring multiple graphical structures , 2009, Stat. Comput..

[32]  Nicholas R. Jennings,et al.  Real-time information processing of environmental sensor network data using bayesian gaussian processes , 2012, ACM Trans. Sens. Networks.

[33]  M. Ben-Daya,et al.  Internet of things and supply chain management: a literature review , 2019, Int. J. Prod. Res..

[34]  Russell A. Poldrack,et al.  OpenfMRI: Open sharing of task fMRI data , 2017, NeuroImage.

[35]  J. Friedman,et al.  New Insights and Faster Computations for the Graphical Lasso , 2011 .

[36]  Su-In Lee,et al.  Node-based learning of multiple Gaussian graphical models , 2013, J. Mach. Learn. Res..

[37]  Jianqing Fan,et al.  Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties , 2001 .

[38]  Xiangfeng Wang,et al.  Asynchronous Distributed ADMM for Large-Scale Optimization—Part I: Algorithm and Convergence Analysis , 2015, IEEE Transactions on Signal Processing.

[39]  K. Mani Chandy,et al.  Asynchronous distributed simulation via a sequence of parallel computations , 1981, CACM.

[40]  Nancy A. Lynch,et al.  Efficiency of Synchronous Versus Asynchronous Distributed Systems , 1983, J. ACM.

[41]  Terence Tao,et al.  The Dantzig selector: Statistical estimation when P is much larger than n , 2005, math/0506081.

[42]  F. Zhan,et al.  The role of the Wnt-signaling antagonist DKK1 in the development of osteolytic lesions in multiple myeloma. , 2003, The New England journal of medicine.

[43]  Takashi Watanabe,et al.  A Multi-Task Scheduling Mechanism Based on ACO for Maximizing Workers’ Benefits in Mobile Crowdsensing Service Markets With the Internet of Things , 2019, IEEE Access.