A Convergent Semi-Proximal Alternating Direction Method of Multipliers for Recovering Internet Traffics from Link Measurements

It is challenging to recover the large-scale internet traffic data purely from the link measurements. With the rapid growth of the problem scale, it will be extremely difficult to sustain the recovery accuracy and the computational cost. In this work, we propose a new Sparsity Low-Rank Recovery (SLRR) model and its Schur Complement Based semi-proximal Alternating Direction Method of Multipliers (SCB-spADMM) solver. Our approach distinguishes itself mainly for the following two aspects. First, we fully exploit the spatial low-rank property and the sparsity of traffic data, which are barely considered in the literature. Our model can be divided into a series of subproblems, which only relate to the traffics in a certain individual time interval. Thus, the model scale is significantly reduced. Second, we establish a globally convergent ADMM-type algorithm inspired by [Li et al., Math. Program., 155(2016)] to solve the SLRR model. In each iteration, all the intermediate variables’ optimums can be calculated analytically, which makes the algorithm fast and accurate. Besides, due to the separability of the SLRR model, it is possible to design a parallel algorithm to further reduce computational time. According to the numerical results on the classic datasets Abilene and GEANT, our method achieves the best accuracy with a low computational cost. Moreover, in our newly released large-scale Huawei Origin-Destination (HOD) network traffics, our method perfectly reaches the seconds-level feedback, which meets the essential requirement for practical scenarios. keywords: Large-scale network recovery, HOD dataset, Spatial low-rankness, Nuclear norm minimization, semi-proximal ADMM

[1]  Xiaoming Yuan,et al.  Matrix completion via an alternating direction method , 2012 .

[2]  Lei Shi,et al.  A Compressive Sensing-Based Approach to End-to-End Network Traffic Reconstruction , 2020, IEEE Transactions on Network Science and Engineering.

[3]  Geoffrey J. McLachlan,et al.  Analyzing Microarray Gene Expression Data , 2004 .

[4]  Pablo A. Parrilo,et al.  Guaranteed Minimum-Rank Solutions of Linear Matrix Equations via Nuclear Norm Minimization , 2007, SIAM Rev..

[5]  Mikael Johansson,et al.  Traffic matrix estimation on a large IP backbone: a comparison on real data , 2004, IMC '04.

[6]  L. Tucker,et al.  Some mathematical notes on three-mode factor analysis , 1966, Psychometrika.

[7]  Yair Wiseman,et al.  Real-time monitoring of traffic congestions , 2017, 2017 IEEE International Conference on Electro Information Technology (EIT).

[8]  Matthew Roughan,et al.  Internet Traffic Matrices: A Primer , 2013 .

[9]  Xin Wang,et al.  Efficiently Inferring Top-k Elephant Flows based on Discrete Tensor Completion , 2019, IEEE INFOCOM 2019 - IEEE Conference on Computer Communications.

[10]  Carsten Lund,et al.  Estimating point-to-point and point-to-multipoint traffic matrices: an information-theoretic approach , 2005, IEEE/ACM Transactions on Networking.

[11]  Kim-Chuan Toh,et al.  An efficient inexact symmetric Gauss–Seidel based majorized ADMM for high-dimensional convex composite conic programming , 2015, Mathematical Programming.

[12]  Yin Zhang,et al.  Solving a low-rank factorization model for matrix completion by a nonlinear successive over-relaxation algorithm , 2012, Mathematical Programming Computation.

[13]  Dafang Zhang,et al.  Accurate Recovery of Missing Network Measurement Data With Localized Tensor Completion , 2019, IEEE/ACM Transactions on Networking.

[14]  Haoyu Song,et al.  Fast hash table lookup using extended bloom filter: an aid to network processing , 2005, SIGCOMM '05.

[15]  Emmanuel J. Candès,et al.  The Power of Convex Relaxation: Near-Optimal Matrix Completion , 2009, IEEE Transactions on Information Theory.

[16]  B. Yu,et al.  Time-varying network tomography: router link data , 2000, 2000 IEEE International Symposium on Information Theory (Cat. No.00CH37060).

[17]  V. Jacobson,et al.  Congestion avoidance and control , 1988, SIGCOMM '88.

[18]  J. Kruskal Rank, decomposition, and uniqueness for 3-way and n -way arrays , 1989 .

[19]  Mikkel Thorup,et al.  Optimizing OSPF/IS-IS weights in a changing world , 2002, IEEE J. Sel. Areas Commun..

[20]  Werner Dubitzky,et al.  Fundamentals of Data Mining in Genomics and Proteomics , 2009 .

[21]  Walter Willinger,et al.  Spatio-Temporal Compressive Sensing and Internet Traffic Matrices (Extended Version) , 2012, IEEE/ACM Transactions on Networking.

[22]  Ivan Oseledets,et al.  Tensor-Train Decomposition , 2011, SIAM J. Sci. Comput..

[23]  Albert G. Greenberg,et al.  Fast accurate computation of large-scale IP traffic matrices from link loads , 2003, SIGMETRICS '03.

[24]  Lei Guo,et al.  A compressive sensing‐based approach to end‐to‐end network traffic reconstruction utilising partial measured origin‐destination flows , 2015, Trans. Emerg. Telecommun. Technol..

[25]  H. Kiers Towards a standardized notation and terminology in multiway analysis , 2000 .

[26]  Dafang Zhang,et al.  Spatio-temporal tensor completion for imputing missing internet traffic data , 2015, 2015 IEEE 34th International Performance Computing and Communications Conference (IPCCC).

[27]  Amin Vahdat,et al.  B4 and after: managing hierarchy, partitioning, and asymmetry for availability and scale in google's software-defined WAN , 2018, SIGCOMM.

[28]  Stephen P. Boyd,et al.  Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers , 2011, Found. Trends Mach. Learn..

[29]  Gaogang Xie,et al.  Accurate recovery of internet traffic data under dynamic measurements , 2017, IEEE INFOCOM 2017 - IEEE Conference on Computer Communications.

[30]  Xuyun Zhang,et al.  Collaborative Network Traffic Analysis via Alternating Direction Method of Multipliers , 2018, 2018 IEEE 22nd International Conference on Computer Supported Cooperative Work in Design ((CSCWD)).

[31]  Y. Vardi,et al.  Network Tomography: Estimating Source-Destination Traffic Intensities from Link Data , 1996 .

[32]  Dafang Zhang,et al.  Active Sparse Mobile Crowd Sensing Based on Matrix Completion , 2019, SIGMOD Conference.

[33]  M. Kilmer,et al.  Factorization strategies for third-order tensors , 2011 .

[34]  Bingsheng He,et al.  The direct extension of ADMM for multi-block convex minimization problems is not necessarily convergent , 2014, Mathematical Programming.

[35]  Kim-Chuan Toh,et al.  On the equivalence of inexact proximal ALM and ADMM for a class of convex composite programming , 2018, Math. Program..

[36]  Kim-Chuan Toh,et al.  A Schur complement based semi-proximal ADMM for convex quadratic conic programming and extensions , 2014, Mathematical Programming.

[37]  Jiannong Cao,et al.  Fast Tensor Factorization for Accurate Internet Anomaly Detection , 2017, IEEE/ACM Transactions on Networking.

[38]  Gaogang Xie,et al.  Sequential and adaptive sampling for matrix completion in network monitoring systems , 2015, 2015 IEEE Conference on Computer Communications (INFOCOM).

[39]  Steve Uhlig,et al.  Providing public intradomain traffic matrices to the research community , 2006, CCRV.

[40]  Konstantina Papagiannaki,et al.  Structural analysis of network traffic flows , 2004, SIGMETRICS '04/Performance '04.

[41]  장주욱 [서평]「Wide Area Network Design : Concepts and Tools for Optimization」 , 2000 .

[42]  David A. Maltz,et al.  Network traffic characteristics of data centers in the wild , 2010, IMC '10.