Streaming big data meets backpressure in distributed network computation

We study network response to a stream of queries that require computations on remotely located data, and we seek to characterize the network performance limits in terms of maximum sustainable query rate that can be satisfied. The available network setup consists of (i) a communication network graph with finite-bandwidth links over which data is routed, (ii) computation nodes with certain computation capacity, over which computation load is balanced, and (iii) network nodes that need to schedule raw and processed data transmissions. Our aim is to design a universal methodology and distributed algorithm to adaptively allocate resources in order to support maximum query rate. The proposed algorithms extend in a nontrivial way the backpressure (BP) algorithm to take into account computations carried out in the presence of query streams. They contribute to the fundamental understanding of network computation performance limits when the query rate is limited by both the communication bandwidth and the computation capacity, a classical setting that arises in streaming big data applications in network clouds and fogs.

[1]  Sanjay Shakkottai,et al.  Towards a queueing-based framework for in-network function computation , 2011, 2011 IEEE International Symposium on Information Theory Proceedings.

[2]  R. Srikant,et al.  Scheduling Efficiency of Distributed Greedy Scheduling Algorithms in Wireless Networks , 2006, Proceedings IEEE INFOCOM 2006. 25TH IEEE International Conference on Computer Communications.

[3]  David Thaler,et al.  Using name-based mappings to increase hit rates , 1998, TNET.

[4]  Longbo Huang,et al.  Utility optimal scheduling in processing networks , 2010, Perform. Evaluation.

[5]  Leandros Tassiulas,et al.  Resource Allocation and Cross-Layer Control in Wireless Networks , 2006, Found. Trends Netw..

[6]  Eytan Modiano,et al.  Dynamic power allocation and routing for time varying wireless networks , 2003, IEEE INFOCOM 2003. Twenty-second Annual Joint Conference of the IEEE Computer and Communications Societies (IEEE Cat. No.03CH37428).

[7]  Panganamala Ramana Kumar,et al.  Computing and communicating functions over sensor networks , 2005, IEEE Journal on Selected Areas in Communications.

[8]  Sreeram Kannan,et al.  Multi-Session Function Computation and Multicasting in Undirected Graphs , 2013, IEEE Journal on Selected Areas in Communications.

[9]  Jean Walrand,et al.  Stable and utility-maximizing scheduling for stochastic processing networks , 2009, 2009 47th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[10]  D. Manjunath,et al.  Optimal Embedding of Functions for In-Network Computation: Complexity Analysis and Algorithms , 2014, IEEE/ACM Transactions on Networking.

[11]  Laurent Massoulié,et al.  Randomized Decentralized Broadcasting Algorithms , 2007, IEEE INFOCOM 2007 - 26th IEEE International Conference on Computer Communications.

[12]  Ness B. Shroff,et al.  On distributed computation rate optimization for deploying cloud computing programming frameworks , 2013, PERV.

[13]  Leandros Tassiulas,et al.  Stability properties of constrained queueing systems and scheduling policies for maximum throughput in multihop radio networks , 1992 .

[14]  Massimo Franceschetti,et al.  Network Coding for Computing: Cut-Set Bounds , 2009, IEEE Transactions on Information Theory.

[15]  D. Manjunath,et al.  Network flows for functions , 2010, 2011 IEEE International Symposium on Information Theory Proceedings.

[16]  Jr. C. Humes,et al.  A regulator stabilization technique: Kumar-Seidman revisited , 1994, IEEE Trans. Autom. Control..

[17]  Donald F. Towsley,et al.  A unified modeling framework for distributed resource allocation of general fork and join processing networks , 2010, SIGMETRICS '10.

[18]  Ness B. Shroff,et al.  Greedy Maximal Matching: Performance Limits for Arbitrary Network Graphs Under the Node-Exclusive Interference Model , 2009, IEEE Transactions on Automatic Control.