Portkey: Adaptive Key-Value Placement over Dynamic Edge Networks

Owing to a need for low latency data accesses, emerging IoT and mobile applications commonly require distributed data stores (e.g., key-value or KV stores) to operate entirely at the network's edge. Unfortunately, existing KV stores employ randomized data placement policies (e.g., consistent hashing) that ignore the client mobility and resulting variance in client-server latencies that are inherent to edge applications---the effect is largely suboptimal and inefficient data placement. We present Portkey, a distributed KV store that dynamically adapts data placement according to time-varying client mobility and data access patterns. The key insight with Portkey is to lean into the inherent mobility and prioritize rapid but approximate placement decisions over delayed optimal ones. Doing so enables the efficient tracking of client-server latencies despite edge resource constraints, and the use of greedy placement heuristics that are self-correcting over short timescales. Results with a realistic autonomous vehicle dataset and two small-scale deployments reveal that Portkey reduces average and tail request latency by 21-82% and 26-77% compared to existing placement strategies.

[1]  Abhishek Chandra,et al.  Nebula: Distributed Edge Cloud for Data Intensive Computing , 2014, 2014 IEEE International Conference on Cloud Engineering.

[2]  Ramesh Govindan,et al.  CarMap: Fast 3D Feature Map Updates for Automobiles , 2020, NSDI.

[3]  William A. Arbaugh,et al.  Improving the latency of 802.11 hand-offs using neighbor graphs , 2004, MobiSys '04.

[4]  Harsha V. Madhyastha,et al.  Near-Optimal Latency Versus Cost Tradeoffs in Geo-Distributed Storage , 2020, NSDI.

[5]  Alexander J. Smola,et al.  Scaling Distributed Machine Learning with the Parameter Server , 2014, OSDI.

[6]  Umakishore Ramachandran,et al.  DataFog: Towards a Holistic Data Management Platform for the IoT Age at the Network Edge , 2018, HotEdge.

[7]  Lin Xiao,et al.  YCSB++: benchmarking and performance debugging advanced features in scalable table stores , 2011, SoCC.

[8]  Pan Hui,et al.  Future Networking Challenges: The Case of Mobile Augmented Reality , 2017, 2017 IEEE 37th International Conference on Distributed Computing Systems (ICDCS).

[9]  Deborah Estrin,et al.  GHT: a geographic hash table for data-centric storage , 2002, WSNA '02.

[10]  Xing Zhang,et al.  A Survey on Mobile Edge Networks: Convergence of Computing, Caching and Communications , 2017, IEEE Access.

[11]  Subir Biswas,et al.  Vehicle-to-vehicle wireless communication protocols for enhancing highway traffic safety , 2006, IEEE Communications Magazine.

[12]  Erik G. Ström,et al.  Wireless Access for Ultra-Reliable Low-Latency Communication: Principles and Building Blocks , 2018, IEEE Network.

[13]  Enrico Natalizio,et al.  UAV-assisted disaster management: Applications and open issues , 2016, 2016 International Conference on Computing, Networking and Communications (ICNC).

[14]  Alec Wolman,et al.  Volley: Automated Data Placement for Geo-Distributed Cloud Services , 2010, NSDI.

[15]  Marco Fiore,et al.  The networking shape of vehicular mobility , 2008, MobiHoc '08.

[16]  Arun Ravindran,et al.  An Edge Datastore Architecture For Latency-Critical Distributed Machine Vision Applications , 2018, HotEdge.

[17]  Dipankar Raychaudhuri,et al.  Scalability and Performance Evaluation of Edge Cloud Systems for Latency Constrained Applications , 2018, 2018 IEEE/ACM Symposium on Edge Computing (SEC).

[18]  Aakanksha Chowdhery,et al.  Urban IoT Edge Analytics , 2018 .

[19]  Marco Fiore,et al.  On the instantaneous topology of a large-scale urban vehicular network: the cologne case , 2013, MobiHoc '13.

[20]  Qin Lv,et al.  Earthquake Early Warning and Beyond: Systems Challenges in Smartphone-based Seismic Network , 2019, HotMobile.

[21]  Alexandru Iosup,et al.  Sharing and Caring of Data at the Edge , 2020, HotEdge.

[22]  Eyal de Lara,et al.  Toward Session Consistency for the Edge , 2018, HotEdge.

[23]  Michal Król,et al.  Wireless Sensor Networks and Multi-UAV systems for natural disaster management , 2017, Comput. Networks.

[24]  Kaibin Huang,et al.  Broadband Analog Aggregation for Low-Latency Federated Edge Learning , 2018, IEEE Transactions on Wireless Communications.

[25]  Jorge Bernardino,et al.  Consistency Models of NoSQL Databases , 2019, Future Internet.

[26]  Anja Feldmann,et al.  C3: Cutting Tail Latency in Cloud Data Stores via Adaptive Replica Selection , 2015, NSDI.

[27]  Mehdi Bennis,et al.  Toward Low-Latency and Ultra-Reliable Virtual Reality , 2018, IEEE Network.

[28]  Carlo Fischione,et al.  Low-Latency Networking: Where Latency Lurks and How to Tame It , 2018, Proceedings of the IEEE.

[29]  Tony Savor,et al.  Sharding the Shards: Managing Datastore Locality at Scale with Akkio , 2018, OSDI.

[30]  Jeffrey H. Reed,et al.  Handoff in cellular systems , 1998, IEEE Wirel. Commun..

[31]  Tomasz Stańczyk,et al.  Driver reaction time to lateral entering pedestrian in a simulated crash traffic situation , 2014 .

[32]  Mark Handley,et al.  A scalable content-addressable network , 2001, SIGCOMM '01.

[33]  Christian Esteve Rothenberg,et al.  Mininet-WiFi: Emulating software-defined wireless networks , 2015, 2015 11th International Conference on Network and Service Management (CNSM).

[34]  Hubert Eichner,et al.  Towards Federated Learning at Scale: System Design , 2019, SysML.

[35]  Arun Sharma,et al.  Social Hash: An Assignment Framework for Optimizing Distributed Systems Operations on Social Networks , 2016, NSDI.

[36]  Adnan Shahid Khan,et al.  A Review of Vehicle to Vehicle Communication Protocols for VANETs in the Urban Environment , 2018, Future Internet.

[37]  Weisong Shi,et al.  Collaborative cloud-edge computation for personalized driving behavior modeling , 2019, SEC.

[38]  Peter Richtárik,et al.  Federated Learning: Strategies for Improving Communication Efficiency , 2016, ArXiv.

[39]  Lingjia Tang,et al.  The Architectural Implications of Autonomous Driving: Constraints and Acceleration , 2018, ASPLOS.

[40]  Azzedine Boukerche,et al.  An Architecture for Hierarchical Software-Defined Vehicular Networks , 2017, IEEE Communications Magazine.

[41]  C. Berenstein,et al.  Network tomography , 2007 .

[42]  Dan Pei,et al.  Characterizing and Improving WiFi Latency in Large-Scale Operational Networks , 2016, MobiSys.

[43]  Tao Chen,et al.  Millions of Tiny Databases , 2020, NSDI.

[44]  David R. Karger,et al.  Chord: a scalable peer-to-peer lookup protocol for internet applications , 2003, TNET.

[45]  Prashant Malik,et al.  Cassandra: a decentralized structured storage system , 2010, OPSR.

[46]  Urszula Boryczka,et al.  Vehicle route planning in e-waste mobile collection on demand supported by artificial intelligence algorithms , 2018, Transportation Research Part D: Transport and Environment.

[47]  Umakishore Ramachandran,et al.  FogStore: A Geo-Distributed Key-Value Store Guaranteeing Low Latency for Strongly Consistent Access , 2018, DEBS.

[48]  P. Stone,et al.  Intersection Management Protocol for Mixed Autonomous and Human-Operated Vehicles , 2017 .

[49]  Kemal Fidanboylu,et al.  An Overview of Handoff Techniques in Cellular Networks , 2005 .

[50]  Zhe Wang,et al.  Multi-Probe LSH: Efficient Indexing for High-Dimensional Similarity Search , 2007, VLDB.

[51]  Ruben Mayer,et al.  FogStore: Toward a distributed data store for Fog computing , 2017, 2017 IEEE Fog World Congress (FWC).

[52]  Ramesh Govindan,et al.  AVR: Augmented Vehicular Reality , 2018, MobiSys.

[53]  Samir Khuller,et al.  Algorithms for non-uniform size data placement on parallel disks , 2003, J. Algorithms.

[54]  Do Van Thanh,et al.  Crowdsourcing-Based Disaster Management Using Fog Computing in Internet of Things Paradigm , 2016, 2016 IEEE 2nd International Conference on Collaboration and Internet Computing (CIC).

[55]  Abhishek Chandra,et al.  TripS: automated multi-tiered data placement in a geo-distributed cloud environment , 2017, SYSTOR.

[56]  Vatsal Sharan,et al.  Moment-Based Quantile Sketches for Efficient High Cardinality Aggregation Queries , 2018, Proc. VLDB Endow..

[57]  Wasiq Noor Ahmad Qasmi,et al.  A Low Latency and Consistent Cellular Control Plane , 2020, SIGCOMM.

[58]  Adam Silberstein,et al.  Benchmarking cloud serving systems with YCSB , 2010, SoCC '10.

[59]  Homin K. Lee,et al.  DDSketch: A Fast and Fully-Mergeable Quantile Sketch with Relative-Error Guarantees , 2019, Proc. VLDB Endow..

[60]  Antony I. T. Rowstron,et al.  Pastry: Scalable, Decentralized Object Location, and Routing for Large-Scale Peer-to-Peer Systems , 2001, Middleware.

[61]  Jessica R. Murray,et al.  Crowdsourced earthquake early warning , 2015, Science Advances.

[62]  Eunhee Chang,et al.  Virtual Reality Sickness: A Review of Causes and Measurements , 2020, Int. J. Hum. Comput. Interact..

[63]  Chaitanya Swamy,et al.  Approximation Algorithms for Data Placement Problems , 2008, SIAM J. Comput..

[64]  Ramesh Govindan,et al.  Real-time traffic estimation at vehicular edge nodes , 2017, SEC.

[65]  Weisong Shi,et al.  Edge Computing: Vision and Challenges , 2016, IEEE Internet of Things Journal.

[66]  Abhishek Chandra,et al.  DeCaf: Iterative Collaborative Processing over the Edge , 2019, HotEdge.

[67]  Dipankar Raychaudhuri,et al.  Towards efficient edge cloud augmentation for virtual reality MMOGs , 2017, SEC.

[68]  Marco Pavone,et al.  A Congestion-aware Routing Scheme for Autonomous Mobility-on-Demand Systems , 2019, 2019 18th European Control Conference (ECC).

[69]  Divyakant Agrawal,et al.  DPaxos: Managing Data Closer to Users for Low-Latency and Mobile Applications , 2018, SIGMOD Conference.

[70]  Hakim Ghazzai,et al.  A Generic Data-Driven Recommendation System for Large-Scale Regular and Ride-Hailing Taxi Services , 2020 .