You Only Look Once, But Compute Twice: Service Function Chaining for Low-Latency Object Detection in Softwarized Networks

As computer vision and object detection applications multiply, those that require ultra-low service latency have become increasingly prominent, e.g., autonomous and connected vehicles or smart city applications. Applying trained machine learning models in these scenarios can pose a computational challenge. The softwarization of networks provides opportunities to incorporate computing into the network, increasing flexibility by offloading workloads from client and edge nodes, via in-network nodes, to servers. In this article, we present an example of splitting the inference component of the trained YOLOv2 machine learning model between client-side, network-side, and server-side processing to reduce the overall service latency. Assuming a client has 20% of the server's computational resources, we observe a more than 12-fold reduction in service latency with our service split compared to on-client processing, and a speedup of more than 25% compared to performing everything on the server. Our approach is not limited to object detection; it can also be applied to a broad variety of machine learning-based applications and services.
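The trade-off behind such a split can be illustrated with a minimal latency model. The sketch below is not the article's implementation: it assumes a simple two-way client/server split (the in-network node is left out for brevity), and every stage name, timing, payload size, and link rate in it is hypothetical and chosen only to make the arithmetic easy to follow. Only the client speed factor of 0.2 mirrors the 20% assumption stated above.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Stage:
    """One step of an inference pipeline (all values are hypothetical)."""
    name: str
    server_time_ms: float  # compute time if the stage runs on the server
    output_kb: float       # size of the data this stage hands to the next one


def split_latency(stages: List[Stage], split: int, client_speed: float,
                  input_kb: float, link_kb_per_ms: float) -> float:
    """Latency when the first `split` stages run on the client, the rest on the server.

    client_speed is the client's compute capacity relative to the server
    (0.2 means the client needs five times as long for the same stage).
    """
    client_ms = sum(s.server_time_ms for s in stages[:split]) / client_speed
    server_ms = sum(s.server_time_ms for s in stages[split:])
    if split == len(stages):
        transfer_ms = 0.0  # everything ran on the client, nothing is sent
    else:
        # Send the raw input if nothing ran locally, else the last local output.
        payload_kb = input_kb if split == 0 else stages[split - 1].output_kb
        transfer_ms = payload_kb / link_kb_per_ms
    return client_ms + transfer_ms + server_ms


def best_split(stages: List[Stage], client_speed: float,
               input_kb: float, link_kb_per_ms: float) -> int:
    """Return the split index with the lowest modelled latency."""
    candidates = range(len(stages) + 1)
    return min(candidates, key=lambda k: split_latency(
        stages, k, client_speed, input_kb, link_kb_per_ms))


# Hypothetical three-stage pipeline; these numbers are NOT measurements.
EXAMPLE_STAGES = [
    Stage("preprocess", server_time_ms=2.0, output_kb=100.0),
    Stage("backbone", server_time_ms=8.0, output_kb=10.0),
    Stage("head", server_time_ms=2.0, output_kb=1.0),
]
```

In this toy setting, a cheap early stage that shrinks the payload is worth running on the slow client, because the saved transfer time outweighs the extra compute time. Running the expensive stages locally does not pay off.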
