FENCE: Fast, ExteNsible, and ConsolidatEd Framework for Intelligent Big Data Processing

The proliferation of smart devices and the advancement of data-intensive services has led to explosion of data, which uncovers massive opportunities as well as challenges related to real-time analysis of big data streams. The edge computing frameworks implemented over manycore systems can be considered as a promising solution to address these challenges. However, in spite of the availability of modern computing systems with a large number of processing cores and high memory capacity, the performance and scalability of manycore systems can be limited by the software and operating system (OS) level bottlenecks. In this work, we focus on these challenges, and discuss how accelerated communication, efficient caching, and high performance computation can be provisioned over manycore systems. The proposed Fast, ExteNsible, and ConsolidatEd (FENCE) framework leverages the availability of a large number of computing cores and overcomes the OS level bottlenecks to provide high performance and scalability for intelligent big data processing. We implemented a prototype of FENCE and the experiment results demonstrate that FENCE provides improved data reception throughput, read/write throughput, and application processing performance as compared to the baseline Linux system.

[1]  Marco Levorato,et al.  eBPF-based content and computation-aware communication for real-time edge computing , 2018, IEEE INFOCOM 2018 - IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS).

[2]  Seungjoon Lee,et al.  Network function virtualization: Challenges and opportunities for innovations , 2015, IEEE Communications Magazine.

[3]  Jing Liu,et al.  I'm Not Dead Yet!: The Role of the Operating System in a Kernel-Bypass Era , 2019, HotOS.

[4]  Andreas Polze,et al.  A Performance Survey of Lightweight Virtualization Techniques , 2017, ESOCC.

[5]  Daniel Raumer,et al.  Comparison of frameworks for high-performance packet IO , 2015, 2015 ACM/IEEE Symposium on Architectures for Networking and Communications Systems (ANCS).

[6]  Robert Tappan Morris,et al.  An Analysis of Linux Scalability to Many Cores , 2010, OSDI.

[7]  Jon Crowcroft,et al.  Unikernels: library operating systems for the cloud , 2013, ASPLOS '13.

[8]  Seung-Jun Cha,et al.  Boosting Edge Computing Performance Through Heterogeneous Manycore Systems , 2018, 2018 International Conference on Information and Communication Technology Convergence (ICTC).

[9]  Changwoo Min,et al.  Understanding Manycore Scalability of File Systems , 2016, USENIX Annual Technical Conference.

[10]  Sungyong Park,et al.  pNOVA: Optimizing Shared File I/O Operations of NVM File System on Manycore Servers , 2019, APSys '19.

[11]  Qin Zhang,et al.  Edge Computing in IoT-Based Manufacturing , 2018, IEEE Communications Magazine.

[12]  Prem Prakash Jayaraman,et al.  The Role of Big Data Analytics in Industrial Internet of Things , 2019, Future Gener. Comput. Syst..

[13]  Changwoo Min,et al.  Scaling Guest OS Critical Sections with eCS , 2018, USENIX Annual Technical Conference.

[14]  Changwoo Min,et al.  MV-RLU: Scaling Read-Log-Update with Multi-Versioning , 2019, ASPLOS.

[15]  Mohsen Guizani,et al.  Deep Learning for IoT Big Data and Streaming Analytics: A Survey , 2017, IEEE Communications Surveys & Tutorials.

[16]  T. V. Lakshman,et al.  Bringing the cloud to the edge , 2014, 2014 IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS).

[17]  Eunyoung Jeong,et al.  mTCP: a Highly Scalable User-level TCP Stack for Multicore Systems , 2014, NSDI.

[18]  Hosung Park,et al.  What is Twitter, a social network or a news media? , 2010, WWW '10.

[19]  Toke Høiland-Jørgensen,et al.  The eXpress data path: fast programmable packet processing in the operating system kernel , 2018, CoNEXT.

[20]  Heeseung Jo,et al.  FLsched: A Lockless and Lightweight Approach to OS Scheduler for Xeon Phi , 2017, APSys.

[21]  Taesoo Kim,et al.  Scalable and practical locking with shuffling , 2019, SOSP.

[22]  Mugen Peng,et al.  Edge computing technologies for Internet of Things: a primer , 2017, Digit. Commun. Networks.

[23]  Xiao Liu,et al.  Basic Performance Measurements of the Intel Optane DC Persistent Memory Module , 2019, ArXiv.

[24]  Tao Li,et al.  Bridging the I/O performance gap for big data workloads: A new NVDIMM-based approach , 2016, 2016 49th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO).

[25]  Ejaz Ahmed,et al.  Big Data Analytics in Industrial IoT Using a Concentric Computing Model , 2018, IEEE Communications Magazine.

[26]  K. B. Letaief,et al.  Mobile Edge Intelligence and Computing for the Internet of Vehicles , 2019, Proceedings of the IEEE.

[27]  Mohan Kumar,et al.  mKPAC: Kernel Packet Processing for Manycore Systems , 2018, Middleware.

[28]  Nir Shavit,et al.  Read-log-update: a lightweight synchronization mechanism for concurrent programming , 2015, SOSP.

[29]  Tapani Ristaniemi,et al.  Learn to Cache: Machine Learning for Network Edge Caching in the Big Data Era , 2018, IEEE Wireless Communications.

[30]  Francisco Herrera,et al.  Big data preprocessing: methods and prospects , 2016 .

[31]  Michael L. Scott,et al.  Algorithms for scalable synchronization on shared-memory multiprocessors , 1991, TOCS.

[32]  Shahid Mumtaz,et al.  Social Big-Data-Based Content Dissemination in Internet of Vehicles , 2018, IEEE Transactions on Industrial Informatics.

[33]  Mark D. Hill,et al.  Amdahl's Law in the Multicore Era , 2008, Computer.

[34]  Scott Shenker,et al.  Revisiting network support for RDMA , 2018, SIGCOMM.

[35]  Dimitris Mourtzis,et al.  Industrial Big Data as a Result of IoT Adoption in Manufacturing , 2016 .

[36]  Athanasios V. Vasilakos,et al.  Self-Adaptive Pre-Processing Methodology for Big Data Stream Mining in Internet of Things Environmental Sensor Monitoring , 2017, Symmetry.

[37]  Mohan Kumar,et al.  Mosaic: Processing a Trillion-Edge Graph on a Single Machine , 2017, EuroSys.

[38]  Jörg Ott,et al.  Consolidate IoT Edge Computing with Lightweight Virtualization , 2018, IEEE Network.

[39]  Youngjae Kim,et al.  Write optimization of log-structured flash file system for parallel I/O on manycore servers , 2019, SYSTOR.