A general AI-defined attention network for predicting CDN performance

Abstract It has become an increasingly popular trend to replicate network content across a geographically distributed structure. The Content Delivery Network (CDN) is a prime example. Content providers have been struggling to improve the Quality of Experience (QoE) of its clients via accurate predictions of CDN performance, which is a crucial prerequisite for subsequent policies like resource scheduling. Methods of Artificial Intelligence (AI) like machine learning and deep learning have shown their advantages on performance predictions recently, but it is still challenging to simultaneously ensure accuracy, generality and low overhead. In this paper, we propose a general AI-defined attention network to make predictions on CDN performance, the results of which can improve CDN resource optimal scheduling. First, we formulate this CDN performance prediction problem as a sequence learning problem. And then we design a two-phase deep neural network with attention so that our model can work well for all cache groups in a CDN. Experiment results show that our model outperforms state-of-art models in accuracy while maintaining a relatively low training overhead.

[1]  Wei Li,et al.  Edge cognitive computing based smart healthcare system , 2018, Future Gener. Comput. Syst..

[2]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[3]  Jürgen Schmidhuber,et al.  Multi-column deep neural network for traffic sign classification , 2012, Neural Networks.

[4]  Yoshiaki Tanaka,et al.  A Deep Reinforcement Learning Based Approach for Cost- and Energy-Aware Multi-Flow Mobile Data Offloading , 2018, IEICE Trans. Commun..

[5]  Yoshua Bengio,et al.  End-to-end attention-based large vocabulary speech recognition , 2015, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[6]  Xin Wang,et al.  Machine Learning for Networking: Workflow, Advances and Opportunities , 2017, IEEE Network.

[7]  Shalin Huang,et al.  A data-driven approach of performance evaluation for cache server groups in content delivery network , 2018, J. Parallel Distributed Comput..

[8]  Gerald Penn,et al.  Convolutional Neural Networks for Speech Recognition , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[9]  Cheng Zhang,et al.  Task migration for mobile edge computing using deep reinforcement learning , 2019, Future Gener. Comput. Syst..

[10]  Wei Li,et al.  A Dynamic Service Migration Mechanism in Edge Cognitive Computing , 2018, ACM Trans. Internet Techn..

[11]  Min Chen,et al.  Label-less Learning for Traffic Control in an Edge Network , 2018, IEEE Network.

[12]  Zhihui Lu,et al.  Toy-IoT-Oriented data-driven CDN performance evaluation model with deep learning , 2018, J. Syst. Archit..

[13]  Garrison W. Cottrell,et al.  A Dual-Stage Attention-Based Recurrent Neural Network for Time Series Prediction , 2017, IJCAI.

[14]  Victor C. M. Leung,et al.  From cloud-based communications to cognition-based communications: A computing perspective , 2018, Comput. Commun..