A Fast Q-Learning Based Data Storage Optimization for Low Latency in Data Center Networks

Data storage optimizations (DS, e.g. low latency for data access) in data center networks(DCN) are difficult online-making problems. Previously, they are done with heuristics under static network models which highly rely on designers’ understanding of the environment. Encouraged by recent successes in deep reinforcement learning techniques to solve intricate online assignment problems, we propose to use the Q-learning (QL) technique to train and learn from historical DS decisions, which can significantly reduce the data access delay. However, QL faces two challenges to be widely used in data centers. They are massive input data and the blindness on parameter settings which severely hamper the convergence of the learning process. To solve these two key problems, we develop an evolutionary QL scheme, named as LFDS (Low latency and Fast convergence Data Storage). In the initial stage of the LFDS, the input matrix of QL is sparse to shrink the dimensionality of the massive input data while retaining its information as much as possible. In the following training phase, a specialized neural network is adopted to achieves a quick approximation. To overcome the blindness during QL training, the two key parameters, learning rate, and discount rate are carefully tested with real data input and network architecture. The preferred range of learning rate and discount rate are recommended for the use of QL in data centers, which brings high training rewards and fast convergence. Extensive simulations with real-world data show that the data access latency is decreased by 23.5% and the convergence rate is increased by 15%.

[1]  Wentian Li,et al.  Random texts exhibit Zipf's-law-like word frequency distribution , 1992, IEEE Trans. Inf. Theory.

[2]  Jin Wang,et al.  RETRACTED ARTICLE: The visual object tracking algorithm research based on adaptive combination kernel , 2019, Journal of Ambient Intelligence and Humanized Computing.

[3]  Jianping Pan,et al.  Learning-based Adaptive Data Placement for Low Latency in Data Center Networks , 2018, 2018 IEEE 43rd Conference on Local Computer Networks (LCN).

[4]  Xi Chen,et al.  Single-Image Super-Resolution Algorithm Based on Structural Self-Similarity and Deformation Block Features , 2019, IEEE Access.

[5]  Haiyan Liu,et al.  An improved linear kernel for complementary maximal strip recovery: Simpler and smaller , 2019, Theor. Comput. Sci..

[6]  Frank L. Lewis,et al.  Model-free Q-learning designs for linear discrete-time zero-sum games with application to H-infinity control , 2007, Autom..

[7]  Mohammad Ubaidullah Bokhari,et al.  Cloud computing service models: A comparative study , 2016, 2016 3rd International Conference on Computing for Sustainable Global Development (INDIACom).

[8]  Christoforos E. Kozyrakis,et al.  Selecta: Heterogeneous Cloud Storage Configuration for Data Analytics , 2018, USENIX Annual Technical Conference.

[9]  John D. Kececioglu,et al.  Learning Parameter-Advising Sets for Multiple Sequence Alignment , 2017, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[10]  Amin Vahdat,et al.  Less Is More: Trading a Little Bandwidth for Ultra-Low Latency in the Data Center , 2012, NSDI.

[11]  Y. Iiguni,et al.  Q-learning algorithm using an adaptive-sized Q-table , 1999, Proceedings of the 38th IEEE Conference on Decision and Control (Cat. No.99CH36304).

[12]  Jin Wang,et al.  Multimodel Framework for Indoor Localization Under Mobile Edge Computing Environment , 2019, IEEE Internet of Things Journal.

[13]  Abhishek Chandra,et al.  TripS: automated multi-tiered data placement in a geo-distributed cloud environment , 2017, SYSTOR.

[14]  Lusheng Wang,et al.  Green latency-aware data placement in data centers , 2016, Comput. Networks.

[15]  Minyi Guo,et al.  Renewable Energy-Aware Big Data Analytics in Geo-Distributed Data Centers with Reinforcement Learning , 2020, IEEE Transactions on Network Science and Engineering.

[16]  Adam Wierman,et al.  Datum: Managing Data Purchasing and Data Placement in a Geo-Distributed Data Market , 2018, IEEE/ACM Transactions on Networking.

[17]  Feng Li,et al.  Parameters Compressing in Deep Learning , 2020 .

[18]  Charles E. Leiserson,et al.  Fat-trees: Universal networks for hardware-efficient supercomputing , 1985, IEEE Transactions on Computers.

[19]  Alec Wolman,et al.  Volley: Automated Data Placement for Geo-Distributed Cloud Services , 2010, NSDI.

[20]  Jie Xiong,et al.  A novel online incremental and decremental learning algorithm based on variable support vector machine , 2019, Cluster Computing.

[21]  Sanparith Marukatat,et al.  Multi Q-Table Q-Learning , 2019, 2019 10th International Conference of Information and Communication Technology for Embedded Systems (IC-ICTES).

[22]  Yan Zhang,et al.  A Reinforcement Learning-Based Data Storage Scheme for Vehicular Ad Hoc Networks , 2017, IEEE Transactions on Vehicular Technology.

[23]  R.C. Chen,et al.  MININET: A microprocessor-controlled "Mininetwork" , 1976, Proceedings of the IEEE.

[24]  Hairong Kuang,et al.  The Hadoop Distributed File System , 2010, 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST).