Optimal Control-Aware Transmission for Mission-Critical M2M Communications Under Bandwidth Cost Constraints

In this paper, we consider a mission-critical control system, where a dynamic plant is monitored by a mobile device (MD), and the monitored signal is transmitted to a remote controller via heterogeneous cellular and Wi-Fi networks. We propose an optimal control-aware machine-to-machine (M2M) transmission strategy for mission-critical control applications, in which control performance is measured by remote estimation error and system stability while limited by bandwidth cost. Specifically, the problem of minimizing estimation error, subject to the constraints of cellular usage costs and system stability, is formulated as an infinite-horizon constrained Markov decision process (CMDP), where the MD has options to transmit through Wi-Fi or cellular, or to stay idle. We solve the problem by utilizing the Lagrange multiplier approach, and prove that the optimal strategy is a randomized mixture of two threshold structure strategies. Furthermore, to estimate the structured optimal strategy, we present an algorithm called simultaneous perturbation stochastic approximation (SPSA), in which the complexity is <inline-formula> <tex-math notation="LaTeX">$O(|\mathcal {A}|)$ </tex-math></inline-formula> lower than a non-structured one with <inline-formula> <tex-math notation="LaTeX">$|\mathcal {A}|$ </tex-math></inline-formula> being the number of the actions.

[1]  Paulo E. Miyagi,et al.  Requirements Analysis for Machine to Machine Integration within Industry 4.0 , 2018, 2018 13th IEEE International Conference on Industry Applications (INDUSCON).

[2]  Hsiao-Hwa Chen,et al.  Radio Resource Management in Machine-to-Machine Communications—A Survey , 2018, IEEE Communications Surveys & Tutorials.

[3]  Zhu Han,et al.  Service Provisioning and User Association for Heterogeneous Wireless Railway Networks , 2017, IEEE Transactions on Communications.

[4]  Mihaela van der Schaar,et al.  Structure-Aware Stochastic Storage Management in Smart Grids , 2014, IEEE Journal of Selected Topics in Signal Processing.

[5]  Andrew W. Moore,et al.  Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[6]  Ling Shi,et al.  Scheduling Two Gauss–Markov Systems: An Optimal Solution for Remote State Estimation Under Bandwidth Constraint , 2012, IEEE Transactions on Signal Processing.

[7]  Petros G. Voulgaris,et al.  On optimal ℓ∞ to ℓ∞ filtering , 1995, Autom..

[8]  Dimitri P. Bertsekas,et al.  Dynamic Programming: Deterministic and Stochastic Models , 1987 .

[9]  Kyung Sup Kwak,et al.  Delay-Constrained Optimal Transmission With Proactive Spectrum Handoff in Cognitive Radio Networks , 2016, IEEE Transactions on Communications.

[10]  Linn I. Sennott,et al.  Average Cost Optimal Stationary Policies in Infinite State Markov Decision Processes with Unbounded Costs , 1989, Oper. Res..

[11]  Taoka Hidekazu,et al.  Scenarios for 5G mobile and wireless communications: the vision of the METIS project , 2014, IEEE Communications Magazine.

[12]  Wolfgang Kellerer,et al.  Control-aware Uplink Resource Allocation for Cyber-Physical Systems in Wireless Networks , 2017 .

[13]  Brian D. Noble,et al.  BreadCrumbs: forecasting mobile connectivity , 2008, MobiCom '08.

[14]  Billie F. Spencer,et al.  Smart sensing technology: opportunities and challenges , 2004 .

[15]  Matti Latva-aho,et al.  Ultra-Reliable and Low Latency Communication in mmWave-Enabled Massive MIMO Networks , 2017, IEEE Communications Letters.

[16]  Xi-Ren Cao,et al.  Stochastic learning and optimization - A sensitivity-based approach , 2007, Annu. Rev. Control..

[17]  F. Richard Yu,et al.  Energy-Efficient Machine-to-Machine (M2M) Communications in Virtualized Cellular Networks with Mobile Edge Computing (MEC) , 2019, IEEE Transactions on Mobile Computing.

[18]  Walaa Hamouda,et al.  Machine-to-Machine Communications With Massive Access: Congestion Control , 2019, IEEE Internet of Things Journal.

[19]  Wei Xu,et al.  Energy Efficient Resource Allocation in Machine-to-Machine Communications With Multiple Access and Energy Harvesting for IoT , 2017, IEEE Internet of Things Journal.

[20]  Vincent K. N. Lau,et al.  Modulation-Free M2M Communications for Mission-Critical Applications , 2018, IEEE Transactions on Signal and Information Processing over Networks.

[21]  H. Kushner,et al.  Stochastic Approximation and Recursive Algorithms and Applications , 2003 .

[22]  Jianwei Huang,et al.  Delay-Sensitive Mobile Crowdsensing: Algorithm Design and Economics , 2018, IEEE Transactions on Mobile Computing.

[23]  Tiejun Lv,et al.  Learning-Based Multi-Channel Access in 5G and Beyond Networks With Fast Time-Varying Channels , 2020, IEEE Transactions on Vehicular Technology.

[24]  Mo-Yuen Chow,et al.  Networked Control System: Overview and Research Trends , 2010, IEEE Transactions on Industrial Electronics.

[25]  Xinping Guan,et al.  Control Performance Aware Cooperative Transmission in Multiloop Wireless Control Systems for Industrial IoT Applications , 2018, IEEE Internet of Things Journal.

[26]  Huaming Wu,et al.  Stochastic Analysis of Delayed Mobile Offloading in Heterogeneous Networks , 2018, IEEE Transactions on Mobile Computing.

[27]  Vincent K. N. Lau,et al.  Zero MAC Latency Sensor Networking for Cyber-Physical Systems , 2018, IEEE Transactions on Signal Processing.

[28]  Vincent K. N. Lau,et al.  Distributive Stochastic Learning for Delay-Optimal OFDMA Power and Subband Allocation , 2010, IEEE Transactions on Signal Processing.

[29]  Xi-Ren Cao,et al.  Stochastic learning and optimization - A sensitivity-based approach , 2007, Annual Reviews in Control.

[30]  Chenyang Yang,et al.  Ultra-Reliable and Low-Latency Communications in Unmanned Aerial Vehicle Communication Systems , 2019, IEEE Transactions on Communications.

[31]  A. Makowski,et al.  Estimation and optimal control for constrained Markov chains , 1986, 1986 25th IEEE Conference on Decision and Control.

[32]  J. Hespanha,et al.  Communication logics for networked control systems , 2004, Proceedings of the 2004 American Control Conference.

[33]  Nandit Soparkar,et al.  Trading computation for bandwidth: reducing communication in distributed control systems using state estimators , 2002, IEEE Trans. Control. Syst. Technol..

[34]  Hao Wang,et al.  Receiver-assisted cellular/wifi handover management for efficient multipath multimedia delivery in heterogeneous wireless networks , 2016, EURASIP J. Wirel. Commun. Netw..

[35]  Bikramjit Singh,et al.  Contention-Based Access for Ultra-Reliable Low Latency Uplink Transmissions , 2018, IEEE Wireless Communications Letters.

[36]  Ethem Alpaydin,et al.  Introduction to machine learning , 2004, Adaptive computation and machine learning.

[37]  J.P. Hespanha,et al.  Optimal communication logics in networked control systems , 2004, 2004 43rd IEEE Conference on Decision and Control (CDC) (IEEE Cat. No.04CH37601).