Age of Information Aware Trajectory Planning of UAVs in Intelligent Transportation Systems: A Deep Learning Approach

Unmanned aerial vehicles (UAVs) are envisioned to play a key role in intelligent transportation systems to complement the communication infrastructure in future smart cities. UAV-assisted vehicular networking research typically adopts throughput and latency as the main performance metrics. These conventional metrics, however, are not adequate to reflect the freshness of the information, an attribute that has been recently identified as a critical requirement to enable services such as autonomous driving and accident prevention. In this paper, we consider a UAV-assisted single-hop vehicular network, wherein sensors (e.g., LiDARs and cameras) on vehicles generate time sensitive data streams, and UAVs are used to collect and process this data while maintaining a minimum age of information (AoI). We aim to jointly optimize the trajectories of UAVs and find scheduling policies to keep the information fresh under minimum throughput constraints. The formulated optimization problem is shown to be mixed integer non-linear program (MINLP) and generally hard to be solved. Motivated by the success of machine learning (ML) techniques particularly deep learning in solving complex problems with low complexity, we reformulate the trajectories and scheduling policies problem as a Markov decision process (MDP) where the system state space considers the vehicular network dynamics. Then, we develop deep reinforcement learning (DRL) to learn the vehicular environment and its dynamics in order to handle UAVs’ trajectory and scheduling policy. In particular, we leverage Deep Deterministic Policy Gradient (DDPG) for learning the trajectories of the deployed UAVs to efficiently minimize the Expected Weighted Sum AoI (EWSA). Simulations results demonstrate the effectiveness of the proposed design and show the deployed UAVs adapt their velocities during the data collection mission in order to minimize the AoI.

[1]  Chadi Assi,et al.  Multihop V2I Communications: A Feasibility Study, Modeling, and Performance Analysis , 2017, IEEE Transactions on Vehicular Technology.

[2]  Alex Graves,et al.  Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.

[3]  Brian D. O. Anderson,et al.  Stochastic Characterization of Information Propagation Process in Vehicular Ad hoc Networks , 2014, IEEE Transactions on Intelligent Transportation Systems.

[4]  Chadi Assi,et al.  Joint Optimization of UAV Trajectory and Radio Resource Allocation for Drive-Thru Vehicular Networks , 2019, 2019 IEEE Wireless Communications and Networking Conference (WCNC).

[5]  Walid Saad,et al.  A Tutorial on UAVs for Wireless Networks: Applications, Challenges, and Open Problems , 2018, IEEE Communications Surveys & Tutorials.

[6]  Huaiyu Dai,et al.  UAV-Enabled Age-Optimal Data Collection in Wireless Sensor Networks , 2019, 2019 IEEE International Conference on Communications Workshops (ICC Workshops).

[7]  Vangelis Angelakis,et al.  Age of Information: A New Concept, Metric, and Tool , 2018, Found. Trends Netw..

[8]  Ozan K. Tonguz,et al.  Routing in Sparse Vehicular Ad Hoc Wireless Networks , 2007, IEEE Journal on Selected Areas in Communications.

[9]  Mustafa K. Mehmet Ali,et al.  A Performance Modeling of Connectivity in Vehicular Ad Hoc Networks , 2008, IEEE Transactions on Vehicular Technology.

[10]  Zhen Liu Scheduling of random task graphs on parallel processors , 1995, MASCOTS '95. Proceedings of the Third International Workshop on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems.

[11]  Li Wang,et al.  Minimizing Packet Expiration Loss With Path Planning in UAV-Assisted Data Sensing , 2019, IEEE Wireless Communications Letters.

[12]  Chi Harold Liu,et al.  Energy-Efficient Distributed Mobile Crowd Sensing: A Deep Learning Approach , 2019, IEEE Journal on Selected Areas in Communications.

[13]  Chi Harold Liu,et al.  Distributed Energy-Efficient Multi-UAV Navigation for Long-Term Communication Coverage by Deep Reinforcement Learning , 2020, IEEE Transactions on Mobile Computing.

[14]  Susana Sargento,et al.  Deploying Roadside Units in Sparse Vehicular Networks: What Really Works and What Does Not , 2014, IEEE Transactions on Vehicular Technology.

[15]  Ali Ghrayeb,et al.  Trajectory Planning of Multiple Dronecells in Vehicular Networks: A Reinforcement Learning Approach , 2020, IEEE Networking Letters.

[16]  Yu Zhang,et al.  Mobile Service Amount Based Link Scheduling for High-Mobility Cooperative Vehicular Networks , 2017, IEEE Transactions on Vehicular Technology.

[17]  Weihua Zhuang,et al.  Stochastic Analysis of a Single-Hop Communication Link in Vehicular Ad Hoc Networks , 2014, IEEE Transactions on Intelligent Transportation Systems.

[18]  Ali Ghrayeb,et al.  Leveraging UAVs for Coverage in Cell-Free Vehicular Networks: A Deep Reinforcement Learning Approach , 2021, IEEE Transactions on Mobile Computing.

[19]  Chadi Assi,et al.  UAV Trajectory Planning for Data Collection from Time-Constrained IoT Devices , 2020, IEEE Transactions on Wireless Communications.

[20]  Yuval Tassa,et al.  Continuous control with deep reinforcement learning , 2015, ICLR.

[21]  Tarak Gandhi,et al.  Pedestrian Protection Systems: Issues, Survey, and Challenges , 2007, IEEE Transactions on Intelligent Transportation Systems.

[22]  Zijing Wang,et al.  Age-Based Path Planning and Data Acquisition in UAV-Assisted IoT Networks , 2019, 2019 IEEE International Conference on Communications Workshops (ICC Workshops).

[23]  Eytan Modiano,et al.  Optimizing Age of Information in Wireless Networks with Throughput Constraints , 2018, IEEE INFOCOM 2018 - IEEE Conference on Computer Communications.

[24]  Walid Saad,et al.  Deep Reinforcement Learning for Minimizing Age-of-Information in UAV-Assisted Networks , 2019, 2019 IEEE Global Communications Conference (GLOBECOM).

[25]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[26]  Qingqing Wu,et al.  Joint Trajectory and Communication Design for Multi-UAV Enabled Wireless Networks , 2017, IEEE Transactions on Wireless Communications.

[27]  Ali Ghrayeb,et al.  Trajectory Planning and Resource Allocation of Multiple UAVs for Data Delivery in Vehicular Networks , 2019, IEEE Networking Letters.

[28]  Mohamed-Slim Alouini,et al.  A Survey of Channel Modeling for UAV Communications , 2018, IEEE Communications Surveys & Tutorials.

[29]  Warren B. Powell,et al.  What you should know about approximate dynamic programming , 2009, Naval Research Logistics (NRL).

[30]  Qing Wang,et al.  Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space , 2018, ArXiv.

[31]  Anthony Ephremides,et al.  Optimal scheduling with strict deadlines , 1989 .

[32]  Eytan Modiano,et al.  Minimizing the Age of Information in broadcast wireless networks , 2016, 2016 54th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[33]  Eytan Modiano,et al.  Optimal Transmission Scheduling in Symmetric Communication Models With Intermittent Connectivity , 2007, IEEE Transactions on Information Theory.

[34]  Harpreet S. Dhillon,et al.  Average Peak Age-of-Information Minimization in UAV-Assisted IoT Networks , 2018, IEEE Transactions on Vehicular Technology.

[35]  Philip Bachman,et al.  Deep Reinforcement Learning that Matters , 2017, AAAI.

[36]  Xijun Wang,et al.  Age-optimal trajectory planning for UAV-assisted data collection , 2018, IEEE INFOCOM 2018 - IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS).