Dynamic Bicycle Dispatching of Dockless Public Bicycle-sharing Systems Using Multi-objective Reinforcement Learning

As a new generation of Public Bicycle-sharing Systems (PBS), the Dockless PBS (DL-PBS) is an important application of cyber-physical systems and intelligent transportation. A central problem for DL-PBS is how to use artificial intelligence to produce efficient bicycle dispatching solutions that follow dynamic bicycle rental demand. In this article, we propose MORL-BD, a dynamic bicycle dispatching algorithm based on multi-objective reinforcement learning that provides the optimal bicycle dispatching solution for DL-PBS. We model the DL-PBS from the perspective of cyber-physical systems and use deep learning to predict the layout of bicycle parking spots and the dynamic demand for bicycle dispatching. We formulate the multi-route bicycle dispatching problem as a multi-objective optimization problem with four objectives: dispatching costs, the initial load of each dispatch truck, workload balance among the trucks, and the dynamic balance of bicycle supply and demand. On this basis, the collaborative multi-route bicycle dispatching problem among multiple dispatch trucks is modeled as a multi-agent, multi-objective reinforcement learning model. All dispatch paths between parking spots are defined as the state space, and the reciprocal of the dispatching cost is defined as the reward. Each dispatch truck is equipped with an agent that learns the optimal dispatch path in the dynamic DL-PBS network. We maintain an elite list that stores the Pareto optimal solutions of bicycle dispatch paths found in each action, and from it we finally obtain the Pareto frontier. Experimental results on an actual DL-PBS show that, compared with existing methods, MORL-BD finds a higher quality Pareto frontier in less execution time.
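The elite list and the reciprocal-cost reward mentioned in the abstract can be illustrated with a minimal sketch. This is not the MORL-BD implementation: the function names (`dominates`, `update_elite`, `reward`), the two-objective tuples, and the assumption that every objective is minimized are all hypothetical choices made for illustration.

```python
from typing import List, Tuple

# Hypothetical representation: one tuple of objective values per dispatch
# path, e.g. (dispatching cost, workload imbalance). All values are assumed
# to be minimized; the paper's actual encoding may differ.
Objectives = Tuple[float, ...]


def dominates(a: Objectives, b: Objectives) -> bool:
    """Return True if a Pareto-dominates b (no worse everywhere, better somewhere)."""
    no_worse = all(x <= y for x, y in zip(a, b))
    strictly_better = any(x < y for x, y in zip(a, b))
    return no_worse and strictly_better


def update_elite(elite: List[Objectives], candidate: Objectives) -> List[Objectives]:
    """Insert candidate into the elite list, keeping only non-dominated entries."""
    # Reject the candidate if an existing entry dominates or duplicates it.
    if any(dominates(e, candidate) or e == candidate for e in elite):
        return elite
    # Otherwise drop every entry the candidate dominates, then append it.
    return [e for e in elite if not dominates(candidate, e)] + [candidate]


def reward(dispatch_cost: float) -> float:
    """Reward as the reciprocal of a strictly positive dispatching cost."""
    if dispatch_cost <= 0:
        raise ValueError("dispatch_cost must be positive")
    return 1.0 / dispatch_cost


elite: List[Objectives] = []
for path_objectives in [(10.0, 3.0), (8.0, 5.0), (12.0, 4.0), (9.0, 4.0)]:
    elite = update_elite(elite, path_objectives)
```

In this sketch the candidate `(12.0, 4.0)` is discarded because `(10.0, 3.0)` is better on both objectives, while the other three candidates are mutually non-dominated and remain in the list as an approximation of the Pareto frontier.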
