Optimal operations planning of electric autonomous vehicles via asynchronous learning in ride-hailing systems

Abstract Ride-hailing systems with electric autonomous vehicles are recognized as a next-generation development to ease congestion, reduce costs and carbon emissions. In this paper, we consider the operation planning problem involving vehicle dispatching, relocation, and recharging decisions. We model the problem as a Markov Decision Process (MDP) to generate the optimal policy that maximizes the total profits. We propose a flexible policy to provide optimal actions according to the reward considering future requests and vehicle availability. We show that our model outperforms the predetermined rules on improving profits. To handle the curse-of-dimensionality caused by the large scale of state space and uncertainty, we develop an asynchronous learning method to solve the problem by approximating the value function. We first draw the samples of exogenous information and update the quality of approximations based on the quality of decisions, then approximate the exact cost-to-go value function by solving an approximation subproblem efficiently given the state at each period. Two variant algorithms are presented for cases with scarce and sufficient information. We also incorporate the state aggregation and post-decision analysis to further improve computational efficiency. We use a set of shared actual data from Didi platform to verify the proposed model in numerical experiments. To conclude, we extract managerial insights that suggest important guidelines for the ride-hailing operations planning problem.

[1]  Daniele Vigo,et al.  Optimizing relocation operations in electric car-sharing , 2017, Omega.

[2]  Kara M. Kockelman,et al.  Operations of a Shared, Autonomous Electric Vehicle Fleet: Implications of Vehicle & Charging Infrastructure Decisions , 2016 .

[3]  L. Burns Sustainable mobility: A vision of our transport future , 2013, Nature.

[4]  Fernando Ordóñez,et al.  Ridesharing: The state-of-the-art and future directions , 2013 .

[5]  Yong-Wu Zhou,et al.  Should ride-sharing platforms cooperate with car-rental companies? Implications for consumer surplus and driver surplus , 2020, Omega.

[6]  Martin W. P. Savelsbergh,et al.  Enhancing Urban Mobility: Integrating Ride-Sharing and Public Transit , 2016, Comput. Oper. Res..

[7]  Michael Hyland,et al.  Dynamic autonomous vehicle fleet operations: Optimization-based strategies to assign AVs to immediate traveler demand requests , 2018, Transportation Research Part C: Emerging Technologies.

[8]  Warren B. Powell,et al.  Feature Article - Merging AI and OR to Solve High-Dimensional Stochastic Optimization Problems Using Approximate Dynamic Programming , 2010, INFORMS J. Comput..

[9]  Daniele Vigo,et al.  Models and algorithms for reliability-oriented Dial-a-Ride with autonomous electric vehicles , 2017, Eur. J. Oper. Res..

[10]  Martin W. P. Savelsbergh,et al.  Optimization for dynamic ride-sharing: A review , 2012, Eur. J. Oper. Res..

[11]  Wei Wang,et al.  Operating Electric Vehicle Fleet for Ride-Hailing Services With Reinforcement Learning , 2020, IEEE Transactions on Intelligent Transportation Systems.

[12]  Peter W. Glynn,et al.  An empirical algorithm for relative value iteration for average-cost MDPs , 2015, 2015 54th IEEE Conference on Decision and Control (CDC).

[13]  Zizhuo Wang,et al.  We are on the Way: Analysis of On-Demand Ride-Hailing Systems , 2017, Manuf. Serv. Oper. Manag..

[14]  O. Berman,et al.  Introducing Autonomous Vehicles: Adoption Patterns and Impacts on Social Welfare , 2021, Manuf. Serv. Oper. Manag..

[15]  Atul Bhandari,et al.  An Exact and Efficient Algorithm for the Constrained Dynamic Operator Staffing Problem for Call Centers , 2008, Manag. Sci..

[16]  Jon W. Mark,et al.  Approximation of the Mean Queue Length of an M/G/c Queueing System , 1995, Oper. Res..

[17]  Eduardo C. Xavier,et al.  Taxi and Ride Sharing: A Dynamic Dial-a-Ride Problem with Money as an Incentive , 2015, Expert Syst. Appl..

[18]  Hassan Artail,et al.  The shared-taxi problem: Formulation and solution methods , 2014 .

[19]  W. Y. Szeto,et al.  A survey of dial-a-ride problems: Literature review and recent developments , 2018 .

[20]  T. Cheng,et al.  Matching supply and demand on ride-sharing platforms with permanent agents and competition , 2019 .

[21]  Michael W. Levin,et al.  Congestion-aware system optimal route choice for shared autonomous vehicles , 2017 .

[22]  Emilio Frazzoli,et al.  On-demand high-capacity ride-sharing via dynamic trip-vehicle assignment , 2017, Proceedings of the National Academy of Sciences.

[23]  Warren B. Powell,et al.  “Approximate dynamic programming: Solving the curses of dimensionality” by Warren B. Powell , 2007, Wiley Series in Probability and Statistics.

[24]  Xiaohua Jia,et al.  pRide: Privacy-Preserving Ride Matching Over Road Networks for Online Ride-Hailing Service , 2019, IEEE Transactions on Information Forensics and Security.

[25]  William B. Haskell,et al.  Empirical Dynamic Programming , 2013, Math. Oper. Res..

[26]  George J. Pappas,et al.  Hierarchical data-driven vehicle dispatch and ride-sharing , 2017, 2017 IEEE 56th Annual Conference on Decision and Control (CDC).

[27]  Sicheng Zhang,et al.  Optimal pricing of customized bus services and ride-sharing based on a competitive game model , 2021 .

[28]  Huan Xu,et al.  Approximate Value Iteration for Risk-Aware Markov Decision Processes , 2017, IEEE Transactions on Automatic Control.

[29]  Warren B. Powell,et al.  Risk-Averse Approximate Dynamic Programming with Quantile-Based Risk Measures , 2015, Math. Oper. Res..

[30]  Long He,et al.  Service Region Design for Urban Electric Vehicle Sharing Systems , 2017, Manuf. Serv. Oper. Manag..

[31]  Yi-Ming Wei,et al.  Environmental benefits from ridesharing: A case of Beijing , 2017 .

[32]  Chung-Yee Lee,et al.  Matching and pricing in ride-sharing: Optimality, stability, and financial sustainability , 2020, Omega.

[33]  Hongli Zhang,et al.  PSRide: Privacy-Preserving Shared Ride Matching for Online Ride Hailing Systems , 2021, IEEE Transactions on Dependable and Secure Computing.

[34]  Warren B. Powell,et al.  Approximate Dynamic Programming for Large-Scale Resource Allocation Problems , 2006 .

[35]  Jakob Puchinger,et al.  A survey of models and algorithms for optimizing shared mobility , 2019, Transportation Research Part B: Methodological.

[36]  Nikolaos Geroliminis,et al.  The electric autonomous dial-a-ride problem , 2019, Transportation Research Part B: Methodological.

[37]  Hoong Chuin Lau,et al.  A State Aggregation Approach for Stochastic Multiperiod Last-Mile Ride-Sharing Problems , 2019, Transp. Sci..

[38]  Tong Zhu,et al.  Transport solutions for cleaner air , 2016, Science.

[39]  R. Jayakrishnan,et al.  Autonomous or driver-less vehicles: Implementation strategies and operational concerns , 2017 .

[40]  Warren B. Powell,et al.  Dynamic-Programming Approximations for Stochastic Time-Staged Integer Multicommodity-Flow Problems , 2006, INFORMS J. Comput..

[41]  Desheng Dash Wu,et al.  Data-Driven Driver Dispatching System with Allocation Constraints and Operational Risk Management for a Ride-Sharing Platform , 2020, Decis. Sci..

[42]  Dawn B. Woodard,et al.  Dynamic pricing and matching in ride‐hailing platforms , 2019, Naval Research Logistics (NRL).

[43]  Mor Harchol-Balter,et al.  On the inapproximability of M/G/K: why two moments of job size distribution are not enough , 2010, Queueing Syst. Theory Appl..

[44]  Han Zhu,et al.  Surge Pricing and Two-Sided Temporal Responses in Ride Hailing , 2020, Manuf. Serv. Oper. Manag..

[45]  Berk Ustun,et al.  Importance Sampling in Stochastic Programming: A Markov Chain Monte Carlo Approach , 2015, INFORMS J. Comput..

[46]  Warren B. Powell,et al.  Approximate Dynamic Programming for Planning a Ride-Sharing System using Autonomous Fleets of Electric Vehicles , 2018, Eur. J. Oper. Res..

[47]  Verena Schmid,et al.  Solving the dynamic ambulance relocation and dispatching problem using approximate dynamic programming , 2012, Eur. J. Oper. Res..

[48]  K. Kockelman,et al.  Management of a Shared Autonomous Electric Vehicle Fleet: Implications of Pricing Schemes , 2016 .

[49]  Armann Ingolfsson,et al.  A Markov Chain Model for an EMS System with Repositioning , 2013 .

[50]  Fritz Busch,et al.  Framework for Automated Taxi Operation: The Family Model , 2017 .

[51]  Fan Zhang,et al.  Ride-Hailing Order Dispatching at DiDi via Reinforcement Learning , 2020, INFORMS J. Appl. Anal..