Efficient Parallel Implementations of Controlled Optimization of Traffic Phases

Finding optimal phase durations for a controlled intersection is a computationally intensive task requiring O(N3) operations. In this paper we introduce cost-optimal parallelization of a dynamic programming algorithm that reduces the complexity to O(N2). Three implementations that span a wide range of parallel hardware are developed. The first is based on shared-memory architecture, using the OpenMP programming model. The second implementation is based on message passing, targeting massively parallel machines including high performance clusters, and supercomputers. The third implementation is based on the data parallel programming model mapped on Graphics Processing Units (GPUs). Key optimizations include loop reversal, communication pruning, load-balancing, and efficient thread to processors assignment. Experiments have been conducted on 8-core server, IBM BlueGene/L supercomputer 2-node boards with 128 processors, and GPU GTX470 GeForce Nvidia with 448 cores. Results indicate practical scalability on all platforms, with maximum speed up reaching 76x for the GTX470.

[1]  Bart De Schutter,et al.  Fast Model Predictive Control for Urban Road Networks via MILP , 2011, IEEE Transactions on Intelligent Transportation Systems.

[2]  Giorgio Valle,et al.  CUDA compatible GPU cards as efficient hardware accelerators for Smith-Waterman sequence alignment , 2008, BMC Bioinformatics.

[3]  Fang Zhou,et al.  QTL mapping arthritis traits in CXB mice , 2008, BMC Bioinformatics.

[4]  T. R. P. Siriwardena,et al.  Accelerating global sequence alignment using CUDA compatible multi-core GPU , 2010, 2010 Fifth International Conference on Information and Automation for Sustainability.

[5]  Jie Cheng,et al.  Programming Massively Parallel Processors. A Hands-on Approach , 2010, Scalable Comput. Pract. Exp..

[6]  Randy B Machemehl,et al.  Development of a Phase-by-Phase, Arrival-Based, Delay-Optimized Adaptive Traffic Signal Control Methodology with Metaheuristic Search , 2006 .

[7]  Yu-Fai Fung,et al.  Coordinated road-junction traffic control by dynamic programming , 2005, IEEE Trans. Intell. Transp. Syst..

[8]  John E. Stone,et al.  OpenCL: A Parallel Programming Standard for Heterogeneous Computing Systems , 2010, Computing in Science & Engineering.

[9]  Zhou Guangwei,et al.  Optimization of adaptive transit signal priority using parallel genetic algorithm , 2007 .

[10]  Suvrajeet Sen,et al.  Controlled Optimization of Phases at an Intersection , 1997, Transp. Sci..

[11]  M S Waterman,et al.  Identification of common molecular subsequences. , 1981, Journal of molecular biology.

[12]  Robert L. Smith,et al.  CoSIGN: A Parallel Algorithm for Coordinated Traffic Signal Control , 2006, IEEE Transactions on Intelligent Transportation Systems.

[13]  Wu-chun Feng,et al.  On the Robust Mapping of Dynamic Programming onto a Graphics Processing Unit , 2009, 2009 15th International Conference on Parallel and Distributed Systems.