Turning Traffic Monitoring Cameras into Intelligent Sensors for Traffic Density Estimation

Accurate traffic state information plays a pivotal role in the Intelligent Transportation Systems (ITS), and it is an essential input to various smart mobility applications such as signal coordination and traffic flow prediction. The current practice to obtain the traffic state information is through specialized sensors such as loop detectors and speed cameras. In most metropolitan areas, traffic monitoring cameras have been installed to monitor the traffic conditions on arterial roads and expressways, and the collected videos or images are mainly used for visual inspection by traffic engineers. Unfortunately, the data collected from traffic monitoring cameras are affected by the 4L characteristics: Low frame rate, Low resolution, Lack of annotated data, and Located in complex road environments. Therefore, despite the great potentials of the traffic monitoring cameras, the 4L characteristics hinder them from providing useful traffic state information (e.g., speed, flow, density). This paper focuses on the traffic density estimation problem as it is widely applicable to various traffic surveillance systems. To the best of our knowledge, there is a lack of the holistic framework for addressing the 4L characteristics and extracting the traffic density information from traffic monitoring camera data. In view of this, this paper proposes a framework for estimating traffic density using uncalibrated traffic monitoring cameras with 4L characteristics. The proposed framework consists of two major components: camera calibration and vehicle detection. The camera calibration method estimates the actual length between pixels in the images and videos, and the vehicle counts are extracted from the deep-learning-based vehicle detection method. Combining the two components, high-granular traffic density can be estimated. To validate the proposed framework, two case studies were conducted in Hong Kong and Sacramento. The results show that the Mean Absolute Error (MAE) in camera calibration is less than 0.2 meters out of 6 meters, and the accuracy of vehicle detection under various conditions is approximately 90%. Overall, the MAE for the estimated density is 9.04 veh/km/lane in Hong Kong and 1.30 veh/km/lane in Sacramento. The research outcomes can be used to calibrate the speed-density fundamental diagrams, and the proposed framework can provide accurate and real-time traffic information without installing additional sensors.

[1]  Pietro Perona,et al.  Microsoft COCO: Common Objects in Context , 2014, ECCV.

[2]  Markos Papageorgiou,et al.  A flow-maximizing adaptive local ramp metering strategy , 2004 .

[3]  Nikolas Geroliminis,et al.  Feedback Perimeter Control for Heterogeneous Urban Networks Using Adaptive Optimization , 2015, 2015 IEEE 18th International Conference on Intelligent Transportation Systems.

[4]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Jonathan Krause,et al.  3D Object Representations for Fine-Grained Categorization , 2013, 2013 IEEE International Conference on Computer Vision Workshops.

[6]  Yunde Jia,et al.  Vehicle Type Classification Using a Semisupervised Convolutional Neural Network , 2015, IEEE Transactions on Intelligent Transportation Systems.

[7]  Celil Ozkurt,et al.  Automatic Traffic Density Estimation and Vehicle Classification for Traffic Surveillance Systems Using Neural Networks , 2009 .

[8]  Daniel J. Dailey,et al.  Dynamic camera calibration of roadside traffic management cameras for vehicle speed estimation , 2003, IEEE Trans. Intell. Transp. Syst..

[9]  Yingjie Xia,et al.  Urban road traffic speed estimation for missing probe vehicle data based on multiple linear regression model , 2013, 16th International IEEE Conference on Intelligent Transportation Systems (ITSC 2013).

[10]  Adam Herout,et al.  Traffic surveillance camera calibration by 3D model bounding box alignment for accurate vehicle speed measurement , 2017, Comput. Vis. Image Underst..

[11]  Joel A. Hesch,et al.  A Direct Least-Squares (DLS) method for PnP , 2011, 2011 International Conference on Computer Vision.

[12]  Jinjun Tang,et al.  Real-Time Traffic Flow Parameter Estimation From UAV Video Based on Ensemble Classifier and Optical Flow , 2019, IEEE Transactions on Intelligent Transportation Systems.

[13]  Dima Damen,et al.  Recognizing linked events: Searching the space of feasible explanations , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[14]  Markos Papageorgiou,et al.  Exploiting the fundamental diagram of urban networks for feedback-based gating , 2012 .

[15]  Shiqi Li,et al.  A Robust O(n) Solution to the Perspective-n-Point Problem , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Markos Papageorgiou,et al.  Local Ramp Metering in the Presence of a Distant Downstream Bottleneck: Theoretical Analysis and Simulation Study , 2014, IEEE Transactions on Intelligent Transportation Systems.

[17]  Francesc Moreno-Noguer,et al.  Exhaustive Linearization for Robust Camera Pose and Focal Length Estimation , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Ziyuan Pu,et al.  Roadway surveillance video camera calibration using standard shipping container , 2017, 2017 International Smart Cities Conference (ISC2).

[19]  Der-Horng Lee,et al.  Probe Vehicle Population and Sample Size for Arterial Speed Estimation , 2002 .

[20]  V. Lepetit,et al.  EPnP: An Accurate O(n) Solution to the PnP Problem , 2009, International Journal of Computer Vision.

[21]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[22]  Imari Sato,et al.  A General and Simple Method for Camera Pose and Focal Length Determination , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[23]  Adam Herout,et al.  Vehicle Re-identification for Automatic Video Traffic Surveillance , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[24]  S. Panichpapiboon,et al.  Evaluation of a neighbor-based vehicle density estimation scheme , 2008, 2008 8th International Conference on ITS Telecommunications.

[25]  Vinay Sudhakaran,et al.  Estimation of intersection traffic density on decentralized architectures with deep networks , 2017, 2017 International Smart Cities Conference (ISC2).

[26]  S. Joseph Gladwin,et al.  Vision based Vehicle/Pedestrian Detection in Traffic Surveillance System , 2019, 2019 International Conference on Communication and Signal Processing (ICCSP).

[27]  Yan Huang,et al.  Camera calibration and vehicle tracking: Highway traffic video analytics , 2014 .

[28]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[29]  Jian Zhang,et al.  A Traffic Surveillance System for Obtaining Comprehensive Information of the Passing Vehicles Based on Instance Segmentation , 2021, IEEE Transactions on Intelligent Transportation Systems.

[30]  André de Palma,et al.  Traffic congestion pricing methodologies and technologies , 2011 .

[31]  William H. K. Lam,et al.  Application of automatic vehicle identification technology for real-time journey time estimation , 2011, Inf. Fusion.

[32]  Takeo Kanade,et al.  Computer Vision – ECCV 2014 , 2014, Lecture Notes in Computer Science.

[33]  Xinkai Wu,et al.  Vehicle Re-Identification With Image Processing and Car-Following Model Using Multiple Surveillance Cameras From Urban Arterials , 2021, IEEE Transactions on Intelligent Transportation Systems.

[34]  B D Greenshields,et al.  A study of traffic capacity , 1935 .

[35]  Adam Herout,et al.  Automatic camera calibration by landmarks on rigid objects , 2020, Machine Vision and Applications.

[36]  Changchang Wu,et al.  P3.5P: Pose estimation with unknown focal length , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[37]  Qingquan Li,et al.  Urban Traffic Density Estimation Based on Ultrahigh-Resolution UAV Video and Deep Neural Network , 2018, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[38]  Stephen T Muench,et al.  Effects of Loop Detector Installation on the Portland Cement Concrete Pavement Lifespan: Case Study on I-5 , 2010 .

[39]  Zhiming Luo,et al.  MIO-TCD: A New Benchmark Dataset for Vehicle Classification and Localization , 2018, IEEE Transactions on Image Processing.

[40]  Long Quan,et al.  Linear N-Point Camera Pose Determination , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[41]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[42]  W. Y. Szeto,et al.  A cell-based dynamic congestion pricing scheme considering travel distance and time delay , 2019, Transportmetrica B: Transport Dynamics.

[43]  Nikolaus Hansen,et al.  Adapting arbitrary normal mutation distributions in evolution strategies: the covariance matrix adaptation , 1996, Proceedings of IEEE International Conference on Evolutionary Computation.

[44]  Pravin Varaiya,et al.  Freeway performance measurement system (pems) , 2002 .

[45]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[46]  José M. F. Moura,et al.  Understanding Traffic Density from Large-Scale Web Camera Data , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[47]  G. F. Newell Nonlinear Effects in the Dynamics of Car Following , 1961 .

[48]  Tasneem S. J. Darwish,et al.  Traffic density estimation in vehicular ad hoc networks: A review , 2015, Ad Hoc Networks.

[49]  Shang-Hong Lai,et al.  GAN-Based Day-to-Night Image Style Transfer for Nighttime Vehicle Detection , 2020, IEEE Transactions on Intelligent Transportation Systems.

[50]  Eric J. Gonzales,et al.  Estimation of left behind subway passengers through archived data and video image processing , 2020, Transportation Research Part C: Emerging Technologies.

[51]  Laurent Kneip,et al.  A Direct Least-Squares Solution to the PnP Problem with Unknown Focal Length , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[52]  Ross B. Girshick,et al.  Focal Loss for Dense Object Detection , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[53]  Macario Cordel,et al.  Convolutional neural network for vehicle detection in low resolution traffic videos , 2016, 2016 IEEE Region 10 Symposium (TENSYMP).

[54]  Dima Damen,et al.  Proceedings of the British Machine Vision Conference , 2014, BMVC 2014.

[55]  Trevor Darrell,et al.  BDD100K: A Diverse Driving Video Database with Scalable Annotation Tooling , 2018, ArXiv.

[56]  Chengyi Wang,et al.  An automatic traffic density estimation using Single Shot Detection (SSD) and MobileNet-SSD , 2019, Physics and Chemistry of the Earth, Parts A/B/C.

[57]  Zhengyou Zhang,et al.  A Flexible New Technique for Camera Calibration , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[58]  Lakshminarayanan Subramanian,et al.  Traffic density estimation from highly noise image sources , 2012 .

[59]  Adam Herout,et al.  Automatic Camera Calibration for Traffic Understanding , 2014, BMVC.

[60]  Zhigang Xu,et al.  Domain adaptation from daytime to nighttime: A situation-sensitive vehicle detection and traffic flow parameter estimation framework , 2021 .

[61]  Yunfeng Ai,et al.  On Automatic and Dynamic Camera Calibration based on Traffic Visual Surveillance , 2007, 2007 IEEE Intelligent Vehicles Symposium.

[62]  Markos Papageorgiou,et al.  ALINEA: A LOCAL FEEDBACK CONTROL LAW FOR ON-RAMP METERING , 1990 .

[63]  J. Elíasson,et al.  Equity effects of congestion pricing: Quantitative methodology and a case study for Stockholm , 2006 .