Vehicle Identity Recovery for Automatic Number Plate Recognition Data via Heterogeneous Network Embedding

Automatic number plate recognition (ANPR) systems, now widely deployed in many cities, produce large volumes of travel data for intelligent and sustainable transportation. ANPR data are recorded at the level of individual vehicles and carry each vehicle's unique identity, which can support personalized traffic planning. However, these systems also suffer from the common problem of missing data. Unlike traditional missing-data cases, we focus on the loss of vehicle identities in ANPR records caused by recognition failure or environmental factors. To address this issue, we propose a heterogeneous graph embedding framework that constructs a travel heterogeneous information network (THIN) and learns embeddings of its entities to find the best-matching vehicles for records with unknown identities. The recovery of vehicle identities is thereby cast as an entity alignment task on the THIN. The proposed method integrates vehicle group entities and context relations into the THIN to capture the spatiotemporal relationships in vehicle travel, and adopts a holographic embeddings model to better fit the network structure. We evaluate the method on a real ANPR dataset collected in Xuancheng, China, which has a densely distributed camera network. The experiments demonstrate the effectiveness of the proposed graph structure under different missing rates. We further compare it with other embedding methods, and the results support the superiority of holographic embeddings.
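
The abstract does not spell out the scoring function, so the following is only a minimal sketch of how a holographic embeddings (HolE) score could be used to rank candidate vehicles for a record with an unknown identity. It assumes the standard HolE formulation, in which a triple (head, relation, tail) is scored by the dot product of the relation vector with the circular correlation of the two entity vectors. The function names, the idea of a single alignment relation vector, and the random embeddings are hypothetical and are not taken from the paper.

```python
import numpy as np


def circular_correlation(a, b):
    """Circular correlation: out[k] = sum_i a[i] * b[(i + k) mod d].

    Computed with the FFT identity corr(a, b) = ifft(conj(fft(a)) * fft(b)),
    which holds for real-valued vectors.
    """
    d = a.shape[-1]
    return np.fft.irfft(np.conj(np.fft.rfft(a)) * np.fft.rfft(b), n=d)


def hole_score(head, relation, tail):
    """Raw HolE score of a triple (higher means more plausible)."""
    return float(relation @ circular_correlation(head, tail))


def rank_candidates(unknown, relation, candidates):
    """Order candidate vehicle embeddings by descending score.

    `unknown` is the embedding of the record with a missing identity,
    `candidates` is an (n, d) array of known-vehicle embeddings, and
    `relation` is a hypothetical alignment relation embedding.
    """
    scores = np.array([hole_score(unknown, relation, c) for c in candidates])
    order = np.argsort(-scores, kind="stable")
    return order, scores


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    dim = 8
    unknown = rng.normal(size=dim)          # placeholder embedding
    relation = rng.normal(size=dim)         # placeholder embedding
    candidates = rng.normal(size=(5, dim))  # placeholder embeddings
    order, scores = rank_candidates(unknown, relation, candidates)
    print("best candidate index:", order[0])
```

In this sketch the best-matching vehicle is simply the candidate with the highest score; how the embeddings are trained and which relations of the THIN are used for matching are left to the paper itself.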
