Active Learning With Noisy Labelers for Improving Classification Accuracy of Connected Vehicles

Machine learning has emerged as a promising paradigm for enabling connected, automated vehicles to autonomously cruise the streets and react to unexpected situations. Reacting to such situations requires accurate classification for uncommon events, which in turn depends on the selection of large, diverse, and high-quality training data. In fact, the data available at a vehicle (e.g., photos of road signs) may be affected by errors or have different levels of resolution and freshness. To tackle this challenge, we propose an active learning framework that, leveraging the information collected through onboard sensors as well as received from other vehicles, effectively deals with scarce and noisy data. Given the information received from neighboring vehicles, our solution: (i) selects which vehicles can reliably generate high-quality training data, and (ii) obtains a reliable subset of data to add to the training set by trading off between two essential features, i.e., quality and diversity. The results, obtained with different real-world datasets, demonstrate that our framework significantly outperforms state-of-the-art solutions, providing high classification accuracy with a limited bandwidth requirement for the data exchange between vehicles.

[1]  Dong In Kim,et al.  Toward Secure Blockchain-Enabled Internet of Vehicles: Optimizing Consensus Management Using Reputation and Contract Theory , 2018, IEEE Transactions on Vehicular Technology.

[2]  Mohsen Guizani,et al.  Optimal User-Edge Assignment in Hierarchical Federated Learning Based on Statistical Properties and Network Topology Constraints , 2022, IEEE Transactions on Network Science and Engineering.

[3]  Karl Rihaczek,et al.  1. WHAT IS DATA MINING? , 2019, Data Mining for the Social Sciences.

[4]  Ivan Wang-Hei Ho,et al.  VANET Meets Deep Learning: The Effect of Packet Loss on the Object Detection Performance , 2019, 2019 IEEE 89th Vehicular Technology Conference (VTC2019-Spring).

[5]  Richard Y. Wang,et al.  Data Quality Assessment , 2002 .

[6]  Yang Wang,et al.  Data Subset Selection With Imperfect Multiple Labels , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[7]  Yan Zhang,et al.  Software Defined Networking for Energy Harvesting Internet of Things , 2018, IEEE Internet of Things Journal.

[8]  Jiaoyan Chen,et al.  Deep Learning From Multiple Crowds: A Case Study of Humanitarian Mapping , 2019, IEEE Transactions on Geoscience and Remote Sensing.

[9]  Aditya Krishna Menon,et al.  Learning with Symmetric Label Noise: The Importance of Being Unhinged , 2015, NIPS.

[10]  Dacheng Tao,et al.  Classification with Noisy Labels by Importance Reweighting , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Francesco Malandrino,et al.  Active Learning-based Classification in Automated Connected Vehicles , 2020, IEEE INFOCOM 2020 - IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS).

[12]  Dacheng Tao,et al.  Multiclass Learning With Partially Corrupted Labels , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[13]  Giancarlo Fortino,et al.  Clustering-Learning-Based Long-Term Predictive Localization in 5G-Envisioned Internet of Connected Vehicles , 2021, IEEE Transactions on Intelligent Transportation Systems.

[14]  Yue Huang,et al.  Cost-Effective Vehicle Type Recognition in Surveillance Images With Deep Active Learning and Web Data , 2020, IEEE Transactions on Intelligent Transportation Systems.

[15]  Yang Yu,et al.  Development of an Efficient Driving Strategy for Connected and Automated Vehicles at Signalized Intersections: A Reinforcement Learning Approach , 2020, IEEE Transactions on Intelligent Transportation Systems.

[16]  Vic Murray,et al.  About the Textbook , 2014 .

[17]  Lei Zhang,et al.  Active Self-Paced Learning for Cost-Effective and Progressive Face Identification , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Yao Zhang,et al.  A Decision-Making Strategy for Vehicle Autonomous Braking in Emergency via Deep Reinforcement Learning , 2020, IEEE Transactions on Vehicular Technology.

[19]  Rong Jin,et al.  Active Learning by Querying Informative and Representative Examples , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Ruimao Zhang,et al.  Cost-Effective Active Learning for Deep Image Classification , 2017, IEEE Transactions on Circuits and Systems for Video Technology.

[21]  Biagio Ciuffo,et al.  Traffic simulation data for Antwerp , 2017 .

[22]  Jan Paul Siebert,et al.  Vehicle Recognition Using Rule Based Methods , 1987 .

[23]  Audun Jøsang,et al.  A Logic for Uncertain Probabilities , 2001, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[24]  Ming Yang,et al.  Collaborative Active Visual Recognition from Crowds: A Distributed Ensemble Approach , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  Xindong Wu,et al.  Active Learning With Imbalanced Multiple Noisy Labeling , 2015, IEEE Transactions on Cybernetics.

[26]  Xiang Cheng,et al.  Wireless Toward the Era of Intelligent Vehicles , 2019, IEEE Internet of Things Journal.

[27]  Franco Blanchini,et al.  The joint network/control design of platooning algorithms can enforce guaranteed safety constraints , 2019, Ad Hoc Networks.

[28]  Feng Lyu,et al.  Vehicular Communication Networks in the Automated Driving Era , 2018, IEEE Communications Magazine.

[29]  Elisabeth Uhlemann,et al.  Time for Autonomous Vehicles to Connect [Connected Vehicles] , 2018, IEEE Vehicular Technology Magazine.

[30]  Mingyan Liu,et al.  An Online Learning Approach to Improving the Quality of Crowd-Sourcing , 2015, SIGMETRICS.