Decentralized Data Collection for Robotic Fleet Learning: A Game-Theoretic Approach

Fleets of networked autonomous vehicles (AVs) collect terabytes of sensory data, which is often transmitted to central servers (the “cloud”) for training machine learning (ML) models. Ideally, these fleets should upload all their data, especially from rare operating contexts, in order to train robust ML models. However, this is infeasible due to prohibitive network bandwidth and data labeling costs. Instead, we propose a cooperative data sampling strategy where geodistributed AVs collaborate to collect a diverse ML training dataset in the cloud. Since the AVs have a shared objective but minimal information about each other’s local data distribution and perception model, we can naturally cast cooperative data collection as an N-player mathematical game. We show that our cooperative sampling strategy uses minimal information to converge to a centralized oracle policy with complete information about all AVs. Moreover, we theoretically characterize the performance benefits of our game-theoretic strategy compared to greedy sampling. Finally, we experimentally demonstrate that our method outperforms standard benchmarks by up to 21.9% on 4 perception datasets, including for autonomous driving in adverse weather conditions. Crucially, our experimental results on real-world datasets closely align with our theoretical guarantees.

[1]  Abhishek V A,et al.  Federated Learning: Collaborative Machine Learning without Centralized Training Data , 2022, international journal of engineering technology and management sciences.

[2]  D. Jakovetić,et al.  Personalized Federated Learning via Convex Clustering , 2022, 2022 IEEE International Smart Cities Conference (ISC2).

[3]  S. Dustdar,et al.  Fog Robotics—Understanding the Research Challenges , 2021, IEEE Internet Computing.

[4]  Zhouyuan Huo,et al.  Privacy-Preserving Asynchronous Vertical Federated Learning Algorithms for Multiparty Collaborative Learning , 2021, IEEE Transactions on Neural Networks and Learning Systems.

[5]  Marco Pavone,et al.  Sampling Training Data for Continual Learning Between Robots and the Cloud , 2020, ISER.

[6]  Ali H. Sayed,et al.  Optimal Importance Sampling for Federated Learning , 2020, ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[7]  Osvaldo Simeone,et al.  Decentralized Federated Learning via SGD over Wireless D2D Networks , 2020, 2020 IEEE 21st International Workshop on Signal Processing Advances in Wireless Communications (SPAWC).

[8]  Solmaz Niknam,et al.  Federated Learning for Wireless Communications: Motivation, Opportunities, and Challenges , 2019, IEEE Communications Magazine.

[9]  John Kubiatowicz,et al.  A Fog Robotics Approach to Deep Robot Learning: Application to Object Recognition and Grasp Planning in Surface Decluttering , 2019, 2019 International Conference on Robotics and Automation (ICRA).

[10]  Marco Pavone,et al.  Network offloading policies for cloud robotics: a learning-based approach , 2019, Autonomous Robots.

[11]  Sebastian Caldas,et al.  Expanding the Reach of Federated Learning by Reducing Client Resource Requirements , 2018, ArXiv.

[12]  Kai Yang,et al.  Active Learning for Wireless IoT Intrusion Detection , 2018, IEEE Wireless Communications.

[13]  Trevor Darrell,et al.  BDD100K: A Diverse Driving Dataset for Heterogeneous Multitask Learning , 2018, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[14]  Zoubin Ghahramani,et al.  Deep Bayesian Active Learning with Image Data , 2017, ICML.

[15]  Peter Richtárik,et al.  Federated Learning: Strategies for Improving Communication Efficiency , 2016, ArXiv.

[16]  Pieter Abbeel,et al.  Image Object Label 3 D CAD Model Candidate Grasps Google Object Recognition Engine Google Cloud Storage Select Feasible Grasp with Highest Success Probability Pose EstimationCamera Robots Cloud 3 D Sensor , 2014 .

[17]  Huaiyu Zhu On Information and Sufficiency , 1997 .

[18]  David A. Cohn,et al.  Active Learning with Statistical Models , 1996, NIPS.

[19]  Sandeep P. Chinchali,et al.  Decentralized Sharing and Valuation of Fleet Robotic Data , 2021, CoRL.

[20]  Stéphane Durand,et al.  Analysis of Best Response Dynamics in Potential Games. (Analyse de la meilleure dynamique de réponse dans les jeux potentiels) , 2018 .

[21]  Bruce Bueno de Mesquita,et al.  An Introduction to Game Theory , 2014 .

[22]  Ken Goldberg,et al.  Cloud Robotics and Automation: A Survey of Related Work , 2013 .

[23]  Alex Krizhevsky,et al.  Learning Multiple Layers of Features from Tiny Images , 2009 .

[24]  Burr Settles,et al.  Active Learning Literature Survey , 2009 .

[25]  W. Saad,et al.  Game Theory in Wireless and Communication Networks , 2008 .

[26]  W. K. KIM,et al.  The Existence of Nash Equilibrium in n-Person Games with C-Concavity , 2002 .

[27]  Eric van Damme,et al.  Non-Cooperative Games , 2000 .

[28]  Daphne Koller,et al.  Active learning: theory and applications , 2001 .

[29]  J. Nash Equilibrium Points in N-Person Games. , 1950, Proceedings of the National Academy of Sciences of the United States of America.