Towards Adaptive Fault Tolerance on ROS for Advanced Driver Assistance Systems

The use of over-the-air updates has attracted very much interest these last few years with the software-intensive development of embedded systems in the car industry. The development of autonomous driving and ADAS (Advanced Driver Assistance Systems) renders over-the-air updates mandatory, for both user satisfaction and economic reasons. How to make sure that remote updates of critical ADAS do not have an impact on safety? This is the question we tackle in our work with a major car manufacturer. This paper is a progress report. We summarize our approach involving AFT (Adaptive Fault Tolerance) implemented on ROS (Robot Operating System), describe the simulation platform we have developed to experiment and validate over-the-air updates of ADAS and AFT, and finally draw some lessons learnt and perspectives.

[1]  Aaas News,et al.  Book Reviews , 1893, Buffalo Medical and Surgical Journal.

[2]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[3]  Jean-Claude Laprie,et al.  From Dependability to Resilience , 2008, DSN 2008.

[4]  Matthieu Roy,et al.  Engineering Adaptive Fault-Tolerance Mechanisms for Resilient Computing on ROS , 2016, 2016 IEEE 17th International Symposium on High Assurance Systems Engineering (HASE).

[5]  Matthieu Roy,et al.  Architecting Resilient Computing Systems: a Component-Based Approach. (Conception et implémentation de systèmes résilients par une approche à composants) , 2013 .

[6]  Gene Cooperman,et al.  DMTCP: Transparent checkpointing for cluster computations and the desktop , 2007, 2009 IEEE International Symposium on Parallel & Distributed Processing.

[7]  Seyed Masoud Sadjadi,et al.  Composing adaptive software , 2004, Computer.

[8]  Jean-Charles Fabre,et al.  A Metaobject Architecture for Fault-Tolerant Distributed Systems: The FRIENDS Approach , 1998, IEEE Trans. Computers.

[9]  Gustavo Alonso,et al.  Understanding replication in databases and distributed systems , 2000, Proceedings 20th IEEE International Conference on Distributed Computing Systems.

[10]  하수철,et al.  [서평]「Component Software」 - Beyond Object-Oriented Programming - , 2000 .

[11]  Israel Koren,et al.  Adaptive fault-tolerance fault-tolerance for cyber-physical systems , 2013, 2013 International Conference on Computing, Networking and Communications (ICNC).

[12]  Alistair Cockburn,et al.  Agile Software Development: The Business of Innovation , 2001, Computer.

[13]  Gregor Kiczales,et al.  Aspect-oriented programming , 1996, CSUR.

[14]  K. H. Kim,et al.  Adaptive fault tolerance: issues and approaches , 1990, [1990] Proceedings. Second IEEE Workshop on Future Trends of Distributed Computing Systems.