Adaptive stochastic optimization for cooperative coverage with a swarm of Micro Aerial Vehicles

The use of multi-robot teams has gained a lot of attention in recent years. This is due to the extended capabilities that the teams offer compared to the use of a single robot for the same task. Moreover, as these platforms become more and more affordable and robust, the use of teams of aerial vehicles is becoming a viable alternative. This thesis focuses on the problem of deploying a swarm of Micro Aerial Vehicles (MAV) to perform surveillance coverage missions over an unknown terrain of arbitrary morphology. Since the terrain's morphology is unknown and it can be quite complex and non-convex, standard algorithms are not applicable to the particular problem treated in this thesis. To overcome this, a new approach based on the Cognitive-based Adaptive Optimization (CAO) algorithm is proposed and evaluated. A fundamental property of this approach is that it shares the same convergence characteristics as those of constrained gradient-descent algorithms, which require perfect knowledge of the terrain's morphology to optimize coverage. In addition, it is also proposed a different formulation of the problem in order to obtain a distributed solution, which allows us to overcome the drawbacks of a centralized approach and to consider also limited communication capabilities. Rigorous mathematical arguments and extensive simulations establish that the proposed approach provides a scalable and efficient methodology that incorporates any particular physical constraints and limitations able to navigate the robots to an arrangement that (locally) optimizes the surveillance coverage. The proposed method is finally implemented in a real swarm of MAVs to carry out surveillance coverage in an outdoor complex area.

[1]  J. L. Maryak,et al.  Global random optimization by simultaneous perturbation stochastic approximation , 2001, Proceedings of the 2001 American Control Conference. (Cat. No.01CH37148).

[2]  Wolfram Burgard,et al.  Multi-Level Surface Maps for Outdoor Terrain Mapping and Loop Closing , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[3]  Morgan Quigley,et al.  ROS: an open-source Robot Operating System , 2009, ICRA 2009.

[4]  Micha Sharir,et al.  Efficient algorithms for geometric optimization , 1998, CSUR.

[5]  Roland Siegwart,et al.  Distributed Coverage Control on Surfaces in 3D Space , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[6]  Simon Lacroix,et al.  Digital elevation map building from low altitude stereo imagery , 2002, Robotics Auton. Syst..

[7]  Nathan Michael,et al.  Persistent surveillance with a team of MAVs , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[8]  Christian Laugier,et al.  Navigating between people: A stochastic optimization approach , 2012, 2012 IEEE International Conference on Robotics and Automation.

[9]  Katia P. Sycara,et al.  Coverage control for mobile anisotropic sensor networks , 2011, 2011 IEEE International Conference on Robotics and Automation.

[10]  Vijay Kumar,et al.  Distributed Coverage and Exploration in Unknown Non-convex Environments , 2010, DARS.

[11]  Madan M. Gupta,et al.  Neural Networks and Fuzzy Basis Functions for Functional Approximation , 1995 .

[12]  James C. Spall,et al.  Introduction to stochastic search and optimization - estimation, simulation, and control , 2003, Wiley-Interscience series in discrete mathematics and optimization.

[13]  R. Simmons,et al.  COMPANION: A Constraint-Optimizing Method for Person-Acceptable Navigation , 2009, RO-MAN 2009 - The 18th IEEE International Symposium on Robot and Human Interactive Communication.

[14]  Agostino Martinelli,et al.  Potential field based approach for coordinate exploration with a multi-robot team , 2010, 2010 IEEE Safety Security and Rescue Robotics.

[15]  James C. Spall,et al.  AN OVERVIEW OF THE SIMULTANEOUS PERTURBATION METHOD FOR EFFICIENT OPTIMIZATION , 1998 .

[16]  Elon Rimon,et al.  Spanning-tree based coverage of continuous areas by a mobile robot , 2001, Proceedings 2001 ICRA. IEEE International Conference on Robotics and Automation (Cat. No.01CH37164).

[17]  Mokhtar S. Bazaraa,et al.  Nonlinear Programming: Theory and Algorithms , 1993 .

[18]  Leila Takayama,et al.  Influences on proxemic behaviors in human-robot interaction , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[19]  M. Horowitz HUMAN SPATIAL BEHAVIOR. , 1965, American journal of psychotherapy.

[20]  Wolfram Burgard,et al.  A real-time algorithm for mobile robot mapping with applications to multi-robot and 3D mapping , 2000, Proceedings 2000 ICRA. Millennium Conference. IEEE International Conference on Robotics and Automation. Symposia Proceedings (Cat. No.00CH37065).

[21]  Simon Lacroix,et al.  Reactive navigation in outdoor environments using potential fields , 1998, Proceedings. 1998 IEEE International Conference on Robotics and Automation (Cat. No.98CH36146).

[22]  Sean P. Meyn,et al.  The O.D.E. Method for Convergence of Stochastic Approximation and Reinforcement Learning , 2000, SIAM J. Control. Optim..

[23]  Harold J. Kushner,et al.  Stochastic Approximation Algorithms and Applications , 1997, Applications of Mathematics.

[24]  Roland Siegwart,et al.  Voronoi coverage of non-convex environments with a group of networked robots , 2010, 2010 IEEE International Conference on Robotics and Automation.

[25]  Gaurav S. Sukhatme,et al.  Spreading Out: A Local Approach to Multi-robot Coverage , 2002, DARS.

[26]  H. Kushner Asymptotic global behavior for stochastic approximation and diffusions with slowly decreasing noise effects: Global minimization via Monte Carlo , 1987 .

[27]  Elias B. Kosmatopoulos,et al.  Multi-robot 3D coverage of unknown terrains , 2011, IEEE Conference on Decision and Control and European Control Conference.

[28]  F. Bullo,et al.  Distributed deployment of asynchronous guards in art galleries , 2006, 2006 American Control Conference.

[29]  S. Hirche,et al.  Voronoi based coverage control with anisotropic sensors , 2008, 2008 American Control Conference.

[30]  Mac Schwager,et al.  Distributed Coverage Control with Sensory Feedback for Networked Robots , 2006, Robotics: Science and Systems.

[31]  Yu-Chi Ho,et al.  On the numerical solutions of stochastic optimization problem , 1997, IEEE Trans. Autom. Control..

[32]  Elias B. Kosmatopoulos,et al.  Cognitive-based adaptive control for cooperative multi-robot coverage , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[33]  Elias B. Kosmatopoulos,et al.  Adaptive-based distributed cooperative multi-robot coverage , 2011, Proceedings of the 2011 American Control Conference.

[34]  John Lygeros,et al.  Probabilistic certification of pan-tilt-zoom camera surveillance systems , 2011, IEEE Conference on Decision and Control and European Control Conference.

[35]  George J. Pappas,et al.  Sensor-based robot deployment algorithms , 2010, 49th IEEE Conference on Decision and Control (CDC).

[36]  T. Shermer Recent Results in Art Galleries , 1992 .

[37]  Adam Kendon,et al.  Spacing and Orientation in Co-present Interaction , 2009, COST 2102 Training School.

[38]  F. Bullo,et al.  Visibility-based multi-agent deployment in orthogonal environments , 2007, 2007 American Control Conference.

[39]  Mac Schwager,et al.  Optimal coverage for multiple hovering robots with downward facing cameras , 2009, 2009 IEEE International Conference on Robotics and Automation.

[40]  Gaurav S. Sukhatme,et al.  Mobile Sensor Network Deployment using Potential Fields : A Distributed , Scalable Solution to the Area Coverage Problem , 2002 .

[42]  Douglas W. Gage,et al.  Command Control for Many-Robot Systems , 1992 .

[43]  V. Torczon,et al.  Direct search methods: then and now , 2000 .

[44]  Gerd Hirzinger,et al.  Energy-efficient Autonomous Four-rotor Flying Robot Controlled at 1 kHz , 2007, Proceedings 2007 IEEE International Conference on Robotics and Automation.

[45]  Chaouki T. Abdallah,et al.  An adaptive coverage control algorithm for deployment of nonholonomic mobile sensors , 2010, 49th IEEE Conference on Decision and Control (CDC).

[46]  Dusan M. Stipanovic,et al.  Effective Coverage Control for Mobile Sensor Networks With Guaranteed Collision Avoidance , 2007, IEEE Transactions on Control Systems Technology.

[47]  Rachid Alami,et al.  A Human Aware Mobile Robot Motion Planner , 2007, IEEE Transactions on Robotics.

[48]  Roland Siegwart,et al.  Monocular‐SLAM–based navigation for autonomous micro helicopters in GPS‐denied environments , 2011, J. Field Robotics.

[49]  G. Klein,et al.  Parallel Tracking and Mapping for Small AR Workspaces , 2007, 2007 6th IEEE and ACM International Symposium on Mixed and Augmented Reality.

[50]  H. Robbins A Stochastic Approximation Method , 1951 .

[51]  Marios M. Polycarpou,et al.  High-order neural network structures for identification of dynamical systems , 1995, IEEE Trans. Neural Networks.

[52]  Roland Siegwart,et al.  Real-time metric state estimation for modular vision-inertial systems , 2011, 2011 IEEE International Conference on Robotics and Automation.

[53]  Jean-Claude Latombe,et al.  Robot motion planning , 1970, The Kluwer international series in engineering and computer science.

[54]  Hauke Strasdat,et al.  Real-time monocular SLAM: Why filter? , 2010, 2010 IEEE International Conference on Robotics and Automation.

[55]  J. Kiefer,et al.  Stochastic Estimation of the Maximum of a Regression Function , 1952 .

[56]  S. P. Lloyd,et al.  Least squares quantization in PCM , 1982, IEEE Trans. Inf. Theory.

[57]  Vijay Kumar,et al.  Sensing and coverage for a network of heterogeneous robots , 2008, 2008 47th IEEE Conference on Decision and Control.

[58]  Alex Pentland,et al.  Capturing Individual and Group Behavior with Wearable Sensors , 2009, AAAI Spring Symposium: Human Behavior Modeling.

[59]  Chee Kheong Siew,et al.  Universal Approximation using Incremental Constructive Feedforward Networks with Random Hidden Nodes , 2006, IEEE Transactions on Neural Networks.

[60]  Ehud Rivlin,et al.  TangentBug: A Range-Sensor-Based Navigation Algorithm , 1998, Int. J. Robotics Res..

[61]  K. Kitazawa,et al.  Pedestrian Vision and Collision Avoidance Behavior: Investigation of the Information Process Space of Pedestrians Using an Eye Tracker , 2010 .

[62]  Gang George Yin Rates of Convergence for a Class of Global Stochastic Optimization Algorithms , 1999, SIAM J. Optim..

[63]  J. Blum Multidimensional Stochastic Approximation Methods , 1954 .

[64]  Alejandro Ribeiro,et al.  Robot deployment with end-to-end communication constraints , 2011, IEEE Conference on Decision and Control and European Control Conference.

[65]  Francesco Bullo,et al.  A ladybug exploration strategy for distributed adaptive coverage control , 2008, 2008 IEEE International Conference on Robotics and Automation.

[66]  Agostino Martinelli,et al.  Distributed Coverage Control for a Multi-Robot Team in a Non-Convex Environment , 2009 .

[67]  Li-Chen Fu,et al.  Human-Centered Robot Navigation—Towards a Harmoniously Human–Robot Coexisting Environment , 2011, IEEE Transactions on Robotics.

[68]  Ruggero Carli,et al.  Dynamic partitioning and coverage control with asynchronous one-to-base-station communication , 2011, IEEE Conference on Decision and Control and European Control Conference.

[69]  Howie Choset,et al.  Coverage Path Planning: The Boustrophedon Cellular Decomposition , 1998 .

[70]  J. Dippon,et al.  Weighted Means in Stochastic Approximation of Minima , 1997 .

[71]  Rachid Alami,et al.  How may I serve you?: a robot companion approaching a seated person in a helping context , 2006, HRI '06.

[72]  J. Spall Multivariate stochastic approximation using a simultaneous perturbation gradient approximation , 1992 .

[73]  Milos Zefran,et al.  Performing coverage on nonconvex domains , 2008, 2008 IEEE International Conference on Control Applications.

[74]  Guang-Bin Huang,et al.  Convex incremental extreme learning machine , 2007, Neurocomputing.

[75]  Zezhong Xu,et al.  3D Map Building Based on Stereo Vision , 2006, 2006 IEEE International Conference on Networking, Sensing and Control.

[76]  F Dobson,et al.  Eye contact. , 1993, Nursing times.

[77]  Maxim Likhachev,et al.  Robotic comfort zones , 2000, SPIE Optics East.

[78]  Roland Siegwart,et al.  Optimal surveillance coverage for teams of micro aerial vehicles in GPS-denied environments using onboard vision , 2012, Auton. Robots.

[79]  G. Pflug Stochastic Approximation Methods for Constrained and Unconstrained Systems - Kushner, HJ.; Clark, D.S. , 1980 .

[80]  Hai-Tao Fang,et al.  ANNEALING OF ITERATIVE STOCHASTIC SCHEMES , 1997 .

[81]  Ron Meir,et al.  Approximation bounds for smooth functions in C(Rd) by neural and mixture networks , 1998, IEEE Trans. Neural Networks.

[82]  Multi-Robot 3D Coverage of Unknown Areas , 2012 .

[83]  Francesco Bullo,et al.  Maximizing visibility in nonconvex polygons: nonsmooth analysis and gradient algorithm design , 2005, Proceedings of the 2005, American Control Conference, 2005..

[84]  M. A. Styblinski,et al.  Experiments in nonconvex optimization: Stochastic approximation with function smoothing and simulated annealing , 1990, Neural Networks.

[85]  Anne Spalanzani,et al.  Understanding human interaction for probabilistic autonomous navigation using Risk-RRT approach , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[86]  John N. Tsitsiklis,et al.  Gradient Convergence in Gradient methods with Errors , 1999, SIAM J. Optim..

[87]  Roland Siegwart,et al.  Vision based MAV navigation in unknown and unstructured environments , 2010, 2010 IEEE International Conference on Robotics and Automation.

[88]  Payman Sadegh,et al.  Constrained optimization via stochastic approximation with a simultaneous perturbation gradient approximation , 1997, Autom..

[89]  Markos Papageorgiou,et al.  Adaptive Fine-Tuning of Nonlinear Control Systems With Application to the Urban Traffic Control Strategy TUC , 2007, IEEE Transactions on Control Systems Technology.

[90]  Jorge Urrutia,et al.  Art Gallery and Illumination Problems , 2000, Handbook of Computational Geometry.

[91]  O. Khatib,et al.  Real-Time Obstacle Avoidance for Manipulators and Mobile Robots , 1985, Proceedings. 1985 IEEE International Conference on Robotics and Automation.

[92]  Petros A. Ioannou,et al.  Robust Adaptive Control , 2012 .

[93]  Takeo Kanade,et al.  High resolution terrain map from multiple sensor data , 1990, EEE International Workshop on Intelligent Robots and Systems, Towards a New Frontier of Applications.

[94]  P. Kokotovic,et al.  Inverse Optimality in Robust Stabilization , 1996 .

[95]  Roland Siegwart,et al.  Intuitive 3D Maps for MAV Terrain Exploration and Obstacle Avoidance , 2011, J. Intell. Robotic Syst..

[96]  J. Spall Accelerated second-order stochastic optimization using only function measurements , 1997, Proceedings of the 36th IEEE Conference on Decision and Control.

[97]  H. Robbins,et al.  A CONVERGENCE THEOREM FOR NON NEGATIVE ALMOST SUPERMARTINGALES AND SOME APPLICATIONS**Research supported by NIH Grant 5-R01-GM-16895-03 and ONR Grant N00014-67-A-0108-0018. , 1971 .

[98]  D. Draper,et al.  Stochastic Optimization: a Review , 2002 .

[99]  Michael C. Fu,et al.  Two-timescale simultaneous perturbation stochastic approximation using deterministic perturbation sequences , 2003, TOMC.

[100]  E. Hall The hidden dimension: an anthropologist examines man's use of space in public and private , 1969 .

[101]  J. Spall II . 6 Stochastic Optimization , 2004 .

[102]  Gaurav S. Sukhatme,et al.  An incremental deployment algorithm for mobile robot teams , 2002, IEEE/RSJ International Conference on Intelligent Robots and Systems.

[103]  Sergio A. Velastin,et al.  Intelligent distributed surveillance systems: a review , 2005 .

[104]  Elias B. Kosmatopoulos,et al.  Large Scale Nonlinear Control System Fine-Tuning Through Learning , 2009, IEEE Transactions on Neural Networks.

[105]  E. B. Kosmotapoulos An adaptive optimization scheme with satisfactory transient performance. , 2009 .

[106]  J. Cortes,et al.  Coverage control by robotic networks with limited-range anisotropic sensory , 2008, 2008 American Control Conference.

[107]  S. Mitter,et al.  RECURSIVE STOCHASTIC ALGORITHMS FOR GLOBAL OPTIMIZATION IN , 2022 .

[108]  J. O'Rourke Art gallery theorems and algorithms , 1987 .

[109]  Abhijit Gosavi,et al.  Simulation-Based Optimization: Parametric Optimization Techniques and Reinforcement Learning , 2003 .

[110]  Gaurav S. Sukhatme,et al.  Constrained coverage for mobile sensor networks , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[111]  Roland Siegwart,et al.  3D surveillance coverage using maps extracted by a monocular SLAM algorithm , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[112]  George J. Pappas,et al.  Multi-UAV Cooperative Surveillance with Spatio-Temporal Specifications , 2006, Proceedings of the 45th IEEE Conference on Decision and Control.

[113]  Efficient global optimization using SPSA , 1999, Proceedings of the 1999 American Control Conference (Cat. No. 99CH36251).