论文信息 - A gradient optimization approach to adaptive multi-robot control

A gradient optimization approach to adaptive multi-robot control

This thesis proposes a unified approach for controlling a group of robots to reach a goal configuration in a decentralized fashion. As a motivating example, robots are controlled to spread out over an environment to provide sensor coverage. This example gives rise to a cost function that is shown to be of a surprisingly general nature. By changing a single free parameter, the cost function captures a variety of different multi-robot objectives which were previously seen as unrelated. Stable, distributed controllers are generated by taking the gradient of this cost function. Two fundamental classes of multi-robot behaviors are delineated based on the convexity of the underlying cost function. Convex cost functions lead to consensus (all robots move to the same position), while any other behavior requires a nonconvex cost function. The multi-robot controllers are then augmented with a stable on-line learning mechanism to adapt to unknown features in the environment. In a sensor coverage application, this allows robots to learn where in the environment they are most needed, and to aggregate in those areas. The learning mechanism uses communication between neighboring robots to enable distributed learning over the multi-robot system in a provably convergent way. Three multi-robot controllers are then implemented on three different robot platforms. Firstly, a controller for deploying robots in an environment to provide sensor coverage is implemented on a group of 16 mobile robots. They learn to aggregate around a light source while covering the environment. Secondly, a controller is implemented for deploying a group of three flying robots with downward facing cameras to monitor an environment on the ground. Thirdly, the multi-robot model is used as a basis for modeling the behavior of a herd of cows using a system identification approach. The controllers in this thesis are distributed, theoretically proven, and implemented on multi-robot platforms. (Copies available exclusively from MIT Libraries, Rm. 14-0551, Cambridge, MA 02139-4307. Ph. 617-253-5668; Fax 617-253-1690.)

Mac Schwager | Daniela Rus | D. Rus | M. Schwager

[1] J.K. Hedrick,et al. An overview of emerging results in cooperative UAV control , 2004, 2004 43rd IEEE Conference on Decision and Control (CDC) (IEEE Cat. No.04CH37601).

[2] Jean-Jacques E. Slotine,et al. On partial contraction analysis for coupled nonlinear oscillators , 2004, Biological Cybernetics.

[3] Robert M. Sanner,et al. Gaussian Networks for Direct Adaptive Control , 1991, 1991 American Control Conference.

[4] Weiping Li,et al. Applied Nonlinear Control , 1991 .

[5] Fred N. Ares,et al. Grazing values and management of black grama and tobosa grasslands and associated shrub ranges of the Southwest. , 1962 .

[6] Mac Schwager,et al. Consensus learning for distributed coverage control , 2008, 2008 IEEE International Conference on Robotics and Automation.

[7] Jie Lin,et al. Coordination of groups of mobile autonomous agents using nearest neighbor rules , 2003, IEEE Trans. Autom. Control..

[8] Magnus Egerstedt,et al. Data-Driven Generation of Low-Complexity Control Programs , 2004 .

[9] John N. Tsitsiklis,et al. Comments on "Coordination of Groups of Mobile Autonomous Agents Using Nearest Neighbor Rules" , 2007, IEEE Trans. Autom. Control..

[10] John N. Tsitsiklis,et al. Parallel and distributed computation , 1989 .

[11] Nikolaus Correll,et al. System Identification of Self-Organizing Robotic Swarms , 2006, DARS.

[12] Naomi Ehrich Leonard,et al. Cooperative Filters and Control for Cooperative Exploration , 2010, IEEE Transactions on Automatic Control.

[13] Dean M. Anderson,et al. Virtual fencing--past, present and future , 2007 .

[14] M. Athans,et al. State Estimation for Discrete Systems with Switching Parameters , 1978, IEEE Transactions on Aerospace and Electronic Systems.

[15] Ruggero Carli,et al. Average consensus on networks with quantized communication , 2009 .

[16] George J. Pappas,et al. Controlling Connectivity of Dynamic Graphs , 2005, Proceedings of the 44th IEEE Conference on Decision and Control.

[17] Howie Choset,et al. Coverage for robotics – A survey of recent results , 2001, Annals of Mathematics and Artificial Intelligence.

[18] Y. Bar-Shalom,et al. Multisensor resource deployment using posterior Cramer-Rao bounds , 2004, IEEE Transactions on Aerospace and Electronic Systems.

[19] Gordon F. Royle,et al. Algebraic Graph Theory , 2001, Graduate texts in mathematics.

[20] S. M. Rutter,et al. An automatic system to record foraging behaviour in free-ranging ruminants , 1997 .

[21] Vijay Kumar,et al. Leader-to-formation stability , 2004, IEEE Transactions on Robotics and Automation.

[22] Richard M. Murray,et al. Consensus problems in networks of agents with switching topology and time-delays , 2004, IEEE Transactions on Automatic Control.

[23] C. Guestrin,et al. Near-optimal sensor placements: maximizing information while minimizing communication cost , 2006, 2006 5th International Conference on Information Processing in Sensor Networks.

[24] Jean-Jacques E. Slotine,et al. Adaptive sliding controller synthesis for non-linear systems , 1986 .

[25] Emilio Frazzoli,et al. Equitable partitioning policies for robotic networks , 2009, 2009 IEEE International Conference on Robotics and Automation.

[26] Hartmut Logemann,et al. Asymptotic Behaviour of Nonlinear Systems , 2004, Am. Math. Mon..

[27] J.N. Tsitsiklis,et al. Convergence in Multiagent Coordination, Consensus, and Flocking , 2005, Proceedings of the 44th IEEE Conference on Decision and Control.

[28] Mac Schwager,et al. Unifying Geometric, Probabilistic, and Potential Field Approaches to Multi-robot Coverage Control , 2009, ISRR.

[29] Francesco Bullo,et al. A ladybug exploration strategy for distributed adaptive coverage control , 2008, 2008 IEEE International Conference on Robotics and Automation.

[30] R. Srikant,et al. Quantized Consensus , 2006, 2006 IEEE International Symposium on Information Theory.

[31] Zack J. Butler,et al. Controlling mobile sensors for monitoring events with coverage constraints , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[32] Vladimir Pavlovic,et al. Learning Switching Linear Models of Human Motion , 2000, NIPS.

[33] Mubarak Shah,et al. Consistent Labeling of Tracked Objects in Multiple Cameras with Overlapping Fields of View , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[34] Peter I. Corke,et al. From Robots to Animals: Virtual Fences for Controlling Cattle , 2006, Int. J. Robotics Res..

[35] Mac Schwager,et al. Optimal coverage for multiple hovering robots with downward facing cameras , 2009, 2009 IEEE International Conference on Robotics and Automation.

[36] Calin Belta,et al. Abstraction and control for Groups of robots , 2004, IEEE Transactions on Robotics.

[37] Emilio Frazzoli,et al. Dynamic multi-vehicle routing with multiple classes of demands , 2009, 2009 American Control Conference.

[38] Gerd Hirzinger,et al. Energy-efficient Autonomous Four-rotor Flying Robot Controlled at 1 kHz , 2007, Proceedings 2007 IEEE International Conference on Robotics and Automation.

[39] F. Bullo,et al. Motion Coordination with Distributed Information , 2007 .

[40] Gaurav S. Sukhatme,et al. Mobile Sensor Network Deployment using Potential Fields : A Distributed , Scalable Solution to the Area Coverage Problem , 2002 .

[41] Tim J. Ellis,et al. Multi camera image tracking , 2006, Image Vis. Comput..

[42] Vijay Kumar,et al. Simultaneous Coverage and Tracking (SCAT) of Moving Targets with Robot Networks , 2008, WAFR.

[43] A. Banerjee. Convex Analysis and Optimization , 2006 .

[44] Geoffrey E. Hinton,et al. Variational Learning for Switching State-Space Models , 2000, Neural Computation.

[45] E. Ryan. An Integral Invariance Principle for Differential Inclusions with Applications in Adaptive Control , 1998 .

[46] Giancarlo Ferrari-Trecate,et al. Analysis of coordination in multi-agent systems through partial difference equations , 2006, IEEE Transactions on Automatic Control.

[47] GROUP LIFE , 1951 .

[48] J. P. Lasalle. Some Extensions of Liapunov's Second Method , 1960 .

[49] Randal W. Beard,et al. Cooperative Surveillance with Multiple UAVs , 2008 .

[50] Wei Li,et al. Distributed Cooperative coverage Control of Sensor Networks , 2005, Proceedings of the 44th IEEE Conference on Decision and Control.

[51] Said Salhi,et al. Facility Location: A Survey of Applications and Methods , 1996 .

[52] Jake K. Aggarwal,et al. Tracking Human Motion in Structured Environments Using a Distributed-Camera System , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[53] Takeo Kanade,et al. Algorithms for cooperative multisensor surveillance , 2001, Proc. IEEE.

[54] Felipe Cucker,et al. Emergent Behavior in Flocks , 2007, IEEE Transactions on Automatic Control.

[55] Vijay Kumar,et al. Planning and Control of Mobile Robots in Image Space from Overhead Cameras , 2005, Proceedings of the 2005 IEEE International Conference on Robotics and Automation.

[56] Ioannis M. Rekleitis,et al. Distributed coverage with multi-robot system , 2006, Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006..

[57] M. A. Dahleh,et al. Constraints on locational optimization problems , 2003, 42nd IEEE International Conference on Decision and Control (IEEE Cat. No.03CH37475).

[58] Peter I. Corke,et al. The Design and Evaluation of a Mobile Sensor/Actuator Network for Autonomous Animal Control , 2007, 2007 6th International Symposium on Information Processing in Sensor Networks.

[59] Yong Wang,et al. Energy-efficient computing for wildlife tracking: design tradeoffs and early experiences with ZebraNet , 2002, ASPLOS X.

[60] J. Jewkes,et al. Theory of Location of Industries. , 1933 .

[61] John N. Tsitsiklis,et al. Distributed Asynchronous Deterministic and Stochastic Gradient Optimization Algorithms , 1984, 1984 American Control Conference.

[62] Dimos V. Dimarogonas,et al. Connectedness Preserving Distributed Swarm Aggregation for Multiple Kinematic Robots , 2008, IEEE Transactions on Robotics.

[63] Andreas Krause,et al. Near-optimal Observation Selection using Submodular Functions , 2007, AAAI.

[64] N. S. Urquhart,et al. Using digital pedometers to monitor travel of cows grazing arid rangeland , 1986 .

[65] James D. McLurkin. Stupid robot tricks : a behavior-based distributed algorithm library for programming swarms of robots , 2004 .

[66] John N. Tsitsiklis,et al. Problems in decentralized decision making and computation , 1984 .

[67] Harley Flanders,et al. Differentiation Under the Integral Sign , 1973 .

[68] Gregory Dudek,et al. Multi-robot collaboration for robust exploration , 2004, Annals of Mathematics and Artificial Intelligence.

[69] M. Hirsch,et al. Differential Equations, Dynamical Systems, and Linear Algebra , 1974 .

[70] Supun Samarasekera,et al. Aerial video surveillance and exploitation , 2001, Proc. IEEE.

[71] Gaurav S. Sukhatme,et al. Sensor coverage using mobile robots and stationary nodes , 2002, SPIE ITCom.

[72] Hanumant Singh,et al. Toward large-area mosaicing for underwater scientific applications , 2003 .

[73] Mac Schwager,et al. Distributed Coverage Control with Sensory Feedback for Networked Robots , 2006, Robotics: Science and Systems.

[74] Shahin Sirouspour,et al. Optimal positioning of multiple cameras for object recognition using Cramer-Rao lower bound , 2006, Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006..

[75] Emilio Frazzoli,et al. Efficient routing of multiple vehicles with no explicit communications , 2008 .

[76] Jean-Jacques E. Slotine,et al. A theoretical study of different leader roles in networks , 2006, IEEE Transactions on Automatic Control.

[77] James M. Rehg,et al. Data-Driven MCMC for Learning and Inference in Switching Linear Dynamic Systems , 2005, AAAI.

[78] Lennart Ljung,et al. System Identification: Theory for the User , 1987 .

[79] MartonosiMargaret,et al. Energy-efficient computing for wildlife tracking , 2002 .

[80] Sonia Martínez,et al. Coverage control for mobile sensing networks , 2002, IEEE Transactions on Robotics and Automation.

[81] George J. Pappas,et al. Potential Fields for Maintaining Connectivity of Mobile Networks , 2007, IEEE Transactions on Robotics.

[82] Petter Ögren,et al. Cooperative control of mobile sensor networks:Adaptive gradient climbing in a distributed environment , 2004, IEEE Transactions on Automatic Control.

[83] Vicsek,et al. Novel type of phase transition in a system of self-driven particles. , 1995, Physical review letters.

[84] Kevin M. Passino,et al. Stability analysis of swarms , 2003, IEEE Trans. Autom. Control..

[85] Nikolaus Correll,et al. Multirobot inspection of industrial machinery , 2009 .

[86] Ralph L. Hollis,et al. Complete distributed coverage of rectilinear environments , 2000 .

[87] Mac Schwager,et al. From Theory to Practice: Distributed Coverage Control Experiments with Groups of Robots , 2008, ISER.

[88] Mac Schwager,et al. Decentralized, Adaptive Control for Coverage with Networked Robots , 2007, Proceedings 2007 IEEE International Conference on Robotics and Automation.

[89] S. Sastry,et al. Adaptive Control: Stability, Convergence and Robustness , 1989 .

[90] Mac Schwager,et al. Decentralized, Adaptive Coverage Control for Networked Robots , 2009, Int. J. Robotics Res..

[91] John Wainright,et al. Climate and Climatological Variations in the Jornada Basin , 2006 .

[92] Francesco Bullo,et al. Esaim: Control, Optimisation and Calculus of Variations Spatially-distributed Coverage Optimization and Control with Limited-range Interactions , 2022 .

[93] S. P. Lloyd,et al. Least squares quantization in PCM , 1982, IEEE Trans. Inf. Theory.

[94] George J. Pappas,et al. Flocking in Fixed and Switching Networks , 2007, IEEE Transactions on Automatic Control.

[95] Ali Jadbabaie,et al. Distributed Geodesic Control Laws for Flocking of Nonholonomic Agents , 2007, IEEE Transactions on Automatic Control.

[96] Roberto Cipolla,et al. Multiview Photometric Stereo , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[97] Weiping Li,et al. Composite adaptive control of robot manipulators , 1989, Autom..

[98] Mac Schwager,et al. Data‐driven identification of group dynamics for motion prediction and control , 2008, J. Field Robotics.

[99] Francesco Bullo,et al. Distributed Control of Robotic Networks , 2009 .

[100] Vijay Kumar,et al. Sensing and coverage for a network of heterogeneous robots , 2008, 2008 47th IEEE Conference on Decision and Control.

[101] Randy A. Freeman,et al. Decentralized Environmental Modeling by Mobile Sensor Networks , 2008, IEEE Transactions on Robotics.

[102] F Mondada,et al. Social Integration of Robots into Groups of Cockroaches to Control Self-Organized Choices , 2007, Science.

[103] Mac Schwager,et al. Robust classification of animal tracking data , 2007 .

[104] Jonathan P. How,et al. Cooperative Vision Based Estimation and Tracking Using Multiple UAVs , 2007 .