Limited benefit of joint estimation in multi‐agent iterative learning

This paper studies iterative learning control (ILC) in a multi-agent framework, wherein a group of agents simultaneously and repeatedly perform the same task. Assuming similarity between the agents, we investigate whether exchanging information between the agents improves an individual's learning performance. That is, does an individual agent benefit from the experience of the other agents? We consider the multi-agent iterative learning problem as a two-step process of: first, estimating the repetitive disturbance of each agent; and second, correcting for it. We present a comparison of an agent's disturbance estimate in the case of (I) independent estimation, where each agent has access only to its own measurement, and (II) joint estimation, where information of all agents is globally accessible. When the agents are identical and noise comes from measurement only, joint estimation yields a noticeable improvement in performance. However, when process noise is encountered or when the agents have an individual disturbance component, the benefit of joint estimation is negligible. Copyright © 2011 John Wiley and Sons Asia Pte Ltd and Chinese Automatic Control Society

[1]  K. Lee,et al.  Semi-empirical model-based multivariable iterative learning control of an RTP system , 2005 .

[2]  Kevin L. Moore,et al.  An iterative learning control algorithm for systems with measurement noise , 1999, Proceedings of the 38th IEEE Conference on Decision and Control (Cat. No.99CH36304).

[3]  K. Moore,et al.  Iterative Learning Control: Robustness and Monotonic Convergence for Interval Systems , 2010 .

[4]  B. Francis,et al.  A lifting technique for linear periodic systems with applications to sampled-data control , 1991 .

[5]  Si-Zhao Joe Qin,et al.  A two-stage iterative learning control technique combined with real-time feedback for independent disturbance rejection , 2004, Autom..

[6]  Guanrong Chen,et al.  Kalman Filtering with Real-time Applications , 1987 .

[7]  Raffaello D'Andrea,et al.  Optimization-based iterative learning for precise quadrocopter trajectory tracking , 2012, Autonomous Robots.

[8]  Jay H. Lee,et al.  Control of Wafer Temperature Uniformity in Rapid Thermal Processing Using an Optimal Iterative Learning Control Technique , 2000 .

[9]  R. Longchamp,et al.  Iterative Learning Control based on Stochastic Approximation , 2008 .

[10]  YangQuan Chen,et al.  Iterative learning control for multi-agent formation , 2009, 2009 ICCAS-SICE.

[11]  Jian-Xin Xu,et al.  Iterative Learning Control , 1998 .

[12]  Sandra Upson Update: Radiation sensor fine-tunes cancer treatments , 2008, IEEE Spectrum.

[13]  Zeungnam Bien,et al.  Iterative learning control: analysis, design, integration and applications , 1998 .

[14]  Andrew G. Alleyne,et al.  A Cross-Coupled Iterative Learning Control Design for Precision Motion Control , 2008, IEEE Transactions on Control Systems Technology.

[15]  R. Tousain,et al.  Design strategy for iterative learning control based on optimal control , 2001, Proceedings of the 40th IEEE Conference on Decision and Control (Cat. No.01CH37228).

[16]  Eric Guizzo,et al.  Three Engineers, Hundreds of Robots, One Warehouse , 2008, IEEE Spectrum.

[17]  A.G. Alleyne,et al.  A survey of iterative learning control , 2006, IEEE Control Systems.

[18]  Suguru Arimoto,et al.  Bettering operation of Robots by learning , 1984, J. Field Robotics.

[19]  Michel Verhaegen,et al.  Filtering and System Identification: Frontmatter , 2007 .

[20]  V. Verdult,et al.  Filtering and System Identification: A Least Squares Approach , 2007 .

[21]  Javier Alonso-Mora,et al.  Independent vs. joint estimation in multi-agent iterative learning control , 2010, 49th IEEE Conference on Decision and Control (CDC).

[22]  Michel Verhaegen,et al.  A structured matrix approach to efficient calculation of LQG repetitive learning controllers in the lifted setting , 2010, Int. J. Control.

[23]  Raffaello D'Andrea,et al.  Coordinating Hundreds of Cooperative, Autonomous Vehicles in Warehouses , 2007, AI Mag..

[24]  David H. Owens,et al.  Basis functions and parameter optimisation in high-order iterative learning control , 2006, Autom..

[25]  M. Phan,et al.  Higher-order iterative learning control by pole placement and noise filtering , 2002 .

[26]  Jay H. Lee,et al.  Model-based iterative learning control with a quadratic criterion for time-varying linear systems , 2000, Autom..

[27]  Raffaello D'Andrea,et al.  Optimization-based iterative learning control for trajectory tracking , 2009, 2009 European Control Conference (ECC).

[28]  Mikael Norrlöf,et al.  An adaptive iterative learning control algorithm with experiments on an industrial robot , 2002, IEEE Trans. Robotics Autom..

[29]  Richard W. Longman,et al.  A mathematical theory of learning control for linear discrete multivariable systems , 1988 .

[30]  Mikael Norrlöf,et al.  DISTURBANCE REJECTION USING AN ILC ALGORITHM WITH ITERATION VARYING FILTERS , 2004 .