ACTIVE LENGTH CONTROL OF SHAPE MEMORY ALLOY WIRES VIA REINFORCEMENT LEARNING

The ability to actively control the shape of aerospace structures has initiated research regarding the use of Shape Memory Alloy actuators. These actuators can be used for morphing or shape change by controlling their temperature, which is effectively done by applying a voltage difference across their length. The ability to characterize this temperaturestrain relationship using Reinforcement Learning has been previously accomplished, but in order to control Shape Memory Alloy wires it is more beneficial to learn the voltage-position relationship. Numerical simulation using Reinforcement Learning has been used for determining the temperature-strain relationship for characterizing the major and minor hysteresis loops, and determining a limited control policy relating applied temperature to desired strain. Since Reinforcement Learning creates a non-parametric control policy, and there is not currently a general parametric model for this control policy, determining the voltage-position relationship for a Shape Memory Alloy is done separately. This paper extends earlier numerical simulation results and experimental results in temperature-strain space by applying a similar Reinforcement Learning algorithm to voltage-position space using an experimental hardware apparatus. Results presented in the paper show the ability to converge on a near-optimal control policy for Shape Memory Alloy length control by means of an improved Reinforcement Learning algorithm. These results demonstrate the power of Reinforcement Learning as a method of constructing a policy capable of controlling Shape Memory Alloy wire length.

[1]  Dimitris C. Lagoudas,et al.  Thermomechanical modeling of polycrystalline SMAs under cyclic loading, Part III: evolution of plastic strains and two-way shape memory effect , 1999 .

[2]  Leslie Pack Kaelbling,et al.  Automated Design of Adaptive Controllers for Modular Robots using Reinforcement Learning , 2008, Int. J. Robotics Res..

[3]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[4]  F. Falk,et al.  Pseudoelastic stress-strain curves of polycrystalline shape memory alloys calculated from single crystal data , 1989 .

[5]  Ashwin Ram,et al.  Experiments with Reinforcement Learning in Problems with Continuous State and Action Spaces , 1997, Adapt. Behav..

[6]  D. Lagoudas,et al.  Thermomechanical modeling of polycrystalline SMAs under cyclic loading, Part IV: modeling of minor hysteresis loops , 1999 .

[7]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[8]  Luciano G. Machado Shape memory alloys for vibration isolation and damping , 2007 .

[9]  Constantinos Mavroidis,et al.  5.1 CONVENTIONAL ACTUATORS, SHAPE MEMORY ALLOYS, AND ELECTRORHEOLOGICAL FLUIDS , 1999 .

[10]  D. Lagoudas Shape memory alloys : modeling and engineering applications , 2008 .

[11]  Antonio Concilio,et al.  Wing Shape Control through an SMA-Based Device , 2009 .

[12]  Jae-Sung Bae,et al.  Aeroelastic Considerations on Shape Control of an Adaptive Wing , 2005 .

[13]  Dimitris C. Lagoudas,et al.  Simplified shape memory alloy (SMA) material model for vibration isolation , 2001, SPIE Smart Structures and Materials + Nondestructive Evaluation and Health Monitoring.

[14]  Marcel Berveiller,et al.  Potentiel pseudoelastique et plasticite de transformation martensitique dans les monoet polycristaux metalliques , 1987 .

[15]  A. Kurdila,et al.  Hysteresis Modeling of SMA Actuators for Control Applications , 1998 .

[16]  Monish D. Tandale,et al.  A Reinforcement Learning - Adaptive Control Architecture for Morphing , 2004, J. Aerosp. Comput. Inf. Commun..

[17]  H. Banks,et al.  Identification of Hysteretic Control Influence Operators Representing Smart Actuators, Part II: Convergent Approximations , 1997 .

[18]  George Dimitri Konidaris,et al.  An Architecture for Behavior-Based Reinforcement Learning , 2005, Adapt. Behav..

[19]  D. Lagoudas,et al.  A UNIFIED THERMODYNAMIC CONSTITUTIVE MODEL FOR SMA AND FINITE ELEMENT ANALYSIS OF ACTIVE METAL MATRIX COMPOSITES , 1996 .

[20]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[21]  J. N. Kudva,et al.  Overview of the DARPA Smart Wing Project , 2004 .

[22]  John Yen,et al.  Design and Implementation of a Shape Memory Alloy Actuated Reconfigurable Airfoil , 2003 .

[23]  Kenton Conrad Kirkpatrick,et al.  Reinforcement Learning for Characterizing Hysteresis Behavior of Shape Memory Alloys , 2007, J. Aerosp. Comput. Inf. Commun..

[24]  Monish D. Tandale,et al.  Characterization of Shape Memory Alloy Behavior and Position Control Using Reinforcement Learning , 2005 .

[25]  Monish D. Tandale,et al.  Improved Adaptive–Reinforcement Learning Control for Morphing Unmanned Air Vehicles , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[26]  Shimon Whiteson,et al.  Empirical Studies in Action Selection with Reinforcement Learning , 2007, Adapt. Behav..