论文信息 - STL2vec: Signal Temporal Logic Embeddings for Control Synthesis With Recurrent Neural Networks

STL2vec: Signal Temporal Logic Embeddings for Control Synthesis With Recurrent Neural Networks

In this paper, a method for learning a recurrent neural network (RNN) controller that maximizes the robustness of signal temporal logic (STL) specifications is presented. In contrast to previous methods, we consider synthesizing the RNN controller for which the user is able to select an STL specification arbitrarily from multiple STL specifications. To obtain such a controller, we propose a novel notion called STL2vec, which represents a vector representation of the STL specifications and exhibits their similarities. The construction of the STL2vec is useful since it allows us to enhance the efficiency and performance of the RNN controller. We validate our proposed method through the examples of the path planning problem.

[1] SHAKIBA YAGHOUBI,et al. Worst-case Satisfaction of STL Specifications Using Feedforward Neural Network Controllers: A Lagrange Multipliers Approach , 2019, 2020 Information Theory and Applications Workshop (ITA).

[2] Stefanos Nikolaidis,et al. Learning From Demonstrations Using Signal Temporal Logic in Stochastic and Continuous Domains , 2021, IEEE Robotics and Automation Letters.

[3] Dejan Nickovic,et al. Monitoring Temporal Properties of Continuous Signals , 2004, FORMATS/FTRTFT.

[4] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[5] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.

[6] Jeffrey Dean,et al. Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[7] Xin Rong,et al. word2vec Parameter Learning Explained , 2014, ArXiv.

[8] Marco Pavone,et al. Backpropagation through signal temporal logic specifications: Infusing logical structure into gradient-based methods , 2020, WAFR.

[9] Calin Belta,et al. Control from Signal Temporal Logic Specifications with Smooth Cumulative Quantitative Semantics , 2019, 2019 IEEE 58th Conference on Decision and Control (CDC).

[10] S. Shankar Sastry,et al. Stochastic predictive freeway ramp metering from Signal Temporal Logic specifications , 2017, 2017 American Control Conference (ACC).

[11] Ron Koymans,et al. Specifying real-time properties with metric temporal logic , 1990, Real-Time Systems.

[12] Jyotirmoy V. Deshmukh,et al. Structured Reward Shaping using Signal Temporal Logic specifications , 2019, 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[13] Fred Kröger,et al. Temporal Logic of Programs , 1987, EATCS Monographs on Theoretical Computer Science.

[14] Calin Belta,et al. Recurrent Neural Network Controllers for Signal Temporal Logic Specifications Subject to Safety Constraints , 2020, IEEE Control Systems Letters.

[15] Calin Belta,et al. A formal methods approach to interpretable reinforcement learning for robotic planning , 2019, Science Robotics.

[16] Luca Antiga,et al. Automatic differentiation in PyTorch , 2017 .

[17] Calin Belta,et al. Model-Based Safe Policy Search from Signal Temporal Logic Specifications Using Recurrent Neural Networks , 2021, ArXiv.

[18] Calin Belta,et al. Robust temporal logic model predictive control , 2015, 2015 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[19] Alberto L. Sangiovanni-Vincentelli,et al. Model predictive control with signal temporal logic specifications , 2014, 53rd IEEE Conference on Decision and Control.

[20] Houssam Abbas,et al. Smooth operator: Control using the smooth robustness of temporal logic , 2017, 2017 IEEE Conference on Control Technology and Applications (CCTA).

[21] Calin Belta,et al. Arithmetic-Geometric Mean Robustness for Control from Signal Temporal Logic Specifications , 2019, 2019 American Control Conference (ACC).

[22] Calin Belta,et al. A Policy Search Method For Temporal Logic Specified Reinforcement Learning Tasks , 2017, 2018 Annual American Control Conference (ACC).