A Principled Approximation Framework for Optimal Control of Semi-Markov Jump Linear Systems

We consider continuous-time, finite-horizon, optimal quadratic control of semi-Markov jump linear systems (S-MJLS), and develop principled approximations through Markov-like representations for the holding-time distributions. We adopt a phase-type approximation for holding-time distributions, which is known to be consistent, and translates an S-MJLS into a specific MJLS with partially observable modes (MJLSPOM), where the modes in a cluster have the same dynamic, the same cost weighting matrices, and the same control policy. For a general MJLSPOM, we give necessary and sufficient conditions for optimal (switched) linear controllers. When specialized to our particular MJLSPOM, we additionally establish the existence of an optimal linear controller, as well as its optimality within the class of general controllers satisfying standard smoothness conditions. The known equivalence between phase-type distributions and positive linear systems allows us to leverage existing modeling tools, but possibly with large computational costs. Motivated by this, we propose a matrix-exponential approximation of holding-time distributions, resulting in pseudo-MJLSPOM representation, wherein the transition rates could be negative. Such a representation is of relatively low order, and maintains the same optimality conditions as for the MJLSPOM representation, but could violate nonnegativity of holding-time density functions. A two-step procedure consisting of a local pulling-up modification and a filtering technique is constructed to enforce nonnegativity.

[1]  C. O'Cinneide Phase-type distributions: open problems and a few properties , 1999 .

[2]  Qi-Ming He,et al.  Fundamentals of Matrix-Analytic Methods , 2013, Springer New York.

[3]  William G. Marchal,et al.  Distribution Estimation Using Laplace Transforms , 1998, INFORMS J. Comput..

[4]  Alessandro N. N. Vargas Advances in the Control of Markov Jump Linear Systems with No Mode Observation , 2016 .

[5]  H. Chizeck,et al.  Controllability, stabilizability, and continuous-time Markovian jump linear quadratic control , 1990 .

[6]  Taylor L. Booth Statistical Properties of Random Digital Sequences , 1968, IEEE Transactions on Computers.

[7]  S. Rinaldi,et al.  Positive Linear Systems: Theory and Applications , 2000 .

[8]  Michael Athans,et al.  The Matrix Minimum Principle , 1967, Inf. Control..

[9]  B. Conolly Structured Stochastic Matrices of M/G/1 Type and Their Applications , 1991 .

[10]  A. Cumani On the canonical representation of homogeneous markov processes modelling failure - time distributions , 1982 .

[11]  Saeid Jafari,et al.  On the optimal control of a class of degradable systems modeled by semi-Markov jump linear systems , 2017, 2017 IEEE 56th Annual Conference on Decision and Control (CDC).

[12]  Weiyi Liu,et al.  Probabilistic Trajectory Prediction and Conflict Detection for Air Traffic Control , 2011 .

[13]  K. A. Loparo,et al.  A probabilistic approach to dynamic power system security , 1990 .

[14]  Mark Fackrell,et al.  Fitting with Matrix-Exponential Distributions , 2005 .

[15]  Christopher J. BISHOPAbstra,et al.  Orthogonal Functions , 2022 .

[16]  Christian Commault,et al.  Phase-type distributions and representations: Some results and open problems for system theory , 2003 .

[17]  Franciszek Grabski,et al.  Semi-Markov Processes: Applications in System Reliability and Maintenance , 2014 .

[18]  Hassan S. Bakouch,et al.  Probability, Markov chains, queues, and simulation , 2011 .

[19]  Filippo Petroni,et al.  Reliability measures for indexed semi-Markov chains applied to wind energy production , 2013, Reliab. Eng. Syst. Saf..

[20]  A. Antoulas,et al.  H 2 Model Reduction for Large-scale Linear Dynamical Systems * , 2022 .

[21]  W. Wonham Random differential equations in control theory , 1970 .

[22]  Tadashi Dohi,et al.  A refined EM algorithm for PH distributions , 2011, Perform. Evaluation.

[23]  James Lam,et al.  Internal positivity preserved model reduction , 2010, Int. J. Control.

[24]  M. L. Shooman A study of occurrence rates of EMI to aircraft with a focus on HIRF , 1993, [1993 Proceedings] AIAA/IEEE Digital Avionics Systems Conference.

[25]  Philipp Reinecke,et al.  Program packages for computations with PH, ME distributions and MAP, RAP processes , 2014 .

[26]  Peter Buchholz,et al.  A novel approach for fitting probability distributions to real trace data with the EM algorithm , 2005, 2005 International Conference on Dependable Systems and Networks (DSN'05).

[27]  Khashayar Khorasani,et al.  A Decentralized Markovian Jump ${\cal H}_{\infty}$ Control Routing Strategy for Mobile Multi-Agent Networked Systems , 2011, IEEE Transactions on Control Systems Technology.

[28]  C. O'Cinneide Characterization of phase-type distributions , 1990 .

[29]  Mark Fackrell,et al.  A semi-infinite programming approach to identifying matrix-exponential distributions , 2012, Int. J. Syst. Sci..

[30]  M. Perman,et al.  Semi-Markov models with an application to power-plant reliability analysis , 1997 .

[31]  Filippo Petroni,et al.  First and second order semi-Markov chains for wind speed modeling , 2012, 1206.2452.

[32]  Peng Shi,et al.  Stochastic stability of Ito differential equations with semi-Markovian jump parameters , 2006, IEEE Transactions on Automatic Control.

[33]  Yang Shi,et al.  Stochastic stability and robust stabilization of semi‐Markov jump linear systems , 2013 .

[34]  M. Connor Calculus of Variations and Optimal Control Theory , 1967 .

[35]  Patrizio Colaneri,et al.  Stability and Stabilization of Semi-Markov Jump Linear Systems With Exponentially Modulated Periodic Distributions of Sojourn Time , 2017, IEEE Transactions on Automatic Control.

[36]  Gunter Bolch,et al.  Queueing Networks and Markov Chains , 2005 .

[37]  Gerhard Kurz,et al.  Finite-horizon dynamic compensation of Markov Jump Linear Systems without mode observation , 2016, 2016 IEEE 55th Conference on Decision and Control (CDC).

[38]  C. Singh Equivalent Rate Approach to Semi-Markov Processes , 1980, IEEE Transactions on Reliability.

[39]  M. Hestenes Calculus of variations and optimal control theory , 1966 .

[40]  L. Farina On the existence of a positive realization , 1996 .

[41]  M. Khammash,et al.  Decentralized power system stabilizer design using linear parameter varying approach , 2004, IEEE Transactions on Power Systems.

[42]  H. Neudecker A Note on Kronecker Matrix Products and Matrix Equation Systems , 1969 .

[43]  J. Magnus,et al.  Matrix differential calculus with applications to simple, Hadamard, and Kronecker products. , 1985 .

[44]  Patrizio Colaneri,et al.  Stability and Stabilization of Discrete-Time Semi-Markov Jump Linear Systems via Semi-Markov Kernel Approach , 2016, IEEE Transactions on Automatic Control.

[45]  A. Bobbio,et al.  A benchmark for ph estimation algorithms: results for acyclic-ph , 1994 .

[46]  Ligang Wu,et al.  State estimation and sliding mode control for semi-Markovian jump systems with mismatched uncertainties , 2015, Autom..

[47]  Hiroyuki Okamura,et al.  Fitting Phase-Type Distributions and Markovian Arrival Processes: Algorithms and Tools , 2016 .