Predictive Control for Unknown Dynamics With Observation Loss: A Temporal Game-Theoretic Approach | IEEE Journals & Magazine | IEEE Xplore

Predictive Control for Unknown Dynamics With Observation Loss: A Temporal Game-Theoretic Approach


Abstract:

This article is concerned with a model-free predictive control problem on systems with unknown dynamics. Different from existing predictive control, the predictive feedba...Show More

Abstract:

This article is concerned with a model-free predictive control problem on systems with unknown dynamics. Different from existing predictive control, the predictive feedback strategy is designed to take control inputs to cover the infinite horizon when the system exists the observation loss. To predict future control inputs, a temporal game-theoretic approach is presented to model such a predictive control issue as an optimization problem and ensure the optimality of the system performance. Moreover, a predictive algebraic Riccati equation (PARE) is constructed to solve such an optimization problem. By leveraging offline datasets and the real-time data of state and input, a data-driven parallel computational framework is developed to iteratively solve the PARE. In this way, the prior knowledge of the systems is avoided and the computational complexity of the proposed algorithm is reduced. Finally, numerical and practical examples are presented to show the applicability of the proposed results.
Published in: IEEE Transactions on Industrial Electronics ( Volume: 71, Issue: 3, March 2024)
Page(s): 2965 - 2977
Date of Publication: 17 April 2023

ISSN Information:

Funding Agency:

References is not available for this document.

I. Introduction

The objective of predictive control is to predict the future output of a process based on the current control inputs of the system and historical information. Early studies on predictive control can be dated back to Taken's theorem proposed in 1980 [1], which reconstructed the phase space by integrating the state sequence and generating the data from the delayed sliding time window. The proposed Taken's theorem can predict the next state sequence of an autonomous system and improve the prediction robustness of dynamics. After that, the phase space reconstruction method has been widely applied [2], [3]. At the same time, one natural trend is using the predictive control algorithm to solve optimization problems. Many feasible control approaches have been developed to solve such a problem, the two main types of approaches are model predictive control (MPC) [4] and data-driven predictive control (DPC) [5].

Select All
1.
F. Takens, Detecting Strange Attractors in Turbulence, Berlin, Germany:Springer, 1981.
2.
S. Wang, H. Shuai and Q. Liu, "Phase space reconstruction driven spatio-temporal feature learning for dynamic facial expression recognition", IEEE Trans. Affect. Comput., vol. 13, no. 3, pp. 1466-1476, Jul.–Sep. 2022.
3.
H. Kim, R. Eykholt and J. Salas, "Nonlinear dynamics delay times and embedding windows", Physica D Nonlinear Phenomena, vol. 127, no. 1/2, pp. 48-60, 1999.
4.
H. Han and J. Qiao, "Nonlinear model-predictive control for industrial processes: An application to wastewater treatment process", IEEE Trans. Ind. Electron., vol. 61, no. 4, pp. 1970-1982, Apr. 2014.
5.
Y. Jiang and Z. P. Jiang, "Computational adaptive optimal control for continuous-time linear systems with completely unknown dynamics", Automatica, vol. 48, no. 10, pp. 2699-2704, 2012.
6.
X. Yang, H. Zhang and Z. Wang, "Data-based optimal consensus control for multiagent systems with policy gradient reinforcement learning", IEEE Trans. Neural Netw. Learn. Syst., vol. 33, no. 8, pp. 3872-3883, Aug. 2022.
7.
W. Gan, D. Zhu, Z. Hu, X. Shi, L. Yang and Y. Chen, "Model predictive adaptive constraint tracking control for underwater vehicles", IEEE Trans. Ind. Electron., vol. 67, no. 9, pp. 7829-7840, Sep. 2020.
8.
Y. Wang and S. Boyd, "Fast model predictive control using online optimization", IEEE Trans. Control Syst. Technol., vol. 18, no. 2, pp. 267-278, Mar. 2009.
9.
E. F. Camacho and C. B. Alba, Model Predictive Control, 2007.
10.
P. Patrinos and A. Bemporad, "An accelerated dual gradient-projection algorithm for embedded linear model predictive control", IEEE Trans. Autom. Control, vol. 59, no. 1, pp. 18-33, Jan. 2014.
11.
U. Rosolia and F. Borrelli, "Learning model predictive control for iterative tasks. A data-driven control framework", IEEE Trans. Autom. Control, vol. 63, no. 7, pp. 1883-1896, Jul. 2018.
12.
J. G. Rueda-Escobedo, E. Fridman and J. Schiffer, "Data-driven control for linear discrete-time delay systems", IEEE Trans. Autom. Control, vol. 67, no. 7, pp. 3321-3336, Jul. 2022.
13.
D. Liu and G. Yang, "Prescribed performance model-free adaptive integral sliding mode control for discrete-time nonlinear systems", IEEE Trans. Neural Netw. Learn. Syst., vol. 30, no. 7, pp. 2222-2230, Jul. 2019.
14.
Y. Zhang, L. Ma, C. Yang and W. Dai, "Reinforcement learning-based sliding mode tracking control for the two-time-scale systems: Dealing with actuator attacks", IEEE Trans. Circuits Syst. II Exp. Briefs, vol. 69, no. 9, pp. 3819-3823, Sep. 2022.
15.
L. Cao, S. Gao and D. Zhao, "Data-driven model-free sliding mode learning control for a class of discrete-time nonlinear systems", Trans. Inst. Meas. Control, vol. 42, no. 13, pp. 2533-2547, 2020.
16.
S. B. Taieb, A. Sorjamaa and G. Bontempi, "Multiple-output modeling for multi-step-ahead time series forecasting", Neurocomputing, vol. 73, no. 10-12, pp. 1950-1957, 2010.
17.
D. Xu, B. Jiang and P. Shi, "A novel model-free adaptive control design for multivariable industrial processes", IEEE Trans. Ind. Electron., vol. 61, no. 11, pp. 6391-6398, Nov. 2014.
18.
J. Coulson, J. Lygeros and F. Dörfler, "Data-enabled predictive control: In the shallows of the DeePC", Proc. IEEE 18th Eur. Control Conf., pp. 307-312, 2019.
19.
J. Berberich, J. Köhler, M. A. Müller and F. Allgöwer, "Data-driven model predictive control: Closed-loop guarantees and experimental results", AT-Automatisierungstechnik, vol. 69, no. 7, pp. 608-618, 2021.
20.
L. Ma, X. Liu, X. Kong and K. Y. Lee, "Iterative learning model predictive control based on iterative data-driven modeling", IEEE Trans. Neural Netw. Learn. Syst., vol. 32, no. 8, pp. 3377-3390, Aug. 2021.
21.
X. Yang, H. Zhang, Z. Wang, H. Yan and C. Zhang, "Data-based predictive control via multistep policy gradient reinforcement learning", IEEE Trans. Cybern., vol. 53, no. 5, pp. 2818-2828, May 2023.
22.
R. Bellman and R. Kalaba, "Dynamic programming and adaptive processes: Mathematical foundation", IRE Trans. Autom. Control, no. 1, pp. 5-10, 1960.
23.
R. Bellman, "Dynamic programming", Science, vol. 153, no. 3731, pp. 34-37, 1966.
24.
R. Gopalakrishnan, J. R. Marden and A. Wierman, "An architectural view of game theoretic control", ACM Sigmetrics Perform. Eval. Rev., vol. 38, no. 3, pp. 31-36, 2011.
25.
H. Zhang, H. Jiang, Y. Luo and G. Xiao, "Data-driven optimal consensus control for discrete-time multi-agent systems with unknown dynamics using reinforcement learning method", IEEE Trans. Ind. Electron., vol. 64, no. 5, pp. 4091-4100, May 2017.
26.
J. Moon and H. J. Yang, "Linear-quadratic time-inconsistent mean-field type Stackelberg differential games: Time-consistent open-loop solutions", IEEE Trans. Autom. Control, vol. 66, no. 1, pp. 375-382, Jan. 2021.
27.
C. Peng and W. Zhang, "Pareto optimality in infinite horizon mean-field stochastic cooperative linear-quadratic difference games", IEEE Trans. Autom. Control, Aug. 2022.
28.
T. Bjork and A. Murgoci, "A general theory of Markovian time inconsistent stochastic control problems", Dec. 2010, [online] Available: https://ssrn.com/abstract=1694759.
29.
Y. Ni, J. Zhang and M. Krstic, "Time-inconsistent mean-field stochastic LQ problem: Open-loop time-consistent control", IEEE Trans. Autom. Control, vol. 63, no. 9, pp. 2771-2786, Sep. 2018.
30.
D. Martin, Manifold theory: An introduction for mathematical physicists, 2002.

Contact IEEE to Subscribe

References

References is not available for this document.