I. Introduction
Adaptive dynamic programming (ADP) originated from dynamic programming [1] and reinforcement learning (RL) [2], and it is an efficient method for solving optimal control problems. Compared with the traditional approach of directly solving the Hamilton–Jacobi–Bellman (HJB) equation, ADP is applicable to systems with unknown models and is able to alleviate the "curse of dimensionality" [3], [4]. The method has shown great potential in wastewater systems [5], power systems [6], [7], aerospace [8], cyber security [9], [10], and other fields.

The use of function approximators is a pivotal element in the success of ADP. However, this component also introduces challenges, most notably approximation errors. A widely used assumption is that the function approximators achieve perfect approximation [11], [12], but this rarely holds for nonlinear systems. As approximation errors propagate through the iterative process, even small errors may accumulate and trigger a resonance-like phenomenon that seriously degrades the stability of the system. In safety-critical applications involving personal and property safety, such effects can lead to severe consequences. Analyzing the influence of approximation errors is therefore of significant importance in ADP.
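To make the error-propagation issue concrete, consider a generic approximate value-iteration scheme of the kind commonly studied in ADP; the notation below (utility function $U$, system dynamics $f$, iteration index $k$, and per-iteration approximation error $\varepsilon_k$) is introduced here only for illustration and is not taken from the cited works:
\[
\hat{V}_{k+1}(x) = \min_{u}\bigl\{\, U(x,u) + \hat{V}_{k}\bigl(f(x,u)\bigr) \,\bigr\} + \varepsilon_k(x),
\]
where $\hat{V}_k$ denotes the value function produced by the function approximator at iteration $k$. Even if each $\varepsilon_k$ is small, the error committed at iteration $k$ enters the target used at iteration $k+1$, so the deviation between $\hat{V}_k$ and the exact value-iteration sequence may accumulate over iterations rather than vanish.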