1 INTRODUCTION
A large class of real systems is controlled by more than one controller or decision maker, each using an individual strategy. These controllers often operate as a group under a general quadratic performance index function, a setting studied in game theory, which has been widely applied in management, military battles, power networks, and various types of contests [1]–[6]. The two-player zero-sum game with a general quadratic performance index function is an important part of game theory: the two players act on the performance index function jointly, one minimizing it while the other maximizes it, so the solution is a minimax problem. Over the past decades, the optimal strategies of the linear zero-sum game and the affine nonlinear zero-sum game have received a great deal of attention in the literature [6], [8]–[12], [15]. These systems have the form $$\dot{x}(t)=f(x)+g(x)u+k(x)d \eqno{\hbox{(1)}}$$ with the performance index function $$V(x)={1\over 2}\int_{t_{0}}^{\infty}(x^{T}x+u^{T}u-\gamma^{2}d^{T}d)dt \eqno{\hbox{(2)}}$$ where $x$ is the state, and $u$ and $d$ are the inputs: $u$ seeks to minimize the performance index function while $d$ seeks to maximize it. In [8], Al-Tamimi et al. applied the heuristic dynamic programming and dual heuristic dynamic programming structures to solve a discrete-time linear quadratic zero-sum game problem in which the state and action spaces are continuous. They then designed the optimal strategies of the discrete-time linear quadratic zero-sum game without knowing the system dynamics matrices, using a model-free Q-learning approach [9]. A class of continuous-time affine nonlinear quadratic zero-sum game problems was studied by Wei et al. in [13]. Abu-Khalaf et al. studied the affine nonlinear zero-sum game problem in [10] and used neural networks to solve it in [11]. It is worth mentioning that most of the above discussions focus on linear or affine nonlinear zero-sum game problems.
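For concreteness, the saddle-point solution of (1) and (2) admits a standard characterization via the Hamilton–Jacobi–Isaacs (HJI) equation; the following is a textbook-style sketch, not a result quoted from [8]–[13], and it assumes a smooth value function exists, with $\nabla V^{\ast}$ denoting $\partial V^{\ast}/\partial x$: $$V^{\ast}(x)=\min_{u}\max_{d}{1\over 2}\int_{t_{0}}^{\infty}(x^{T}x+u^{T}u-\gamma^{2}d^{T}d)dt$$ Setting the derivatives of the Hamiltonian with respect to $u$ and $d$ to zero yields the strategies $u^{\ast}=-g^{T}(x)\nabla V^{\ast}$ and $d^{\ast}={1\over\gamma^{2}}k^{T}(x)\nabla V^{\ast}$, and substituting them back gives the HJI equation $$0={1\over 2}x^{T}x+(\nabla V^{\ast})^{T}f(x)-{1\over 2}(\nabla V^{\ast})^{T}g(x)g^{T}(x)\nabla V^{\ast}+{1\over 2\gamma^{2}}(\nabla V^{\ast})^{T}k(x)k^{T}(x)\nabla V^{\ast}$$ which the works cited above solve, exactly or approximately, for the linear and affine nonlinear cases.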