I. Introduction
Neural Networks (NNs) have been widely used in the controller designs for nonlinear systems where stability is typically the only consideration [1], [2]. Optimality, which is difficult to achieve, is generally preferred for the control of nonlinear systems than stability alone. The optimal control of linear systems accompanied by quadratic cost functions can be achieved by solving the well-known Riccati equation [3], however, the optimal control of nonlinear discrete-time systems often requires solving the nonlinear Hamilton–Jacobi–Bellman (HJB) equation, which does not have a closed-form solution. To extend the results of linear optimal control theory to nonlinear systems, the state dependent Riccati equation (SDRE) [4] was proposed for suboptimal control under certain tight assumptions including the need for full system dynamics.