Abstract:
In this paper we present the stability and convergence results for dynamic programming-based reinforcement learning applied to linear quadratic regulation (LQR). The spec...Show MoreMetadata
Abstract:
In this paper we present the stability and convergence results for dynamic programming-based reinforcement learning applied to linear quadratic regulation (LQR). The specific algorithm we analyze is based on Q-learning and it is proven to converge to an optimal controller provided that the underlying system is controllable and a particular signal vector is persistently excited. This is the first convergence result for DP-based reinforcement learning algorithms for a continuous problem.
Date of Conference: 29 June 1994 - 01 July 1994
Date Added to IEEE Xplore: 06 August 2002
Print ISBN:0-7803-1783-1