Conferences >Proceedings of the 45th IEEE ...

Approximate Dynamic Programming Based on Expansive Projections

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

We present a general method to obtain convergent approximate value iteration algorithms with function approximation. The result is applicable to any arbitrary approximati...Show More

Metadata

Abstract:

We present a general method to obtain convergent approximate value iteration algorithms with function approximation. The result is applicable to any arbitrary approximation architecture and generalizes existing results in the literature derived for particular approximation schemes. Additionally, we show how to obtain a convergent approximate mapping whose fixed point is the projection in the approximation space of a fixed point of the exact dynamic programming mapping with regards to a suitable subset norm. This result relies on evaluating the difference between successive iterates in the selected subset norm, which provides convergent procedures for any arbitrary approximation architecture.

Published in: Proceedings of the 45th IEEE Conference on Decision and Control

Date of Conference: 13-15 December 2006

Date Added to IEEE Xplore: 07 May 2007

Print ISBN:1-4244-0171-2

Print ISSN: 0191-2216

DOI: 10.1109/CDC.2006.376823

Conference Location: San Diego, CA, USA

Contents

I. Introduction

It is a well known fact that the process of finding the optimal solution to large scale Markov Decision Processes (MDP) is generally very demanding [9]. Approximate Dynamic Programming (ADP) algorithms [11] offer a plausible alternative of finding approximate solutions to MDP problems that would otherwise be intractable. One appealing ADP technique consists of incorporating a function approximation scheme into the problem in order to artificially reduce its complexity and searching for an approximate solution in a lower dimensional space [3], [11], [12]. Even though this approach has proved successful in practical applications, e.g. [14], there exist divergent counter-examples that make a strong case against the robustness of the technique [4].

References is not available for this document.

MIT Libraries

MIT Libraries

Approximate Dynamic Programming Based on Expansive Projections

Abstract:

Metadata

Abstract:

I. Introduction

References

IEEE Account

Purchase Details

Profile Information

Need Help?

MIT Libraries

MIT Libraries

Approximate Dynamic Programming Based on Expansive Projections

Alerts

Abstract:

Metadata

Abstract:

I. Introduction

References