I. INTRODUCTION
Partially Observable Markov Decision Processes (POMDPs) have a wide range of applications, including finance, medicine, communications, and others; maintenance is a major area of POMDP application. Formally, a Markov Decision Process (MDP) can be defined as a dynamic decision-making framework that aims to optimally control a Markov stochastic process over a given number of future stages, where a set of available control actions influences the state transitions of the Markov chain at each stage. A POMDP is a generalization of the MDP in which the true state of the system is not known exactly to the controller or decision maker; instead, an output signal can be measured from the system. This signal is assumed to be probabilistically related to the true state of the system. Hence, the system is “partially observed,” and control actions are made based on the “belief state” of the system. The belief state is a state occupancy vector with one element per system state; each element represents the probability that the system is in the corresponding state. A POMDP decision-making framework mainly consists of the following steps:
Take control action
Gain or loss takes place
Observe a signal from the system
Update belief state (state occupancy vector)
Start next stage
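The belief-state update in these steps is a Bayes filter: after taking an action and observing a signal, the prior belief is propagated through the transition probabilities and then reweighted by the observation likelihoods. The following is a minimal sketch of one such update; the two-state "maintenance" model, the transition matrix `T`, and the observation matrix `O` are illustrative assumptions, not taken from the text.

```python
# Sketch of a belief-state update for a POMDP, assuming a toy
# two-state maintenance model (matrices below are illustrative).

def update_belief(belief, T, O, signal):
    """Bayes update: b'(s') is proportional to O[s'][signal] * sum_s T[s][s'] * b[s]."""
    n = len(belief)
    # Prediction step: propagate the belief through the transition matrix.
    predicted = [sum(belief[s] * T[s][s2] for s in range(n)) for s2 in range(n)]
    # Correction step: weight each state by the likelihood of the observed signal.
    unnorm = [O[s2][signal] * predicted[s2] for s2 in range(n)]
    total = sum(unnorm)
    # Normalize so the belief remains a probability (state occupancy) vector.
    return [p / total for p in unnorm]

# Illustrative two-state system: state 0 = "good", state 1 = "degraded".
T = [[0.9, 0.1],   # transition probabilities under some fixed action
     [0.0, 1.0]]   # degradation is absorbing in this toy model
O = [[0.8, 0.2],   # P(signal | true state); signal 0 suggests "good"
     [0.3, 0.7]]

belief = [0.5, 0.5]                         # start with no information
belief = update_belief(belief, T, O, signal=0)
# Observing signal 0 shifts the belief toward the "good" state.
```

In a full POMDP controller, this update would run once per stage, between observing the signal and choosing the next control action.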