Geometric convergence of value-iteration in multichain Markov decision problems

Abstract

This paper considers undiscounted Markov decision problems. With no restriction (on either the periodicity or chain structure of the problem) we show that the value iteration method for finding maximal gain policies exhibits a geometric rate of convergence, whenever convergence occurs. In addition, we study the behaviour of the value-iteration operator; we give bounds for the number of steps needed for contraction, describe the ultimate behaviour of the convergence factor and give conditions for the existence of a uniform convergence rate.

Authors: Awi Federgruen

Format: Journal Article

Publication Date: March 1, 1979

Journal: Advances in Applied Probability

Full Citation

Federgruen, Awi

. “Geometric convergence of value-iteration in multichain Markov decision problems.”

Advances in Applied Probability

vol.

, (March 01, 1979):

188

217

Abstract

Full Citation

External CSS

Accessibility Panel

Language Settings