Application of Lemke's method to a class of Markov decision problems

Nobuo Matsubayashi, Hisakazu Nishino

Research output: Article, peer-reviewed

3 citations (Scopus)

Abstract

This paper presents an application of Lemke's method to a class of Markov decision problems that appears in optimal stopping problems and other well-known optimization problems. We consider a special case of Markov decision problems with finitely many states, where the agent chooses one of two alternatives: receiving a fixed reward immediately or paying a penalty for one term. We show that the problem can be reduced to a linear complementarity problem that Lemke's method solves in fewer iterations than the number of states. The reduced linear complementarity problem does not necessarily satisfy the copositive-plus condition. Nevertheless, we show that Lemke's method succeeds in solving the problem by proving that the problem satisfies a necessary and sufficient condition for the extended Lemke's method to compute a solution in the piecewise linear complementarity problem.
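To make the reduction concrete, here is a minimal sketch of one standard way a stop-or-continue problem can be written as a linear complementarity problem (LCP). It is not taken from the paper and may differ from the authors' reduction. It assumes a discount factor `beta < 1`, a transition matrix `P`, stopping rewards `r`, and one-term penalties `c`, all hypothetical names and values chosen for illustration. With `z = v - r`, `M = I - beta*P`, and `q = M r + c`, the optimality conditions read `z >= 0`, `w = q + M z >= 0`, `z . w = 0`. The sketch solves the problem by plain value iteration, used here as a stand-in for Lemke's method, and then checks that the result satisfies the complementarity conditions.

```python
def stopping_lcp(P, r, c, beta):
    """Build (M, q) with M = I - beta*P and q = M r + c (illustrative formulation)."""
    n = len(r)
    M = [[(1.0 if i == j else 0.0) - beta * P[i][j] for j in range(n)]
         for i in range(n)]
    q = [sum(M[i][j] * r[j] for j in range(n)) + c[i] for i in range(n)]
    return M, q


def value_iteration(P, r, c, beta, tol=1e-12, max_iter=10_000):
    """Iterate v <- max(r, -c + beta*P v); this is NOT Lemke's method."""
    n = len(r)
    v = list(r)
    for _ in range(max_iter):
        new_v = [max(r[i],
                     -c[i] + beta * sum(P[i][j] * v[j] for j in range(n)))
                 for i in range(n)]
        if max(abs(a - b) for a, b in zip(new_v, v)) < tol:
            return new_v
        v = new_v
    return v


# Hypothetical 3-state instance (values chosen only for illustration).
P = [[0.5, 0.5, 0.0],
     [0.0, 0.5, 0.5],
     [0.0, 0.0, 1.0]]
r = [0.0, 1.0, 10.0]   # reward for stopping in each state
c = [1.0, 1.0, 1.0]    # penalty for continuing one term
beta = 0.9             # assumed discount factor

M, q = stopping_lcp(P, r, c, beta)
v = value_iteration(P, r, c, beta)

n = len(r)
z = [v[i] - r[i] for i in range(n)]                              # gain from continuing
w = [q[i] + sum(M[i][j] * z[j] for j in range(n)) for i in range(n)]

for i in range(n):
    action = "stop" if z[i] < 1e-9 else "continue"
    print(f"state {i}: v={v[i]:.4f} z={z[i]:.4f} w={w[i]:.4f} -> {action}")
```

In each state exactly one of the two slack quantities vanishes (up to numerical tolerance): `z[i] = 0` where stopping is optimal and `w[i] = 0` where continuing is optimal, which is the complementarity structure that a pivoting method such as Lemke's exploits.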

Original language: English
Pages (from-to): 584-590
Number of pages: 7
Journal: European Journal of Operational Research
Volume: 116
Issue number: 3
Publication status: Published - 1 Aug 1999
Externally published: Yes

ASJC Scopus subject areas

  • General Computer Science
  • Modelling and Simulation
  • Management Science and Operations Research
  • Information Systems and Management
