A NONLINEAR-PROGRAMMING MODEL FOR PARTIALLY OBSERVABLE MARKOV DECISION-PROCESSES - FINITE-HORIZON CASE

SERIN, YAŞAR

doi:10.1016/0377-2217(94)00091-p

A NONLINEAR-PROGRAMMING MODEL FOR PARTIALLY OBSERVABLE MARKOV DECISION-PROCESSES - FINITE-HORIZON CASE

SERIN Y. Y.

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, cilt.86, sa.3, ss.549-564, 1995 (SSCI, Scopus)

Yayın Türü: Makale / Tam Makale
Cilt numarası: 86 Sayı: 3
Basım Tarihi: 1995
Doi Numarası: 10.1016/0377-2217(94)00091-p
Dergi Adı: EUROPEAN JOURNAL OF OPERATIONAL RESEARCH
Derginin Tarandığı İndeksler: Social Sciences Citation Index (SSCI), Scopus
Sayfa Sayıları: ss.549-564
Orta Doğu Teknik Üniversitesi Adresli: Evet

Özet

The concept of partially observable Markov decision processes was born to handle the problem of lack of information about the state of a Markov decision process. If the state of the system is unknown to the decision maker then an obvious approach is to gather information that is helpful in selecting an action, This problem was already solved using the theory of Markov processes. We construct a nonlinear programming model for the same problem and develop a solution algorithm that turns out to be a policy iteration algorithm. The policies found this way are easier to use than the policies found by the existing method, although they have the same optimal objective value.