Markov decision processes with restricted observations: Finite horizon case

Serin, YAŞAR; AVSAR, ZEYNEP

doi:10.1002/(sici)1520-6750(199708)44:5<439::aid-nav3>3.0.co;2-5

Markov decision processes with restricted observations: Finite horizon case

Atıf İçin Kopyala

Serin Y. Y., AVSAR Z. M.

NAVAL RESEARCH LOGISTICS, cilt.44, sa.5, ss.439-456, 1997 (SCI-Expanded)

Yayın Türü: Makale / Tam Makale
Cilt numarası: 44 Sayı: 5
Basım Tarihi: 1997
Doi Numarası: 10.1002/(sici)1520-6750(199708)44:5<439::aid-nav3>3.0.co;2-5
Dergi Adı: NAVAL RESEARCH LOGISTICS
Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus
Sayfa Sayıları: ss.439-456
Orta Doğu Teknik Üniversitesi Adresli: Evet

Özet

In this article we consider a Markov decision process subject to the constraints that result from some observability restrictions. We assume that the state of the Markov process under consideration is unobservable. The states are grouped so that the group that a state belongs to is observable. So, we want to find an optimal decision rule depending on the observable groups instead of the states. This means that the same decision applies to all the states in the same group. We prove that a deterministic optimal policy exists for the finite horizon. An algorithm is developed to compute policies minimizing the total expected discounted cost over a finite horizon. (C) 1997 John Wiley & Sons, Inc.