Markov decision processes under observability constraints

Serin, YAŞAR; Kulkarni, V

doi:10.1007/s001860400402

Serin Y. Y., Kulkarni V.

MATHEMATICAL METHODS OF OPERATIONS RESEARCH, vol.61, no.2, pp.311-328, 2005 (SCI-Expanded, Scopus)

Publication Type: Article / Full Article
Volume: 61 Issue: 2
Publication Date: 2005
DOI Number: 10.1007/s001860400402
Journal Name: MATHEMATICAL METHODS OF OPERATIONS RESEARCH
Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus
Pages: pp.311-328
Affiliated with Middle East Technical University: Yes

Abstract

We develop an algorithm to compute optimal policies for Markov decision processes subject to constraints that arise from observability restrictions on the process. The state of the Markov process is assumed to be unobservable, but an observable process related to the state is available, so we seek a decision rule that depends only on this observable process. The objective is to minimize the expected average cost over an infinite horizon. We also analyze the possibility of making more detailed observations to obtain improved policies.
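
To make the problem setting concrete, the following is a minimal sketch and not the algorithm of the paper. It assumes a hypothetical toy model in which each hidden state maps to exactly one observation, restricts attention to deterministic decision rules, and finds the best rule by brute-force enumeration. All numbers and names (OBS, P, COST, stationary, average_cost, best_policy) are illustrative assumptions.

```python
from itertools import product

# Hypothetical toy model (not from the paper): 3 hidden states, 2 actions.
# OBS[s] is the observation produced in state s; states 0 and 1 look identical.
OBS = [0, 0, 1]
# P[a][s][t]: probability of moving from state s to state t under action a.
P = [
    [[0.7, 0.2, 0.1], [0.3, 0.5, 0.2], [0.2, 0.3, 0.5]],  # action 0
    [[0.2, 0.3, 0.5], [0.1, 0.2, 0.7], [0.6, 0.3, 0.1]],  # action 1
]
# COST[s][a]: one-step cost of taking action a in state s.
COST = [[1.0, 4.0], [3.0, 0.5], [2.0, 2.5]]


def stationary(T, iters=500):
    """Approximate the stationary distribution of T by power iteration.

    Assumes T is a strictly positive stochastic matrix, so the iteration
    converges to a unique limit.
    """
    n = len(T)
    pi = [1.0 / n] * n
    for _ in range(iters):
        pi = [sum(pi[i] * T[i][j] for i in range(n)) for j in range(n)]
    return pi


def average_cost(policy, obs):
    """Long-run average cost of a rule that maps observations to actions."""
    n = len(obs)
    actions = [policy[obs[s]] for s in range(n)]
    # Transition matrix induced by the rule: row s uses the action chosen in s.
    T = [P[actions[s]][s] for s in range(n)]
    pi = stationary(T)
    return sum(pi[s] * COST[s][actions[s]] for s in range(n))


def best_policy(obs, n_actions=2):
    """Enumerate all deterministic observation-based rules; keep the cheapest."""
    n_obs = max(obs) + 1
    candidates = product(range(n_actions), repeat=n_obs)
    return min(candidates, key=lambda p: average_cost(p, obs))


if __name__ == "__main__":
    coarse = best_policy(OBS)        # rule sees only two observation classes
    fine = best_policy([0, 1, 2])    # rule sees the state exactly
    print("coarse:", coarse, average_cost(coarse, OBS))
    print("fine:  ", fine, average_cost(fine, [0, 1, 2]))
```

Passing a finer observation map, such as [0, 1, 2], enlarges the set of admissible rules, so the best achievable average cost in this sketch can only stay the same or decrease. Enumeration grows exponentially in the number of observations, so it is only practical for very small models.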