Effectiveness of considering state similarity for reinforcement learning


Girgin S., Polat F. , Alhajj R.

INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2006, PROCEEDINGS, cilt.4224, ss.163-171, 2006 (SCI İndekslerine Giren Dergi) identifier

  • Cilt numarası: 4224
  • Basım Tarihi: 2006
  • Dergi Adı: INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2006, PROCEEDINGS
  • Sayfa Sayıları: ss.163-171

Özet

This paper presents a novel approach that locates states with similar sub-policies, and incorporates them into the reinforcement learning framework for better learning performance. This is achieved by identifying common action sequences of states, which are derived from possible optimal policies and reflected into a tree structure. Based on the number of such sequences, we define a similarity function between two states, which helps to reflect updates on the action-value function of a state to all similar states. This way, experience acquired during learning can be applied to a broader context. The effectiveness of the method is demonstrated empirically.