Learning intelligent behavior in a non-stationary and partially observable environment


Senkul S., Polat F.

ARTIFICIAL INTELLIGENCE REVIEW, vol.18, no.2, pp.97-115, 2002 (SCI-Expanded) identifier identifier

  • Publication Type: Article / Article
  • Volume: 18 Issue: 2
  • Publication Date: 2002
  • Doi Number: 10.1023/a:1019935502139
  • Journal Name: ARTIFICIAL INTELLIGENCE REVIEW
  • Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus
  • Page Numbers: pp.97-115
  • Keywords: agent learning, multi-agent systems, Q-learning, reinforcement learning, REINFORCEMENT
  • Middle East Technical University Affiliated: Yes

Abstract

Individual learning in an environment where more than one agent exist is a challenging task. In this paper, a single learning agent situated in an environment where multiple agents exist is modeled based on reinforcement learning. The environment is non-stationary and partially accessible from an agents' point of view. Therefore, learning activities of an agent is influenced by actions of other cooperative or competitive agents in the environment. A prey-hunter capture game that has the above characteristics is defined and experimented to simulate the learning process of individual agents. Experimental results show that there are no strict rules for reinforcement learning. We suggest two new methods to improve the performance of agents. These methods decrease the number of states while keeping as much state as necessary.