Learning sequences of compatible actions among agents


Polat F., Abul O.

ARTIFICIAL INTELLIGENCE REVIEW, vol.17, no.1, pp.21-37, 2002 (SCI-Expanded)

  • Publication Type: Article
  • Volume: 17 Issue: 1
  • Publication Date: 2002
  • Doi Number: 10.1023/a:1015009422110
  • Journal Name: ARTIFICIAL INTELLIGENCE REVIEW
  • Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus
  • Page Numbers: pp.21-37
  • Middle East Technical University Affiliated: Yes

Abstract

Action coordination in multiagent systems is a difficult task, especially in dynamic environments. The task becomes even more difficult if the environment imposes cooperation, least communication, incompatibility, and local information constraints. This work studies learning compatible action sequences to achieve a designated goal under these constraints. Two new multiagent learning algorithms, called QACE and NoCommQACE, are developed. To improve the performance of QACE and NoCommQACE, four heuristics are developed: state iteration, means-ends analysis, decreasing reward, and do-nothing. The proposed algorithms are tested on the blocks world domain, and the performance results are reported.