Using Generative Adversarial Nets on Atari Games for Feature Extraction in Deep Reinforcement Learning

AYDIN A., SÜRER E.

28th Signal Processing and Communications Applications Conference (SIU), ELECTR NETWORK, 5-7 October 2020, (Full-Text Paper)

Publication Type: Conference Paper / Full-Text Paper
DOI Number: 10.1109/siu49456.2020.9302454
Country of Publication: ELECTR NETWORK
Keywords: deep learning, reinforcement learning, generative adversarial networks
Affiliated with Middle East Technical University: Yes

Abstract

Deep Reinforcement Learning (DRL) has been successfully applied in several research domains, such as robot navigation and automated video game playing. However, these methods require excessive computation and interaction with the environment, so improvements in sample efficiency are needed. The main reason for this requirement is that sparse and delayed rewards do not provide effective supervision for the representation learning of deep neural networks. In this study, the Proximal Policy Optimization (PPO) algorithm is augmented with Generative Adversarial Networks (GANs) to increase sample efficiency by forcing the network to learn efficient representations without depending on sparse and delayed rewards as supervision. The results show that performance can be improved by jointly training a DRL agent with a GAN discriminator.
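
The abstract does not say how the PPO objective and the GAN discriminator objective are combined during joint training. As a rough illustration only, the sketch below assumes a simple weighted sum of the standard PPO clipped surrogate loss and the standard discriminator binary cross-entropy loss. The function names, the `gan_coef` weight, and the default `clip_eps` value are hypothetical choices for this example and are not taken from the paper. In a real implementation the policy and the discriminator would presumably share feature-extraction layers, which this loss-only sketch does not model.

```python
import math


def ppo_clip_loss(ratios, advantages, clip_eps=0.2):
    """Negative PPO clipped surrogate objective, averaged over samples.

    ratios:     new_policy_prob / old_policy_prob for each sampled action
    advantages: advantage estimate for each sampled action
    clip_eps:   clipping range (assumed value, not from the paper)
    """
    total = 0.0
    for r, a in zip(ratios, advantages):
        clipped_r = max(1.0 - clip_eps, min(r, 1.0 + clip_eps))
        # Take the pessimistic (smaller) of the unclipped and clipped terms.
        total += min(r * a, clipped_r * a)
    return -total / len(ratios)


def discriminator_loss(real_probs, fake_probs, eps=1e-8):
    """Standard GAN discriminator binary cross-entropy.

    real_probs: discriminator outputs D(x) on real frames, in (0, 1)
    fake_probs: discriminator outputs D(G(z)) on generated frames, in (0, 1)
    """
    real_term = sum(math.log(p + eps) for p in real_probs) / len(real_probs)
    fake_term = sum(math.log(1.0 - p + eps) for p in fake_probs) / len(fake_probs)
    return -(real_term + fake_term)


def joint_loss(ratios, advantages, real_probs, fake_probs, gan_coef=0.5):
    """Hypothetical joint objective: PPO loss plus a weighted discriminator loss."""
    return ppo_clip_loss(ratios, advantages) + gan_coef * discriminator_loss(
        real_probs, fake_probs
    )


if __name__ == "__main__":
    # Toy numbers purely for illustration; they are not results from the paper.
    loss = joint_loss(
        ratios=[1.1, 0.9, 1.5],
        advantages=[0.5, -0.2, 1.0],
        real_probs=[0.8, 0.7],
        fake_probs=[0.3, 0.4],
    )
    print(f"joint loss: {loss:.4f}")
```

Under this assumption, gradients from the discriminator term would reach any layers it shares with the policy on every update, regardless of whether a reward was observed, which is one plausible reading of how the GAN supplies supervision when rewards are sparse and delayed.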