Reinforcement learning-based aggregation for robot swarms

Sadeghi Amjadi, Arash; BİLALOĞLU, CEM; TURGUT, ALİ; Na, Seongin; ŞAHİN, EROL; Krajník, Tomáš; Arvin, Farshad

doi:10.1177/10597123231202593

Reinforcement learning-based aggregation for robot swarms

Atıf İçin Kopyala

Sadeghi Amjadi A., BİLALOĞLU C., TURGUT A. E., Na S., ŞAHİN E., Krajník T., ...Daha Fazla

Adaptive Behavior, cilt.32, sa.3, ss.265-281, 2024 (SCI-Expanded)

Yayın Türü: Makale / Tam Makale
Cilt numarası: 32 Sayı: 3
Basım Tarihi: 2024
Doi Numarası: 10.1177/10597123231202593
Dergi Adı: Adaptive Behavior
Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Social Sciences Citation Index (SSCI), Scopus, Academic Search Premier, Aerospace Database, Animal Behavior Abstracts, Applied Science & Technology Source, Aquatic Science & Fisheries Abstracts (ASFA), BIOSIS, Communication Abstracts, Computer & Applied Sciences, INSPEC, Psycinfo
Sayfa Sayıları: ss.265-281
Anahtar Kelimeler: aggregation, bio-inspired, reinforcement learning, Swarm robotics
Orta Doğu Teknik Üniversitesi Adresli: Evet

Özet

Aggregation, the gathering of individuals into a single group as observed in animals such as birds, bees, and amoeba, is known to provide protection against predators or resistance to adverse environmental conditions for the whole. Cue-based aggregation, where environmental cues determine the location of aggregation, is known to be challenging when the swarm density is low. Here, we propose a novel aggregation method applicable to real robots in low-density swarms. Previously, Landmark-Based Aggregation (LBA) method had used odometric dead-reckoning coupled with visual landmarks and yielded better aggregation in low-density swarms. However, the method’s performance was affected adversely by odometry drift, jeopardizing its application in real-world scenarios. In this article, a novel Reinforcement Learning-based Aggregation method, RLA, is proposed to increase aggregation robustness, thus making aggregation possible for real robots in low-density swarm settings. Systematic experiments conducted in a kinematic-based simulator and on real robots have shown that the RLA method yielded larger aggregates, is more robust to odometry noise than the LBA method, and adapts better to environmental changes while not being sensitive to parameter tuning, making it better deployable under real-world conditions.