Could We Create A Training Set For Image Captioning Using Automatic Translation?

SAMET N., Hicsonmez S., Duygulu P., AKBAŞ E.

25th Signal Processing and Communications Applications Conference (SIU), Antalya, Türkiye, 15 - 18 Mayıs 2017, (Tam Metin Bildiri)

Yayın Türü: Bildiri / Tam Metin Bildiri
Cilt numarası:
Doi Numarası: 10.1109/siu.2017.7960638
Basıldığı Şehir: Antalya
Basıldığı Ülke: Türkiye
Anahtar Kelimeler: Image captioning, computer vision, machine translation
Orta Doğu Teknik Üniversitesi Adresli: Evet

Özet

Automatic image captioning has received increasing attention in recent years. Although there are many English datasets developed for this problem, there is only one Turkish dataset and it is very small compared to its English counterparts. Creating a new dataset for image captioning is a very costly and time consuming task. This work is a first step towards transferring the available, large English datasets into Turkish. We translated English captioning datasets into Turkish by using an automated translation tool and we trained an image captioning model on the automatically obtained Turkish captions. Our experiments show that this model yields the best performance so far on Turkish captioning.