Could We Create A Training Set For Image Captioning Using Automatic Translation?

SAMET N., Hicsonmez S., Duygulu P., AKBAŞ E.

25th Signal Processing and Communications Applications Conference (SIU), Antalya, Turkey, 15 - 18 May 2017

  • Publication Type: Conference Paper / Full Text
  • Volume:
  • Doi Number: 10.1109/siu.2017.7960638
  • City: Antalya
  • Country: Turkey
  • Keywords: Image captioning, computer vision, machine translation
  • Middle East Technical University Affiliated: Yes


Automatic image captioning has received increasing attention in recent years. Although many English datasets have been developed for this problem, there is only one Turkish dataset, and it is very small compared to its English counterparts. Creating a new dataset for image captioning is a very costly and time-consuming task. This work is a first step towards transferring the available large English datasets into Turkish. We translated English captioning datasets into Turkish using an automated translation tool and trained an image captioning model on the automatically obtained Turkish captions. Our experiments show that this model yields the best performance so far on Turkish captioning.
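
The dataset-translation step described in the abstract can be illustrated with a minimal sketch. This is not the authors' code: the function name translate_captions, the dictionary layout (image file name mapped to a list of captions), and the toy lookup table standing in for the automated translation tool are all assumptions made for illustration. In practice the translate argument would wrap whatever translation tool is used.

```python
from typing import Callable, Dict, List


def translate_captions(
    captions: Dict[str, List[str]],
    translate: Callable[[str], str],
) -> Dict[str, List[str]]:
    """Return a new dataset in which every caption has been passed through `translate`.

    `captions` maps an image identifier to its list of captions (assumed layout).
    `translate` is any callable that maps a source sentence to a target sentence.
    """
    cache: Dict[str, str] = {}  # avoid translating the same sentence twice
    translated: Dict[str, List[str]] = {}
    for image_id, texts in captions.items():
        out: List[str] = []
        for text in texts:
            if text not in cache:
                cache[text] = translate(text)
            out.append(cache[text])
        translated[image_id] = out
    return translated


# Toy stand-in for a real translation tool (hypothetical, for illustration only).
toy_dictionary = {
    "a dog runs on the grass": "bir köpek çimlerde koşuyor",
    "two children play football": "iki çocuk futbol oynuyor",
}

english = {
    "img_001.jpg": ["a dog runs on the grass"],
    "img_002.jpg": ["two children play football"],
}

# Sentences missing from the toy table are left untranslated.
turkish = translate_captions(english, lambda s: toy_dictionary.get(s, s))
```

The image identifiers are left unchanged, so the translated captions can be paired with the original images when training a captioning model.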