TasvirEt: A Benchmark Dataset for Automatic Turkish Description Generation from Images

Unal M. E., Citamak B., Yagcioglu S., Erdem A., Erdem E., İKİZLER CİNBİŞ N., ...

24th Signal Processing and Communication Application Conference (SIU), Zonguldak, Türkiye, 16-19 May 2016, pp. 1977-1980 (Full-Text Paper)

Publication Type: Conference Paper / Full-Text Paper
City of Publication: Zonguldak
Country of Publication: Türkiye
Pages: pp. 1977-1980
Keywords: Image captioning, computer vision, natural language processing
Affiliated with Middle East Technical University: Yes

Abstract

Automatically describing images in natural-language sentences is a challenging research problem that has only recently begun to be explored. Although the number of methods proposed for this problem keeps growing, the datasets commonly used in the field contain only English descriptions, so studies have mostly been limited to a single language, English. In this study, for the first time in the literature, we propose a new dataset that enables generating Turkish descriptions from images and can serve as a benchmark for this purpose. We also propose two approaches, again for the first time in the literature, for image captioning in Turkish using this dataset, which we named TasvirEt. Our findings indicate that the new Turkish dataset and the approaches used here can be applied successfully to describing images automatically in Turkish.