TasvirEt: A Benchmark Dataset for Automatic Turkish Description Generation from Images


Unal M. E., Citamak B., Yagcioglu S., Erdem A., Erdem E., İKİZLER CİNBİŞ N., et al.

24th Signal Processing and Communication Application Conference (SIU), Zonguldak, Turkey, 16 - 19 May 2016, pp. 1977-1980

  • Publication Type: Conference Paper / Full Text
  • City: Zonguldak
  • Country: Turkey
  • Page Numbers: pp.1977-1980
  • Keywords: Image captioning, computer vision, natural language processing

Abstract

Automatically describing images with natural-language sentences is considered a challenging research problem that has only recently begun to be explored. Although the number of methods proposed for this problem keeps growing, the datasets commonly used in the field contain only English descriptions, so studies have mostly been limited to a single language, English. This study proposes, for the first time in the literature, a new dataset that enables generating Turkish descriptions from images and can serve as a benchmark for this purpose. Furthermore, using this dataset, which we named TasvirEt, two approaches are proposed, again for the first time in the literature, for image captioning in Turkish. Our findings indicate that the new Turkish dataset and the approaches used here can be used successfully to describe images automatically in Turkish.