A Deep Learning Framework on Generation of Image Descriptions with Bidirectional Recurrent Neural Networks

Thomas J. J., Pillai N.

1st International Conference on Intelligent Computing and Optimization (ICO), Pattaya, Tayland, 4 - 05 Ekim 2018, cilt.866, ss.219-230, (Tam Metin Bildiri)

Yayın Türü: Bildiri / Tam Metin Bildiri
Cilt numarası: 866
Doi Numarası: 10.1007/978-3-030-00979-3_22
Basıldığı Şehir: Pattaya
Basıldığı Ülke: Tayland
Sayfa Sayıları: ss.219-230
Anahtar Kelimeler: Deep learning, Convolutional Neural Network, Data augmentation, Malaysian food chain
Orta Doğu Teknik Üniversitesi Adresli: Hayır

Özet

The aim of the paper is to develop a deep learning framework for a model that generates natural descriptions of pictures (data) and their sections so as to search out a lot of insights. Image recognition is one in all the promising applications of visual objects. In this study, a small-scale food image data set consisting of 5115 pictures of fourteen classes and an eight-layer CNN was made to acknowledge these pictures. CNN performed far better with associate degree overall accuracy 54%. The approach influences information sets of images and their patterns bi directional recurrent neural network (BRNN) will concerning the intern-model correspondences between prediction and visual information for calorie estimation. Data expansion techniques were applied to extend the dimensions of trained images, that achieved a considerably improved accuracy of 74% stop the over fitting issue that occurred to the CNN while not misclassified.