A Comparative Text Classification Study with Deep Learning-Based Algorithms


Koksal O., Akgul O.

9th International Conference on Electrical and Electronics Engineering (ICEEE), Alanya, Türkiye, 29-31 March 2022, pp. 387-391

  • Publication Type: Conference Paper / Full-Text Paper
  • DOI: 10.1109/iceee55327.2022.9772587
  • City of Publication: Alanya
  • Country of Publication: Türkiye
  • Pages: pp. 387-391
  • Keywords: text classification, deep learning, convolutional neural network, recurrent neural network, LSTM, GRU
  • Affiliated with Middle East Technical University: Yes

Abstract

As a well-known Natural Language Processing (NLP) task, text classification can be defined as the process of categorizing documents according to their content. In this process, selecting classification algorithms and tuning their parameters are crucial for efficient classification. In recent years, many deep learning algorithms have been used successfully in text classification tasks. This paper presents a comparative study that applies and optimizes several deep learning-based algorithms. We implemented deep neural networks (DNN), convolutional neural networks (CNN), long short-term memory (LSTM) networks, and gated recurrent units (GRU), and performed extensive hyperparameter-tuning experiments to improve classification accuracy. In addition, we used word embedding techniques to obtain feature vectors from the text data, and compared the word embedding results with traditional TF-IDF vectorization results for DNN and CNN. In our experiments, we used an open-source Turkish news benchmark dataset so that our results could be compared with previous studies in the literature. The experimental results revealed significant improvements in classification performance from using word embeddings with deep learning-based algorithms and from tuning hyperparameters. Furthermore, our work outperformed previously reported results on the selected dataset.
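
For readers unfamiliar with the TF-IDF baseline mentioned in the abstract, the following is a minimal sketch of TF-IDF vectorization in plain Python. It is not the authors' implementation: the smoothed IDF formula, the function name `tfidf_vectors`, and the three toy documents are all assumptions chosen for illustration, and the abstract does not state which TF-IDF variant or tokenizer was used.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return (vocabulary, vectors) for a list of tokenized documents.

    Hypothetical helper for illustration; the weighting scheme is an assumption.
    """
    vocab = sorted({tok for doc in docs for tok in doc})
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(tok for doc in docs for tok in set(doc))
    # Smoothed inverse document frequency (one common variant, assumed here).
    idf = {t: math.log((1 + n_docs) / (1 + df[t])) + 1.0 for t in vocab}
    vectors = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc) or 1  # avoid division by zero for empty documents
        # Term frequency (relative count) times IDF, one entry per vocabulary term.
        vectors.append([counts[t] / total * idf[t] for t in vocab])
    return vocab, vectors


# Toy, made-up documents standing in for tokenized news articles.
docs = [
    "ekonomi piyasa dolar".split(),
    "spor futbol gol".split(),
    "ekonomi futbol".split(),
]
vocab, vectors = tfidf_vectors(docs)
```

Each document becomes a sparse, vocabulary-sized vector in which terms that occur in fewer documents receive higher weights. A word embedding approach instead maps each token to a dense, learned vector, which is the representation the abstract reports as performing better with the deep learning models.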