Zero and Few-shot Hate Speech Detection in Social Media Messages Related to Earthquake Disaster

Sahin U., Kucukkaya I. E., Ozcelik O., Toraman Ç.

31st IEEE Conference on Signal Processing and Communications Applications (SIU), İstanbul, Türkiye, 5 - 08 Temmuz 2023, (Tam Metin Bildiri)

Yayın Türü: Bildiri / Tam Metin Bildiri
Doi Numarası: 10.1109/siu59756.2023.10224056
Basıldığı Şehir: İstanbul
Basıldığı Ülke: Türkiye
Orta Doğu Teknik Üniversitesi Adresli: Hayır

Özet

Microblog platforms provide active communication channels during crisis situations such as earthquakes. Data from these platforms plays an important role in alleviating such crises. In times of crisis, it is important to develop quick and reliable methods. In this study, we perform hate speech detection in earthquake-related social media messages using zero-shot and few-shot methods that work with minimal labeled data. We collect Turkish messages shared on Twitter from the 2023 Turkey-Syria earthquake and label them according to their hate speech content. First, we evaluate the zero-shot performance of the proposed models that are trained on a different hate speech dataset on the newly created earthquake dataset. Then, using the newly labeled data, we fine-tune the models with few-shot learning, and evaluate their performance on the same dataset. The results show that transformer-based language models perform better than traditional models when there is not enough labeled data, and the few-shot method outperforms the zero-shot method. Additionally, we examine the effect of varying the size of the training data used for few-shot learning.