ARC-NLP at CheckThat! 2022: Contradiction for Harmful Tweet Detection


Toraman Ç., Ozcelik O., Şahinuç F., Sahin U.

2022 Conference and Labs of the Evaluation Forum, CLEF 2022, Bologna, İtalya, 5 - 08 Eylül 2022, cilt.3180, ss.722-739 identifier

  • Yayın Türü: Bildiri / Tam Metin Bildiri
  • Cilt numarası: 3180
  • Basıldığı Şehir: Bologna
  • Basıldığı Ülke: İtalya
  • Sayfa Sayıları: ss.722-739
  • Anahtar Kelimeler: Claim Detection, Contradiction, COVID-19, Harmful Tweet Detection, Language Model, Tweet, Worthiness Checking
  • Orta Doğu Teknik Üniversitesi Adresli: Hayır

Özet

The target task of our team in CLEF2022 CheckThat! Lab challenge is Task-1C, harmful tweet detection. We propose a novel approach, called ARC-NLP-contra, which is a contradiction check approach by using the idea that harmful tweets contradict with the real-life facts in the scope of COVID-19 pandemic. Besides, we propose and examine two other models. The first model, called ARC-NLP-hc, is a traditional approach that utilizes hand-crafted tweet and user features. The second model, called ARC-NLP-pretrain, pretrains a Transformer-based language model by using COVID-related Turkish tweets. We compare the performances of these three models, and submit the highest performing model in the preliminary experiments to the challenge. We make submissions for Task-1A, 1B, 1C in Turkish and Task-1C in English. We have the winning solution for Task-1C, harmful tweet detection in Turkish, using ARC-NLP-contra that is our contradiction check approach.