ARC-NLP at CASE 2022 Task 1: Ensemble Learning for Multilingual Protest Event Detection


Sahin U., Ozcelik O., Kucukkaya I. E., Toraman Ç.

5th Workshop on Challenges and Applications of Automated Extraction of Socio-Political Events from Text, CASE 2022, Abu Dhabi, United Arab Emirates, 7-8 December 2022, pp. 175-183

  • Publication Type: Conference Paper / Full Text
  • City of Publication: Abu Dhabi
  • Country of Publication: United Arab Emirates
  • Pages: pp. 175-183
  • Middle East Technical University Affiliated: No

Abstract

Automated socio-political protest event detection is a challenging task when multiple languages are considered. In CASE 2022 Task 1, we propose ensemble learning methods for multilingual protest event detection in four subtasks with different granularity levels, from document level to entity level. We develop an ensemble of fine-tuned Transformer-based language models, along with a post-processing step to regularize the predictions of our ensembles. Our approach ranks first in 6 out of 16 leaderboards organized in seven languages including English, Mandarin, and Turkish.
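
As an illustration of the general idea of ensembling classifier outputs, the following is a minimal sketch of soft voting, in which class probabilities from several models are averaged before a label is chosen. The abstract does not state which combination rule the authors used, so soft voting here is an assumption, and the function name `soft_vote`, the three model outputs, and all probability values are hypothetical. The post-processing step mentioned in the abstract is not described there and is not modeled.

```python
from typing import List, Sequence


def soft_vote(prob_sets: Sequence[Sequence[Sequence[float]]]) -> List[int]:
    """Average per-class probabilities across models; return the argmax label per document.

    prob_sets[m][d][c] is the probability that (hypothetical) model m assigns
    to class c for document d. All models must cover the same documents and classes.
    """
    n_models = len(prob_sets)
    n_docs = len(prob_sets[0])
    labels = []
    for d in range(n_docs):
        n_classes = len(prob_sets[0][d])
        # Mean probability of each class over all models for this document.
        avg = [
            sum(prob_sets[m][d][c] for m in range(n_models)) / n_models
            for c in range(n_classes)
        ]
        # Pick the class with the highest averaged probability.
        labels.append(max(range(n_classes), key=avg.__getitem__))
    return labels


# Hypothetical outputs of three fine-tuned models on two documents,
# with classes [0 = no protest event, 1 = protest event].
model_a = [[0.90, 0.10], [0.40, 0.60]]
model_b = [[0.60, 0.40], [0.45, 0.55]]
model_c = [[0.30, 0.70], [0.70, 0.30]]

predictions = soft_vote([model_a, model_b, model_c])
print(predictions)
```

In this toy input, two of the three models favor class 1 for the second document, yet the averaged probabilities favor class 0 because the third model is more confident. This is the usual difference between soft voting and simple majority voting, and either could serve as the combination rule in an ensemble of this kind.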