Semantic Expansion of Tweet Contents for Enhanced Event Detection in Twitter


Ozdikis O., Senkul P., Oguztuzun H.

IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, İstanbul, Türkiye, 26 - 29 Ağustos 2012, ss.20-24 identifier identifier

  • Yayın Türü: Bildiri / Tam Metin Bildiri
  • Doi Numarası: 10.1109/asonam.2012.14
  • Basıldığı Şehir: İstanbul
  • Basıldığı Ülke: Türkiye
  • Sayfa Sayıları: ss.20-24
  • Anahtar Kelimeler: Event Detection, Clustering, Micro-blogging, Twitter, Tweets in Turkish, Semantics, Word Co-occurrences, TRACKING, TURKISH
  • Orta Doğu Teknik Üniversitesi Adresli: Evet

Özet

This paper aims to enhance event detection methods in a micro-blogging platform, namely Twitter. The enhancement technique we propose is based on lexico-semantic expansion of tweet contents while applying document similarity and clustering algorithms. Considering the length limitations and idiosyncratic spelling in Twitter environment, it is possible to take advantage of word similarities and to enrich texts with similar words. The semantic expansion technique we implement is based on syntagmatic and paradigmatic relationships between words, extracted from their co-occurrence statistics. As our technique does not depend on an existing ontology or a lexicon database such as WordNet, it should be applicable for any language. The proposed technique is applied on a tweet set collected for three days from the users in Turkey. The results indicate earlier detection of events and improvements in accuracy.