Semantic Expansion of Tweet Contents for Enhanced Event Detection in Twitter

Ozdikis O., Senkul P., Oguztuzun H.

IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, İstanbul, Turkey, 26 - 29 August 2012, pp.20-24

  • Publication Type: Conference Paper / Full Text
  • Doi Number: 10.1109/asonam.2012.14
  • City: İstanbul
  • Country: Turkey
  • Page Numbers: pp.20-24
  • Keywords: Event Detection, Clustering, Micro-blogging, Twitter, Tweets in Turkish, Semantics, Word Co-occurrences, TRACKING, TURKISH
  • Middle East Technical University Affiliated: Yes


This paper aims to enhance event detection methods on a micro-blogging platform, namely Twitter. The enhancement technique we propose is based on lexico-semantic expansion of tweet contents while applying document similarity and clustering algorithms. Considering the length limitations and idiosyncratic spelling in the Twitter environment, it is possible to take advantage of word similarities and to enrich texts with similar words. The semantic expansion technique we implement is based on syntagmatic and paradigmatic relationships between words, extracted from their co-occurrence statistics. As our technique does not depend on an existing ontology or a lexicon database such as WordNet, it should be applicable to any language. The proposed technique is applied to a tweet set collected over three days from users in Turkey. The results indicate earlier detection of events and improvements in accuracy.
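
The idea of expanding a tweet with words that behave similarly in co-occurrence statistics can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not state how similarity is measured, so the use of cosine similarity over per-tweet co-occurrence counts, the `threshold` parameter, the function names, and the toy tweets are all assumptions made for illustration. Words that share a tweet are treated as syntagmatically related, and words whose co-occurrence vectors look alike (even if they never appear together) are treated as paradigmatically related.

```python
from collections import Counter, defaultdict
from itertools import combinations
from math import sqrt


def cooccurrence_vectors(tweets):
    """For each word, count how often every other word shares a tweet with it."""
    vectors = defaultdict(Counter)
    for tweet in tweets:
        words = sorted(set(tweet.lower().split()))
        for a, b in combinations(words, 2):
            vectors[a][b] += 1
            vectors[b][a] += 1
    return dict(vectors)


def cosine(u, v):
    """Cosine similarity between two sparse count vectors (assumed measure)."""
    dot = sum(count * v[word] for word, count in u.items() if word in v)
    norm_u = sqrt(sum(c * c for c in u.values()))
    norm_v = sqrt(sum(c * c for c in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def expand(tweet, vectors, threshold=0.8):
    """Append words whose co-occurrence vectors resemble those of the tweet's words."""
    words = tweet.lower().split()
    extra = []
    for word in words:
        own = vectors.get(word)
        if own is None:  # word never seen in the collection: nothing to compare
            continue
        for candidate, vec in vectors.items():
            if candidate in words or candidate in extra:
                continue
            if cosine(own, vec) >= threshold:
                extra.append(candidate)
    return words + extra


# Made-up toy data: "earthquake" and "tremor" never share a tweet,
# but they appear in identical contexts.
tweets = [
    "earthquake izmir shaking",
    "tremor izmir shaking",
    "earthquake ankara felt",
    "tremor ankara felt",
    "match goal stadium",
]
vectors = cooccurrence_vectors(tweets)
expanded = expand("earthquake istanbul", vectors)
print(expanded)  # ['earthquake', 'istanbul', 'tremor']
```

In this toy collection, a new tweet mentioning only "earthquake" is enriched with "tremor", so a document similarity measure applied afterwards would place it closer to tweets that use either word. No ontology or lexicon is consulted, which is the property the abstract relies on for language independence.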