Incremental clustering with vector expansion for online event detection in microblogs


Ozdikis O., KARAGÖZ P., Oguztuzun H.

SOCIAL NETWORK ANALYSIS AND MINING, cilt.7, sa.1, 2017 (ESCI) identifier identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 7 Sayı: 1
  • Basım Tarihi: 2017
  • Doi Numarası: 10.1007/s13278-017-0476-8
  • Dergi Adı: SOCIAL NETWORK ANALYSIS AND MINING
  • Derginin Tarandığı İndeksler: Emerging Sources Citation Index (ESCI), Scopus
  • Anahtar Kelimeler: Online event detection, Clustering, Vector expansion, Statistical text analysis, Microblogs, TWITTER
  • Orta Doğu Teknik Üniversitesi Adresli: Evet

Özet

Identifying similarities in microblog posts for event detection poses challenges due to short texts with idiosyncratic spellings, irregular writing styles, abbreviations and synonyms. In order to overcome these challenges, we present an enhancement to the incremental clustering techniques by detecting similar terms in microblog posts in a temporal context. We devise an unsupervised method to measure the similarities online using co-occurrence-based techniques and use them in a vector expansion process. The results of our evaluation performed on a tweet set indicate that the proposed vector expansion method helps identify similarities in tweets despite differences in their content. This facilitates the clustering of tweets and detection of events with higher accuracy without incurring a high execution cost.