Incremental clustering with vector expansion for online event detection in microblogs


Ozdikis O., KARAGÖZ P. , Oguztuzun H.

SOCIAL NETWORK ANALYSIS AND MINING, vol.7, no.1, 2017 (Journal Indexed in ESCI) identifier identifier

  • Publication Type: Article / Article
  • Volume: 7 Issue: 1
  • Publication Date: 2017
  • Doi Number: 10.1007/s13278-017-0476-8
  • Title of Journal : SOCIAL NETWORK ANALYSIS AND MINING
  • Keywords: Online event detection, Clustering, Vector expansion, Statistical text analysis, Microblogs, TWITTER

Abstract

Identifying similarities in microblog posts for event detection poses challenges due to short texts with idiosyncratic spellings, irregular writing styles, abbreviations and synonyms. In order to overcome these challenges, we present an enhancement to the incremental clustering techniques by detecting similar terms in microblog posts in a temporal context. We devise an unsupervised method to measure the similarities online using co-occurrence-based techniques and use them in a vector expansion process. The results of our evaluation performed on a tweet set indicate that the proposed vector expansion method helps identify similarities in tweets despite differences in their content. This facilitates the clustering of tweets and detection of events with higher accuracy without incurring a high execution cost.