Incremental clustering with vector expansion for online event detection in microblogs


Ozdikis O., KARAGÖZ P., Oguztuzun H.

SOCIAL NETWORK ANALYSIS AND MINING, vol.7, no.1, 2017 (ESCI) identifier identifier

  • Publication Type: Article / Article
  • Volume: 7 Issue: 1
  • Publication Date: 2017
  • Doi Number: 10.1007/s13278-017-0476-8
  • Journal Name: SOCIAL NETWORK ANALYSIS AND MINING
  • Journal Indexes: Emerging Sources Citation Index (ESCI), Scopus
  • Keywords: Online event detection, Clustering, Vector expansion, Statistical text analysis, Microblogs, TWITTER
  • Middle East Technical University Affiliated: Yes

Abstract

Identifying similarities in microblog posts for event detection poses challenges due to short texts with idiosyncratic spellings, irregular writing styles, abbreviations and synonyms. In order to overcome these challenges, we present an enhancement to the incremental clustering techniques by detecting similar terms in microblog posts in a temporal context. We devise an unsupervised method to measure the similarities online using co-occurrence-based techniques and use them in a vector expansion process. The results of our evaluation performed on a tweet set indicate that the proposed vector expansion method helps identify similarities in tweets despite differences in their content. This facilitates the clustering of tweets and detection of events with higher accuracy without incurring a high execution cost.