Topic and Trend Detection in Text Collections Using Latent Dirichlet Allocation



Bolelli L., Ertekin Ş., Giles C. L.

31st European Conference on Information Research, Toulouse, France, 6-9 April 2009, vol. 5478, pp. 776-777

  • Publication Type: Conference Paper / Full-Text Paper
  • Volume: 5478
  • DOI: 10.1007/978-3-642-00958-7_84
  • City of Publication: Toulouse
  • Country of Publication: France
  • Pages: pp. 776-777
  • Affiliated with Middle East Technical University: No

Abstract

Algorithms that automatically mine distinct topics in document collections have become increasingly important due to their applications in many fields and the extensive growth in the number of documents in various domains. In this paper, we propose a generative model based on latent Dirichlet allocation that integrates the temporal ordering of the documents into the generative process in an iterative fashion. The document collection is divided into time segments, where the topics discovered in each segment are propagated to influence topic discovery in the subsequent time segments. Our experimental results on a collection of academic papers from the CiteSeer repository show that the segmented topic model can effectively detect distinct topics and their evolution over time.
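
The abstract does not say how topics are propagated between segments, so the following is only a minimal sketch of one plausible reading, not the authors' model. It assumes that the topic-word counts learned in one time segment are scaled and added to the Dirichlet prior over words for the next segment, and it fits each segment with a plain collapsed Gibbs sampler. The function names (`gibbs_lda`, `segmented_lda`) and the parameters `alpha`, `beta`, and `carry` are hypothetical and chosen for illustration.

```python
import random


def gibbs_lda(docs, vocab_size, n_topics, prior, alpha=0.1, n_iter=50, seed=0):
    """Collapsed Gibbs sampling for LDA with a per-topic, per-word prior.

    docs  : list of documents, each a list of integer word ids
    prior : n_topics x vocab_size matrix of positive Dirichlet pseudo-counts
    Returns the n_topics x vocab_size matrix of topic-word assignment counts.
    """
    rng = random.Random(seed)
    topic_word = [[0] * vocab_size for _ in range(n_topics)]
    topic_total = [0] * n_topics
    doc_topic = [[0] * n_topics for _ in docs]
    prior_total = [sum(row) for row in prior]

    # Give every token a random initial topic and record the counts.
    assign = []
    for d, doc in enumerate(docs):
        zs = []
        for w in doc:
            z = rng.randrange(n_topics)
            zs.append(z)
            topic_word[z][w] += 1
            topic_total[z] += 1
            doc_topic[d][z] += 1
        assign.append(zs)

    for _ in range(n_iter):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                # Remove the token's current assignment from the counts.
                z = assign[d][i]
                topic_word[z][w] -= 1
                topic_total[z] -= 1
                doc_topic[d][z] -= 1
                # Unnormalised conditional probability of each topic.
                weights = [
                    (doc_topic[d][k] + alpha)
                    * (topic_word[k][w] + prior[k][w])
                    / (topic_total[k] + prior_total[k])
                    for k in range(n_topics)
                ]
                # Resample the topic and restore the counts.
                z = rng.choices(range(n_topics), weights=weights)[0]
                assign[d][i] = z
                topic_word[z][w] += 1
                topic_total[z] += 1
                doc_topic[d][z] += 1
    return topic_word


def segmented_lda(segments, vocab_size, n_topics, beta=0.01, carry=0.5):
    """Fit LDA segment by segment in temporal order.

    Assumption (not from the paper): the counts of segment t, scaled by
    `carry`, are added to the symmetric prior `beta` used for segment t+1.
    Returns one topic-word count matrix per segment.
    """
    prior = [[beta] * vocab_size for _ in range(n_topics)]
    history = []
    for seg_index, docs in enumerate(segments):
        counts = gibbs_lda(docs, vocab_size, n_topics, prior, seed=seg_index)
        history.append(counts)
        # Propagate this segment's topics into the next segment's prior.
        prior = [[beta + carry * c for c in row] for row in counts]
    return history
```

Because topic k in one segment seeds the prior of topic k in the next, topic indices stay aligned across segments under this assumption, so comparing row k of consecutive count matrices gives a rough view of how that topic's vocabulary shifts over time. A larger `carry` ties each segment more strongly to its predecessor, while `carry=0` fits every segment independently.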