Finding Topic Trends in Digital Libraries


Creative Commons License

Bolelli L., Ertekin Ş., Zhou D., Giles C. L.

9th Annual International ACM/IEEE Joint Conference on Digital Libraries, Texas, United States, 15 - 19 June 2009, pp.69-72

  • Publication Type: Conference Paper / Full-Text Paper
  • DOI Number: 10.1145/1555400.1555411
  • City of Publication: Texas
  • Country of Publication: United States
  • Page Numbers: pp.69-72
  • Affiliated with Middle East Technical University: No

Abstract

We propose a generative model based on latent Dirichlet allocation for mining distinct topics in document collections by integrating the temporal ordering of documents into the generative process. The document collection is divided into time segments, and the topics discovered in each segment are propagated to influence topic discovery in the subsequent time segments. We conduct experiments on the collection of academic papers from the CiteSeer repository. We augment the text corpus with user queries and tags, and integrate the citation graph to boost the weight of the topical terms. The experimental results show that the segmented topic model can effectively detect distinct topics and their evolution over time.
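
The segment-to-segment propagation described in the abstract can be illustrated with a small sketch. The abstract does not give the paper's actual update rule, so everything here is an assumption made for illustration: the helper names `segment_by_year` and `propagate_prior`, the parameters `span`, `base_beta`, and `weight`, and the choice to turn the previous segment's topic-word counts into Dirichlet pseudo-counts for the next segment.

```python
from collections import defaultdict


def segment_by_year(docs, span):
    """Group (year, text) pairs into consecutive time segments.

    Hypothetical helper: each segment covers `span` years, counted from
    the earliest year in the collection. Segments that contain no
    documents are skipped.
    """
    if not docs:
        return []
    start = min(year for year, _ in docs)
    buckets = defaultdict(list)
    for year, text in docs:
        buckets[(year - start) // span].append(text)
    return [buckets[index] for index in sorted(buckets)]


def propagate_prior(prev_counts, base_beta=0.01, weight=0.5):
    """Build topic-word Dirichlet pseudo-counts for the next segment.

    Assumed scheme, not taken from the paper: each topic's prior is a
    flat `base_beta` plus `weight` times the word distribution that the
    topic had in the previous segment. A topic with no counts falls
    back to the flat prior.
    """
    prior = []
    for topic_counts in prev_counts:
        total = sum(topic_counts)
        row = []
        for count in topic_counts:
            share = count / total if total > 0 else 0.0
            row.append(base_beta + weight * share)
        prior.append(row)
    return prior


# Toy usage: two topics over a three-word vocabulary.
segments = segment_by_year(
    [(2001, "a"), (2003, "b"), (2002, "c"), (2006, "d")], span=2
)
prior = propagate_prior([[3, 1, 0], [0, 0, 0]], base_beta=0.1, weight=1.0)
```

In a full pipeline, the matrix returned by `propagate_prior` would replace the symmetric topic-word prior when fitting latent Dirichlet allocation on the next segment, so that words that were prominent in a topic earlier start with more prior mass. A larger `weight` ties topics more closely to their earlier form, while a smaller one lets them drift.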