Improving the prediction of page access by using semantically enhanced clustering


Sen E., TOROSLU İ. H., KARAGÖZ P.

JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, vol.47, no.1, pp.165-192, 2016 (SCI-Expanded)

  • Publication Type: Article / Full Article
  • Volume: 47 Issue: 1
  • Publication Date: 2016
  • DOI: 10.1007/s10844-016-0398-3
  • Journal Name: JOURNAL OF INTELLIGENT INFORMATION SYSTEMS
  • Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus
  • Pages: pp.165-192
  • Affiliated with Middle East Technical University: Yes

Abstract

Many parameters may affect the navigation behaviour of web users. Predicting the next page that a web user may visit is important, since this information can be used for prefetching or for personalizing the page for that user. One successful method for determining the next web page is to construct behaviour models of users by clustering. The success of clustering is highly correlated with the similarity measure used to compare navigation sequences. This work proposes a new approach for determining the next web page by extending standard clustering with a content-based semantic similarity method. The semantics of web pages are represented as sets of concepts, and thus user sessions are modelled as sequences of sets. As a result, session similarity is defined as an alignment of two sequences of sets. The success of the proposed method is shown by applying it to real-life web log data.
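
The idea of comparing sessions by aligning two sequences of concept sets can be illustrated with a minimal sketch. The abstract does not state which set similarity or alignment scheme the authors use, so everything below is an assumption made for illustration: Jaccard similarity as the per-page score, a Needleman-Wunsch-style global alignment, a hypothetical gap penalty of -0.5, and the hypothetical names `jaccard` and `session_similarity`.

```python
from typing import FrozenSet, List, Sequence

Concepts = FrozenSet[str]  # the concept set describing one page


def jaccard(a: Concepts, b: Concepts) -> float:
    """Set similarity in [0, 1]; an assumed choice, not taken from the paper."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def session_similarity(
    s: Sequence[Concepts], t: Sequence[Concepts], gap: float = -0.5
) -> float:
    """Global alignment score of two sessions (sequences of concept sets).

    Aligned pages contribute their Jaccard similarity; skipping a page in
    either session costs the (assumed) gap penalty.
    """
    n, m = len(s), len(t)
    score: List[List[float]] = [[0.0] * (m + 1) for _ in range(n + 1)]
    # Aligning a prefix against an empty session means skipping every page.
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            score[i][j] = max(
                score[i - 1][j - 1] + jaccard(s[i - 1], t[j - 1]),  # align pages
                score[i - 1][j] + gap,  # skip a page of s
                score[i][j - 1] + gap,  # skip a page of t
            )
    return score[n][m]


# Two hypothetical sessions, each page described by a set of concepts.
session_a = [frozenset({"laptop", "price"}), frozenset({"review"})]
session_b = [frozenset({"laptop"}), frozenset({"review", "rating"})]
print(session_similarity(session_a, session_b))
```

A score of this kind could serve as the similarity measure for clustering sessions; how the published method normalizes or weights the alignment is not specified in the abstract.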