Discovering better navigation sequences for the session construction problem


Bayir M. A., Toroslu I. H., Demirbas M., COŞAR A.

DATA & KNOWLEDGE ENGINEERING, vol.73, pp.58-72, 2012 (SCI-Expanded)

  • Publication Type: Article / Full Article
  • Volume: 73
  • Publication Date: 2012
  • DOI: 10.1016/j.datak.2011.11.005
  • Journal Name: DATA & KNOWLEDGE ENGINEERING
  • Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus
  • Page Numbers: pp.58-72
  • Keywords: Web mining, Mining methods and algorithms, EFFICIENT ALGORITHM, WEB, FRAMEWORK, RECONSTRUCTION
  • Affiliated with Middle East Technical University: Yes

Abstract

In this paper, we propose a novel page-view-based session model and session construction method to address the Web Usage Mining (WUM) problem. In simple session models, a session is a sequence of web pages requested from the server (or served from a browser/proxy cache) and viewed in the browser, which may not guarantee a direct relationship between subsequent web pages in the session. In contrast, we define a more realistic session model in which a session is a set of paths traversed in the web graph, corresponding to user navigation performed by following links on web pages. We define the session construction process from raw server logs as a new graph problem and present a novel algorithm, Smart-SRA (Smart Session Reconstruction Algorithm), to solve this problem efficiently. An experimental evaluation based on data collected from real web access scenarios showed that Smart-SRA produces more accurate user sessions than the session construction methods found in the literature. © 2011 Elsevier B.V. All rights reserved.
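
To make the idea of a session as a set of link-connected paths more concrete, the following is a minimal illustrative sketch. It is not Smart-SRA itself, whose details are not given in this abstract. It assumes a time-ordered list of page requests for one user and a web graph stored as an adjacency map. The names `reconstruct_paths`, `requests`, and `links` are hypothetical, and the greedy rule used here (attach each page to the most recent earlier page that links to it) is an assumption chosen for simplicity.

```python
from typing import Dict, List, Optional, Set, Tuple


def reconstruct_paths(requests: List[str],
                      links: Dict[str, Set[str]]) -> List[List[str]]:
    """Split a time-ordered request list into link-consistent paths.

    Hypothetical simplification: every consecutive pair of pages in a
    returned path is joined by a hyperlink in `links`.
    """
    paths: List[List[str]] = []
    for page in requests:
        match: Optional[Tuple[List[str], int]] = None
        # Look at the most recently created path first.
        for path in reversed(paths):
            # Scan backwards for the latest page on this path linking to `page`.
            for i in range(len(path) - 1, -1, -1):
                if page in links.get(path[i], set()):
                    match = (path, i)
                    break
            if match is not None:
                break
        if match is None:
            # No earlier page links here: start a new path.
            paths.append([page])
        else:
            path, i = match
            if i == len(path) - 1:
                # The referring page is the tail: extend the path in place.
                path.append(page)
            else:
                # The user went back (e.g. via cache): branch from the prefix.
                paths.append(path[: i + 1] + [page])
    return paths


if __name__ == "__main__":
    # Assumed toy web graph: A links to B and C, B links to D.
    links = {"A": {"B", "C"}, "B": {"D"}}
    # The request for A before C is assumed to be served from cache,
    # so it never reaches the server log.
    requests = ["A", "B", "D", "C"]
    print(reconstruct_paths(requests, links))
```

In this toy input, a purely sequential model would keep the pair D, C adjacent even though no link joins them. The sketch instead yields two paths that both follow links, which is the kind of direct relationship between subsequent pages that the abstract says simple models may not guarantee.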