Second Chance: A Hybrid Approach for Dynamic Result Caching and Prefetching in Search Engines

Ozcan, Rifat; ALTINGÖVDE, İSMAİL; Barla Cambazoglu, B.; ULUSOY, ÖZGÜR

doi:10.1145/2536777

Second Chance: A Hybrid Approach for Dynamic Result Caching and Prefetching in Search Engines

Atıf İçin Kopyala

Ozcan R., ALTINGÖVDE İ. S., Barla Cambazoglu B., ULUSOY Ö.

ACM TRANSACTIONS ON THE WEB, cilt.8, sa.1, 2013 (SCI-Expanded)

Yayın Türü: Makale / Tam Makale
Cilt numarası: 8 Sayı: 1
Basım Tarihi: 2013
Doi Numarası: 10.1145/2536777
Dergi Adı: ACM TRANSACTIONS ON THE WEB
Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus
Anahtar Kelimeler: Algorithms, Experimentation, Performance, Web search engines, dynamic result caching, result prefetching
Orta Doğu Teknik Üniversitesi Adresli: Evet

Özet

Web search engines are known to cache the results of previously issued queries. The stored results typically contain the document summaries and some data that is used to construct the final search result page returned to the user. An alternative strategy is to store in the cache only the result document IDs, which take much less space, allowing results of more queries to be cached. These two strategies lead to an interesting trade-off between the hit rate and the average query response latency. In this work, in order to exploit this trade-off, we propose a hybrid result caching strategy where a dynamic result cache is split into two sections: an HTML cache and a docID cache. Moreover, using a realistic cost model, we evaluate the performance of different result prefetching strategies for the proposed hybrid cache and the baseline HTML-only cache. Finally, we propose a machine learning approach to predict singleton queries, which occur only once in the query stream. We show that when the proposed hybrid result caching strategy is coupled with the singleton query predictor, the hit rate is further improved.