Cache-Based Query Processing for Search Engines

Cambazoglu, B.; Altingovde, İSMAİL; Ozcan, Rifat; Ulusoy, Ozgur

doi:10.1145/2382616.2382617

Cache-Based Query Processing for Search Engines

Cambazoglu B. B., Altingovde İ. S., Ozcan R., Ulusoy O.

ACM TRANSACTIONS ON THE WEB, cilt.6, sa.4, 2012 (SCI-Expanded, Scopus)

Yayın Türü: Makale / Tam Makale
Cilt numarası: 6 Sayı: 4
Basım Tarihi: 2012
Doi Numarası: 10.1145/2382616.2382617
Dergi Adı: ACM TRANSACTIONS ON THE WEB
Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus
Anahtar Kelimeler: Design, Experimentation, Performance, Web search engine, result caching, result aggregation, query view, system availability, WEB SEARCH
Açık Arşiv Koleksiyonu: AVESİS Açık Erişim Koleksiyonu
Orta Doğu Teknik Üniversitesi Adresli: Evet

Özet

In practice, a search engine may fail to serve a query due to various reasons such as hardware/network failures, excessive query load, lack of matching documents, or service contract limitations (e.g., the query rate limits for third-party users of a search service). In this kind of scenarios, where the backend search system is unable to generate answers to queries, approximate answers can be generated by exploiting the previously computed query results available in the result cache of the search engine. In this work, we propose two alternative strategies to implement this cache-based query processing idea. The first strategy aggregates the results of similar queries that are previously cached in order to create synthetic results for new queries. The second strategy forms an inverted index over the textual information (i.e., query terms and result snippets) present in the result cache and uses this index to answer new queries. Both approaches achieve reasonable result qualities compared to processing queries with an inverted index built on the collection.