Cache-Based Query Processing for Search Engines


Creative Commons License

Cambazoglu B. B., Altingovde İ. S., Ozcan R., Ulusoy O.

ACM TRANSACTIONS ON THE WEB, cilt.6, sa.4, 2012 (SCI-Expanded) identifier identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 6 Sayı: 4
  • Basım Tarihi: 2012
  • Doi Numarası: 10.1145/2382616.2382617
  • Dergi Adı: ACM TRANSACTIONS ON THE WEB
  • Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus
  • Anahtar Kelimeler: Design, Experimentation, Performance, Web search engine, result caching, result aggregation, query view, system availability, WEB SEARCH
  • Orta Doğu Teknik Üniversitesi Adresli: Evet

Özet

In practice, a search engine may fail to serve a query due to various reasons such as hardware/network failures, excessive query load, lack of matching documents, or service contract limitations (e.g., the query rate limits for third-party users of a search service). In this kind of scenarios, where the backend search system is unable to generate answers to queries, approximate answers can be generated by exploiting the previously computed query results available in the result cache of the search engine. In this work, we propose two alternative strategies to implement this cache-based query processing idea. The first strategy aggregates the results of similar queries that are previously cached in order to create synthetic results for new queries. The second strategy forms an inverted index over the textual information (i.e., query terms and result snippets) present in the result cache and uses this index to answer new queries. Both approaches achieve reasonable result qualities compared to processing queries with an inverted index built on the collection.