Cache-Based Query Processing for Search Engines


Creative Commons License

Cambazoglu B. B. , Altingovde İ. S. , Ozcan R., Ulusoy O.

ACM TRANSACTIONS ON THE WEB, cilt.6, 2012 (SCI İndekslerine Giren Dergi) identifier identifier

  • Cilt numarası: 6 Konu: 4
  • Basım Tarihi: 2012
  • Doi Numarası: 10.1145/2382616.2382617
  • Dergi Adı: ACM TRANSACTIONS ON THE WEB

Özet

In practice, a search engine may fail to serve a query due to various reasons such as hardware/network failures, excessive query load, lack of matching documents, or service contract limitations (e.g., the query rate limits for third-party users of a search service). In this kind of scenarios, where the backend search system is unable to generate answers to queries, approximate answers can be generated by exploiting the previously computed query results available in the result cache of the search engine. In this work, we propose two alternative strategies to implement this cache-based query processing idea. The first strategy aggregates the results of similar queries that are previously cached in order to create synthetic results for new queries. The second strategy forms an inverted index over the textual information (i.e., query terms and result snippets) present in the result cache and uses this index to answer new queries. Both approaches achieve reasonable result qualities compared to processing queries with an inverted index built on the collection.