A Cost-Aware Strategy for Query Result Caching in Web Search Engines


Creative Commons License

Altingovde İ. S., Ozcan R., Ulusoy O.

31st European Conference on Information Research, Toulouse, France, 6 - 09 April 2009, vol.5478, pp.628-636 identifier identifier

  • Publication Type: Conference Paper / Full Text
  • Volume: 5478
  • Doi Number: 10.1007/978-3-642-00958-7_59
  • City: Toulouse
  • Country: France
  • Page Numbers: pp.628-636
  • Middle East Technical University Affiliated: No

Abstract

Search engines and large scale IR systems need to cache query results for efficiency and scalability purposes. In this study, we propose to explicitly incorporate the query costs in the static caching policy. To this end, a query's cost is represented by its execution time, which involves CPU time to decompress the postings and compute the query-document similarities to obtain the final top-N answers. Simulation results using a large Web crawl data and a real query reveal that the proposed strategy improves overall system performance in terms, of the total query execution time.