Space Efficient Caching of Query Results in Search Engines


Ozcan R., ALTINGÖVDE İ. S., Ulusoy O.

23rd International Symposium on Computer and Information Sciences (ISCIS), İstanbul, Turkey, 27 - 29 October 2008, pp.558-563

  • Publication Type: Conference Paper / Full Text
  • Doi Number: 10.1109/iscis.2008.4717960
  • City: İstanbul
  • Country: Turkey
  • Page Numbers: pp.558-563

Abstract

Web search engines serve millions of query requests per day. Caching query results is one of the most crucial mechanisms for coping with such a demanding load. In this paper, we propose an efficient storage model for caching the document identifiers of query results. Essentially, we first cluster queries that have common result documents. Next, for each cluster, we attempt to store those common document identifiers in a more compact manner. Experimental results reveal that the proposed storage model achieves a space reduction of up to 4%. The proposed model is envisioned to improve the cache hit rate and system throughput, as it allows more query results to be stored within a given cache space, in exchange for a negligible increase in the cost of preparing the final query result page.
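
The two steps described above (clustering queries that share result documents, then storing the shared identifiers once per cluster) can be illustrated with a minimal Python sketch. It is not the authors' clustering or encoding scheme, which the abstract does not specify. The greedy single-pass clustering, the `min_shared` threshold, the per-query rank lists, and all function names are assumptions made for illustration. It also assumes that document identifiers are integers and that each result list contains no duplicates.

```python
from typing import Dict, List, Optional, Set, Tuple

# query -> (cluster id or None, unshared doc IDs in rank order, rank of each shared doc)
Entry = Tuple[Optional[int], List[int], List[int]]


def build_clusters(results: Dict[str, List[int]], min_shared: int = 2) -> List[dict]:
    """Greedy clustering (an assumption): a query joins the first cluster
    with which it shares at least `min_shared` document IDs."""
    clusters: List[dict] = []
    for query, docs in results.items():
        doc_set: Set[int] = set(docs)
        for cluster in clusters:
            common = cluster["shared"] & doc_set
            if len(common) >= min_shared:
                # Shrink the shared set to what every member still has in common.
                cluster["shared"] = common
                cluster["queries"].append(query)
                break
        else:
            clusters.append({"shared": doc_set, "queries": [query]})
    return clusters


def compress(results: Dict[str, List[int]], min_shared: int = 2):
    """Store each cluster's shared doc IDs once; keep only the rest per query."""
    shared_lists: List[List[int]] = []
    entries: Dict[str, Entry] = {}
    for cluster in build_clusters(results, min_shared):
        if len(cluster["queries"]) < 2:
            # Singleton cluster: nothing to share, keep the plain result list.
            query = cluster["queries"][0]
            entries[query] = (None, list(results[query]), [])
            continue
        shared = sorted(cluster["shared"])
        cluster_id = len(shared_lists)
        shared_lists.append(shared)
        for query in cluster["queries"]:
            docs = results[query]
            rank_of = {doc: rank for rank, doc in enumerate(docs)}
            shared_ranks = [rank_of[doc] for doc in shared]
            own = [doc for doc in docs if doc not in cluster["shared"]]
            entries[query] = (cluster_id, own, shared_ranks)
    return shared_lists, entries


def lookup(query: str, shared_lists: List[List[int]], entries: Dict[str, Entry]) -> List[int]:
    """Rebuild the ranked result list; this is the extra work paid at page-preparation time."""
    cluster_id, own, shared_ranks = entries[query]
    if cluster_id is None:
        return list(own)
    slots: List[Optional[int]] = [None] * (len(own) + len(shared_ranks))
    for doc, rank in zip(shared_lists[cluster_id], shared_ranks):
        slots[rank] = doc
    # Remaining slots are filled with the unshared docs in their original order.
    remaining = iter(own)
    return [doc if doc is not None else next(remaining) for doc in slots]


def stored_ids(shared_lists: List[List[int]], entries: Dict[str, Entry]) -> int:
    """Count stored document IDs only; the per-query rank lists are not counted."""
    return sum(len(s) for s in shared_lists) + sum(len(own) for _, own, _ in entries.values())
```

The sketch counts only stored document identifiers. Whether the scheme saves space in practice depends on how cheaply the per-query ranks can be encoded relative to a full identifier, which is a further assumption here and not something the abstract states.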