Space Efficient Caching of Query Results in Search Engines


Ozcan R., ALTINGÖVDE İ. S. , Ulusoy O.

23rd International Symposium on Computer and Information Sciences (ISCIS), İstanbul, Türkiye, 27 - 29 Ekim 2008, ss.558-563 identifier identifier

  • Cilt numarası:
  • Doi Numarası: 10.1109/iscis.2008.4717960
  • Basıldığı Şehir: İstanbul
  • Basıldığı Ülke: Türkiye
  • Sayfa Sayıları: ss.558-563

Özet

Web search engines serve millions of query requests per day. Caching query results is one of the most crucial mechanisms to cope with such a demanding load. In this paper, we propose an efficient storage model to cache document identifiers of query results. Essentially, we first cluster queries that have common result documents. Next, for each cluster, we attempt to store those common document identifiers in a more compact manner. Experimental results reveal that the proposed storage model achieves space reduction of up to 4%. The proposed model is envisioned to improve the cache hit rate and system throughput as it allows storing more query results within a particular cache space, in return to a negligible increase in the cost of preparing the final query result page.