31st European Conference on Information Research, Toulouse, France, 6 - 09 April 2009, vol.5478, pp.628-636
Search engines and large scale IR systems need to cache query results for efficiency and scalability purposes. In this study, we propose to explicitly incorporate the query costs in the static caching policy. To this end, a query's cost is represented by its execution time, which involves CPU time to decompress the postings and compute the query-document similarities to obtain the final top-N answers. Simulation results using a large Web crawl data and a real query reveal that the proposed strategy improves overall system performance in terms, of the total query execution time.