论文信息 - A Fault-Tolerant Cache Service for Web Search Engines

A Fault-Tolerant Cache Service for Web Search Engines

Large Web search engines are constructed as a collection of services that are deployed on dedicated clusters of distributed-memory processors. In particular, efficient user query throughput heavily relies on using result cache services devoted to maintaining the answers to most frequent queries. Load balancing and fault tolerance are critical to this service. This paper proposes the design of a result cache service based on consistent hashing and a strategy for enabling fault tolerance. Performance evaluation is performed by using actual queries from a commercial search engine. The results show that the proposed cache service outperforms baseline approaches, decreases the average query response time, increases query throughput and efficiently recovers performance after processor failures.

Emilio Luque | Dolores Rexachs | Mauricio Marin | Veronica Gil-Costa | Carlos Gómez-Pantoja

[1] Robert Morris,et al. Chord: A scalable peer-to-peer lookup service for internet applications , 2001, SIGCOMM 2001.

[2] Werner Vogels,et al. Dynamo: amazon's highly available key-value store , 2007, SOSP.

[3] Costin Raiciu,et al. ROAR: increasing the flexibility and performance of distributed search , 2009, SIGCOMM '09.

[4] Aristides Gionis,et al. The impact of caching on search engines , 2007, SIGIR.

[5] Yasushi Saito,et al. Optimistic replication , 2005, CSUR.

[6] Anthony P. Reeves,et al. Strategies for Dynamic Load Balancing on Highly Parallel Computers , 1993, IEEE Trans. Parallel Distributed Syst..

[7] Veronica Gil Costa,et al. New caching techniques for web search engines , 2010, HPDC '10.

[8] Fabrizio Silvestri,et al. Boosting the performance of Web search engines: Caching and prefetching query results by exploiting historical usage data , 2006, TOIS.

[9] Emilio Luque,et al. Providing Non-stop Service for Message-Passing Based Parallel Applications with RADIC , 2008, Euro-Par.

[10] Brad Fitzpatrick,et al. Distributed caching with memcached , 2004 .

[11] Torsten Suel,et al. Improved techniques for result caching in web search engines , 2009, WWW '09.

[12] Veronica Gil Costa,et al. Location cache for web queries , 2009, CIKM.