Queuing Theory-based Latency/Power Tradeoff Models for Replicated Search Engines

Large-scale search engines are built upon huge infrastructures involving thousands of computers in order to achieve fast response times. In contrast, the energy consumed (and hence the financial cost) is also high, leading to environmental damage. This paper proposes new approaches to increase energy and financial savings in large-scale search engines, while maintaining good query response times. We aim to improve current state-of-the-art models used for balancing power and latency, by integrating new advanced features. On one hand, we propose to improve the power savings by completely powering down the query servers that are not necessary when the load of the system is low. Besides, we consider energy rates into the model formulation. On the other hand, we focus on how to accurately estimate the latency of the whole system by means of Queueing Theory. Experiments using actual query logs attest the high energy (and financial) savings regarding current baselines. To the best of our knowledge, this is the first paper in successfully applying stationary Queueing Theory models to estimate the latency in a large-scale search engine.

[1]  Carl M. Harris,et al.  Fundamentals of queueing theory (2nd ed.). , 1985 .

[2]  Fabrizio Silvestri,et al.  Mining Query Logs: Turning Search Usage Data into Knowledge , 2010, Found. Trends Inf. Retr..

[3]  Susan T. Dumais,et al.  Modeling and predicting behavioral dynamics on the web , 2012, WWW.

[4]  D. Kendall Stochastic Processes Occurring in the Theory of Queues and their Analysis by the Method of the Imbedded Markov Chain , 1953 .

[5]  Bruno Sinopoli,et al.  Reducing data center energy consumption via coordinated cooling and load management , 2008, CLUSTER 2008.

[6]  Salim Hariri,et al.  Autonomic power and performance management for computing systems , 2006, 2006 IEEE International Conference on Autonomic Computing.

[7]  Iadh Ounis,et al.  Towards green information retrieval: studying the suitability of queueing theory in reducing power consumption , 2014 .

[8]  Thomas F. Wenisch,et al.  Power management of online data-intensive services , 2011, 2011 38th Annual International Symposium on Computer Architecture (ISCA).

[9]  I. Adan,et al.  QUEUEING THEORY , 1978 .

[10]  Xue Liu,et al.  Challenges Towards Elastic Power Management in Internet Data Centers , 2009, 2009 29th IEEE International Conference on Distributed Computing Systems Workshops.

[11]  Gobinda G. Chowdhury,et al.  An agenda for green information retrieval research , 2012, Inf. Process. Manag..

[12]  R. Syski,et al.  Fundamentals of Queueing Theory , 1999, Technometrics.

[13]  Raik Stolletz,et al.  Approximation of the non-stationary M(t)/M(t)/c(t)-queue using stationary queueing models: The stationary backlog-carryover approach , 2008, Eur. J. Oper. Res..

[14]  Christoforos E. Kozyrakis,et al.  Automatic power management schemes for Internet servers and data centers , 2005, GLOBECOM '05. IEEE Global Telecommunications Conference, 2005..

[15]  Ben He,et al.  Terrier : A High Performance and Scalable Information Retrieval Platform , 2022 .

[16]  Vittaldas V. Prabhu,et al.  Modeling Green Fabs - A Queuing Theory Approach for Evaluating Energy Performance , 2012, APMS.

[17]  Luiz André Barroso,et al.  The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines , 2009, The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines.

[18]  Craig MacDonald,et al.  A self-adapting latency/power tradeoff model for replicated search engines , 2014, WSDM.

[19]  Lidan Wang,et al.  Learning to efficiently rank , 2010, SIGIR.

[20]  Özgür Ulusoy,et al.  A financial cost metric for result caching , 2013, SIGIR.

[21]  Roi Blanco,et al.  Energy-price-driven query processing in multi-center web search engines , 2011, SIGIR '11.

[22]  Mor Harchol-Balter,et al.  How data center size impacts the effectiveness of dynamic power management , 2011, 2011 49th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[23]  Christos Kozyrakis,et al.  Full-System Power Analysis and Modeling for Server Environments , 2006 .