Obtaining the Knowledge of a Server Performance from Non-Intrusively Measurable Metrics

Most network services are provided by server computers. To provide these services with good quality, the server performance must be managed adequately. For the server management, the performance information is commonly obtained from the operating system (OS) and hardware of the managed computer. However, this method has a disadvantage. If the performance is degraded by excessive load or hardware faults, it becomes difficult to collect and transmit information. Thus, it is necessary to obtain the information without interfering with the server’s OS and hardware. This paper investigates a technique that utilizes non-intrusively measureable metrics that are obtained through passive traffic monitoring and electric currents monitored by the sensors attached to the power supply. However, these metrics do not directly represent the performance experienced by users. Hence, it is necessary to discover the complicated function that maps the metrics to the true performance information. To discover this function from the measured samples, a machine learning technique based on a decision tree is examined. The technique is important because it is applicable to the power management of server clusters and the immigration control of virtual servers.

[1]  Kyu-Young Whang,et al.  A linear-time probabilistic counting algorithm for database applications , 1990, TODS.

[2]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[3]  David Mosberger,et al.  httperf—a tool for measuring web server performance , 1998, PERV.

[4]  Enrique V. Carrera,et al.  Load balancing and unbalancing for power and performance in cluster-based systems , 2001 .

[5]  Hyang-Ah Kim,et al.  Counting network flows in real time , 2003, GLOBECOM '03. IEEE Global Telecommunications Conference (IEEE Cat. No.03CH37489).

[6]  Doris K. Cope Press Release: November 4, 2003 , 2004 .

[7]  Zied Elouedi,et al.  Naive Bayes vs decision trees in intrusion detection systems , 2004, SAC '04.

[8]  Raouf Boutaba,et al.  Flow-based Characteristic Analysis of Internet Application Traffic , 2004 .

[9]  S. Zielinski,et al.  International Court of Justice , 2002, International Organization.

[10]  An observation-based approach towards self-managing web servers , 2006, Comput. Commun..

[11]  George Varghese,et al.  Bitmap algorithms for counting active flows on high speed links , 2003, IMC '03.

[12]  Andrew W. Moore,et al.  A Machine Learning Approach for Efficient Traffic Classification , 2007, 2007 15th International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems.

[13]  Kang G. Shin,et al.  Online Web Cluster Capacity Estimation and Its Application to Energy Conservation , 2007, IEEE Transactions on Parallel and Distributed Systems.

[14]  Kiyoshi Kobayashi,et al.  Minimizing False Positives of a Decision Tree Classifier for Intrusion Detection on the Internet , 2008, Journal of Network and Systems Management.

[15]  Stephen R. Marsland,et al.  Machine Learning - An Algorithmic Perspective , 2009, Chapman and Hall / CRC machine learning and pattern recognition series.

[16]  Non-members,et al.  WWW Server Load Balancing Technique Employing Passive Measurement of Server Performance , 2009 .

[17]  Satoru Ohta,et al.  Machine Learning Approach to the Power Management of Server Clusters , 2011, 2011 IEEE 11th International Conference on Computer and Information Technology.

[18]  Jing Xu,et al.  A multi-objective approach to virtual machine management in datacenters , 2011, ICAC '11.

[19]  Shan Zhu,et al.  Real-Time Flow Counting in IP Networks : Strict Analysis and Design Issues , 2012 .

[20]  Satoru Ohta,et al.  Performance Degradation Detection of Virtual Machines Via Passive Measurement and Machine Learning , 2014, Int. J. Adapt. Resilient Auton. Syst..

[21]  Satoru Ohta,et al.  Power Management of Server Clusters via Machine Learning and Passive Traffic Measurement , .