An entropy-based algorithm for data elimination in time-driven software instrumentation

While monitoring, instrumented long running parallel applications generate huge amount of instrumentation data. Processing and storing this data incurs overhead, and perturbs the execution. A technique that eliminates unnecessary instrumentation data and lowers the intrusion without loosing any performance information is valuable for tool developers. This paper presents a new algorithm for software instrumentation to measure the amount of information content of instrumentation data to be collected. The algorithm is based on entropy concept introduced in information theory, and it makes selective data collection for a time-driven software monitoring system possible.

[1]  D. S. Jones,et al.  Elementary information theory , 1979 .

[2]  Richard T. Snodgrass,et al.  A relational approach to monitoring complex systems , 1988, TOCS.

[3]  Claude E. Shannon,et al.  The Mathematical Theory of Communication , 1950 .

[4]  Geoffrey C. Fox,et al.  Matrix algorithms on a hypercube I: Matrix multiplication , 1987, Parallel Comput..

[5]  James R. Larus,et al.  Abstract execution: A technique for efficiently tracing programs , 1990, Softw. Pract. Exp..

[6]  Bernd Mohr,et al.  Distributed Performance Monitoring: Methods, Tools, and Applications , 1994, IEEE Trans. Parallel Distributed Syst..

[7]  Jae S. Lim,et al.  Two-Dimensional Signal and Image Processing , 1989 .

[8]  Barton P. Miller,et al.  IPS-2: The Second Generation of a Parallel Program Measurement System , 1990, IEEE Trans. Parallel Distributed Syst..

[9]  Laxmikant V. Kalé,et al.  Performance and modularity benefits of message-driven execution , 2004, J. Parallel Distributed Comput..

[10]  Martin Cuma,et al.  A general framework to understand parallel performance in heterogeneous clusters: analysis of a new adaptive parallel genetic algorithm , 2005, J. Parallel Distributed Comput..

[11]  D.A. Reed,et al.  An Integrated Compilation and Performance Analysis Environment for Data Parallel Programs , 1995, Proceedings of the IEEE/ACM SC95 Conference.

[12]  J. C. Yan,et al.  Performance tuning with AIMS/spl minus/an Automated Instrumentation and Monitoring System for multicomputers , 1994, 1994 Proceedings of the Twenty-Seventh Hawaii International Conference on System Sciences.

[13]  James R. Larus,et al.  Rewriting executable files to measure program behavior , 1994, Softw. Pract. Exp..