Software probes: towards a quick method for machine characterization and application performance prediction

Computers perform different applications in different ways. To characterize an application performance into a machine, the usual method is a throughout execution of it. This work is a step into a synthetic probe able to characterize a master-worker application's performance in a fraction of the time required to run it entirely. This is specially important for CPU-intensive scientific applications, who runs for very long, as it makes sense that it runs as efficiently (and fast) as possible. To know how, and for how long a master-worker application is going to run can guide the decision to use this machine or not. Our software probe takes into account only the performance-relevant parts of the application, discovering a program's relevant phases. Running solely these significant phases is a powerful way to quickly characterize the application's performance on a machine. It can help to select the best computing nodes in a grid or in a multi-cluster to run this application, and even quickly predict the total execution time for this application/data set in the machine analyzed. We also present ongoing work on a fully synthetic probe generated from programs' phases.

[1]  Rajiv Kapoor,et al.  Pinpointing Representative Portions of Large Intel® Itanium® Programs with Dynamic Instrumentation , 2004, 37th International Symposium on Microarchitecture (MICRO-37'04).

[2]  J. Duell The design and implementation of Berkeley Lab's linux checkpoint/restart , 2005 .

[3]  Erich Strohmaier,et al.  Architecture independent performance characterization and benchmarking for scientific applications , 2004, The IEEE Computer Society's 12th Annual International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunications Systems, 2004. (MASCOTS 2004). Proceedings..

[4]  Paula Cecilia Fritzsche Podemos predecir en algoritmos paralelos no deterministas , 2007 .

[5]  E. M. O. Junior,et al.  Performance prediction and tuning in a multi-cluster environment , 2006 .

[6]  Brad Calder,et al.  How to use SimPoint to pick simulation points , 2004, PERV.

[7]  Xiaofeng Gao,et al.  A Performance Prediction Framework for Scientific Applications , 2003, International Conference on Computational Science.

[8]  Basilio B. Fraguela,et al.  Precise automatable analytical modeling of the cache behavior of codes with indirections , 2007, TACO.

[9]  Erich Strohmaier,et al.  Apex-Map: A Synthetic Scalable Benchmark Probe to Explore Data Access Performance on Highly Parallel Systems , 2005, Euro-Par.

[10]  Jesús Labarta,et al.  A Framework for Performance Modeling and Prediction , 2002, ACM/IEEE SC 2002 Conference (SC'02).

[11]  Jason Duell,et al.  The design and implementation of Berkeley Lab's linuxcheckpoint/restart , 2005 .

[12]  Erich Strohmaier,et al.  APEX‐Map: a parameterized scalable memory access probe for high‐performance computing systems , 2007, Concurr. Comput. Pract. Exp..

[13]  Basilio B. Fraguela,et al.  Analytical modeling of codes with arbitrary data-dependent conditional structures , 2006, J. Syst. Archit..

[14]  Robert Kroeger,et al.  A case study in top-down performance estimation for a large-scale parallel application , 2006, PPoPP '06.

[15]  James E. Smith,et al.  Comparing program phase detection techniques , 2003, Proceedings. 36th Annual IEEE/ACM International Symposium on Microarchitecture, 2003. MICRO-36..

[16]  Brad Calder,et al.  Basic block distribution analysis to find periodic behavior and simulation points in applications , 2001, Proceedings 2001 International Conference on Parallel Architectures and Compilation Techniques.

[17]  Juan Touriño,et al.  Automated and accurate cache behavior analysis for codes with irregular access patterns , 2007, Concurr. Comput. Pract. Exp..

[18]  Brad Calder,et al.  Automatically characterizing large scale program behavior , 2002, ASPLOS X.

[19]  Brad Calder,et al.  The Strong correlation Between Code Signatures and Performance , 2005, IEEE International Symposium on Performance Analysis of Systems and Software, 2005. ISPASS 2005..

[20]  Adolfy Hoisie,et al.  Scalability analysis of multidimensional wavefront algorithms on large-scale SMP clusters , 1999, Proceedings. Frontiers '99. Seventh Symposium on the Frontiers of Massively Parallel Computation.

[21]  Brad Calder,et al.  Discovering and Exploiting Program Phases , 2003, IEEE Micro.

[22]  Erich Strohmaier,et al.  APEX-Map: a parameterized scalable memory access probe for high-performance computing systems: Research Articles , 2007 .