PEMOGEN: Automatic adaptive performance modeling during program runtime
暂无分享,去创建一个
[1] Jean-Jacques Fuchs,et al. On sparse representations in arbitrary redundant bases , 2004, IEEE Transactions on Information Theory.
[2] David L Donoho,et al. Compressed sensing , 2006, IEEE Transactions on Information Theory.
[3] Michael Stumm,et al. Online performance analysis by statistical sampling of microprocessor performance counters , 2005, ICS '05.
[4] Allen D. Malony,et al. The Tau Parallel Performance System , 2006, Int. J. High Perform. Comput. Appl..
[5] Michael Laurenzano,et al. PSINS: An Open Source Event Tracer and Execution Simulator , 2009, 2009 DoD High Performance Computing Modernization Program Users Group Conference.
[6] Torsten Hoefler. Bridging Performance Analysis Tools and Analytic Performance Modeling for HPC , 2010, Euro-Par Workshops.
[7] R. Tibshirani. Regression Shrinkage and Selection via the Lasso , 1996 .
[8] Torsten Hoefler,et al. Performance modeling for systematic performance tuning , 2011, 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC).
[9] References , 1971 .
[10] Jeffrey S. Vetter,et al. Scalable Analysis Techniques for Microprocessor Performance Counter Metrics , 2002, ACM/IEEE SC 2002 Conference (SC'02).
[11] Juan Gonzalez,et al. On-line detection of large-scale parallel application's structure , 2010, 2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS).
[12] Torsten Hoefler,et al. Using automated performance modeling to find scalability bugs in complex codes , 2013, 2013 SC - International Conference for High Performance Computing, Networking, Storage and Analysis (SC).
[13] S. R. Searle. Linear Models , 1971 .
[14] David H. Bailey,et al. The Nas Parallel Benchmarks , 1991, Int. J. High Perform. Comput. Appl..
[15] Alex Ramírez,et al. On the memory system requirements of future scientific applications: Four case-studies , 2011, 2011 IEEE International Symposium on Workload Characterization (IISWC).
[16] A. Lumsdaine,et al. LogGOPSim: simulating large-scale applications in the LogGOPS model , 2010, HPDC '10.
[17] Allen D. Malony,et al. Capturing performance knowledge for automated analysis , 2008, HiPC 2008.
[18] Vikram S. Adve,et al. LLVM: a compilation framework for lifelong program analysis & transformation , 2004, International Symposium on Code Generation and Optimization, 2004. CGO 2004..
[19] Y. Dodge. on Statistical data analysis based on the L1-norm and related methods , 1987 .
[20] Wenguang Chen,et al. PHANTOM: predicting performance of parallel applications on large-scale parallel machines using a single node , 2010, PPoPP '10.
[21] M. R. Osborne. An effective method for computing regression quantiles , 1992 .
[22] Sally A. McKee,et al. An Approach to Performance Prediction for Parallel Applications , 2005, Euro-Par.
[23] D. Sengupta. Linear models , 2003 .
[24] A. Ng. Feature selection, L1 vs. L2 regularization, and rotational invariance , 2004, Twenty-first international conference on Machine learning - ICML '04.
[25] Sally A. McKee,et al. Methods of inference and learning for performance modeling of parallel applications , 2007, PPoPP.
[26] P. Tseng,et al. Statistical Data Analysis Based on the L1-Norm and Related Methods , 2002 .
[27] Wenguang Chen,et al. OpenUH: an optimizing, portable OpenMP compiler , 2007, Concurr. Comput. Pract. Exp..
[28] Lars Koesterke,et al. PerfExpert: An Easy-to-Use Performance Diagnosis Tool for HPC Applications , 2010, 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis.
[29] Torsten Hoefler,et al. Characterizing the Influence of System Noise on Large-Scale Applications by Simulation , 2010, 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis.
[30] Frank Mueller,et al. ScalaExtrap: trace-based communication extrapolation for spmd programs , 2011, PPoPP '11.
[31] Juan Gonzalez,et al. Automatic detection of parallel applications computation phases , 2009, 2009 IEEE International Symposium on Parallel & Distributed Processing.
[32] Laxmikant V. Kalé,et al. Simulating Large Scale Parallel Applications Using Statistical Models for Sequential Execution Blocks , 2010, 2010 IEEE 16th International Conference on Parallel and Distributed Systems.
[33] Laurent El Ghaoui,et al. An Homotopy Algorithm for the Lasso with Online Observations , 2008, NIPS.
[34] Alfred V. Aho,et al. The Design and Analysis of Computer Algorithms , 1974 .
[35] Torsten Hoefler,et al. Performance Modeling and Comparative Analysis of the MILC Lattice QCD Application su3_rmd , 2012, 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012).
[36] Isabelle Guyon,et al. An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..
[37] Dmitry M. Malioutov,et al. Homotopy continuation for sparse signal representation , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..
[38] Huan Liu. Feature Selection , 2010, Encyclopedia of Machine Learning.
[39] A. E. Hoerl,et al. Ridge regression: biased estimation for nonorthogonal problems , 2000 .
[40] Trevor Hastie,et al. Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.
[41] Bernd Mohr,et al. Usage of the SCALASCA toolset for scalable performance analysis of large-scale parallel applications , 2008, Parallel Tools Workshop.
[42] Nathan R. Tallent,et al. HPCToolkit: performance tools for scientific computing , 2008 .
[43] R. Tibshirani,et al. Least angle regression , 2004, math/0406456.