A Framework for Enabling OpenMP Autotuning
Prasanna Balaprakash | Bronis R. de Supinski | Thomas Scogland | Mary W. Hall | Vinu Sreenivasan | Rajath Javali
[1] Maciej Cytowski, et al. Towards Autotuning of OpenMP Applications on Multicore Architectures, 2014, ArXiv.
[2] Jack J. Dongarra, et al. Automatically Tuned Linear Algebra Software, 1998, Proceedings of the IEEE/ACM SC98 Conference.
[3] Chun Chen, et al. Combining models and guided empirical search to optimize for multiple levels of the memory hierarchy, 2005, International Symposium on Code Generation and Optimization.
[4] Samuel Williams, et al. Auto-tuning performance on multicore computers, 2008.
[5] Richard W. Vuduc, et al. Effective Source-to-Source Outlining to Support Whole Program Empirical Optimization, 2009, LCPC.
[6] Rudolf Eigenmann, et al. Performance Analysis and Tuning of Automatically Parallelized OpenMP Applications, 2011, IWOMP.
[7] Luca Benini, et al. Autotuning and adaptivity in energy efficient HPC systems: the ANTAREX toolbox, 2018, CF.
[8] Prasanna Balaprakash, et al. Generating Efficient Tensor Contractions for GPUs, 2015, 2015 44th International Conference on Parallel Processing.
[9] Prasanna Balaprakash, et al. Autotuning in High-Performance Computing Applications, 2018, Proceedings of the IEEE.