Applying source level auto-vectorization to Aparapi Java