Compiling and Optimizing Java 8 Programs for GPU Execution
暂无分享,去创建一个
Vivek Sarkar | Akihiro Hayashi | Kazuaki Ishizaki | Gita Koblents | Vivek Sarkar | Akihiro Hayashi | Gita Koblents | Kazuaki Ishizaki
[1] James F. Power,et al. Platform independent dynamic Java virtual machine analysis: the Java Grande Forum benchmark suite , 2001, JGI '01.
[2] Vivek Sarkar,et al. HJ-OpenCL: Reducing the Gap Between the JVM and Accelerators , 2015, PPPJ.
[3] Dirk Grunwald,et al. Reducing indirect function call overhead in C++ programs , 1994, POPL '94.
[4] Vivek Sarkar,et al. Speculative Execution of Parallel Programs with Precise Exception Semantics on GPUs , 2013, LCPC.
[5] Michel Steuwer,et al. A Composable Array Function Interface for Heterogeneous Computing in Java , 2014, ARRAY@PLDI.
[7] Guy L. Steele,et al. The Java Language Specification , 1996 .
[8] Francky Catthoor,et al. Polyhedral parallel code generation for CUDA , 2013, TACO.
[9] David F. Bacon,et al. Compiling a high-level language for GPUs: (via language support for architectures and compilers) , 2012, PLDI.
[10] Vivek Sarkar,et al. Accelerating Habanero-Java programs with OpenCL generation , 2013, PPPJ.
[11] R. Govindarajan,et al. Fast and efficient automatic memory management for GPUs using compiler-assisted runtime coherence scheme , 2012, 2012 21st International Conference on Parallel Architectures and Compilation Techniques (PACT).
[12] Feng Liu,et al. Dynamically managed data for CPU-GPU architectures , 2012, CGO '12.
[13] Vivek Sarkar,et al. Machine-Learning-based Performance Heuristics for Runtime CPU/GPU Selection , 2015, PPPJ.
[14] Samuel P. Midkiff,et al. Automatic loop transformations and parallelization for Java , 2000, ICS '00.
[15] Vivek Sarkar,et al. HadoopCL: MapReduce on Distributed Heterogeneous Platforms through Seamless Integration of Hadoop and OpenCL , 2013, 2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum.
[16] Nikola Grcevski,et al. Java Just-in-Time Compiler and Virtual Machine Improvements for Server and Middleware Applications , 2004, Virtual Machine Research and Technology Symposium.
[17] Wojciech Zaremba,et al. JaBEE: framework for object-oriented Java bytecode compilation and execution on graphics processor units , 2012, GPGPU-5.
[18] Uday Bondhugula,et al. A compiler framework for optimization of affine loop nests for gpgpus , 2008, ICS '08.
[19] Ondrej Lhoták,et al. Automatic parallelization for graphics processing units , 2009, PPPJ '09.
[20] Toshiaki Yasue,et al. A study of devirtualization techniques for a Java Just-In-Time compiler , 2000, OOPSLA '00.
[21] David I. August,et al. Automatic CPU-GPU communication management and optimization , 2011, PLDI '11.
[22] Guy L. Steele,et al. The Java Language Specification, Java SE 8 Edition , 2013 .
[23] Philip C. Pratt-Szeliga,et al. Rootbeer: Seamlessly Using GPUs from Java , 2012, 2012 IEEE 14th International Conference on High Performance Computing and Communication & 2012 IEEE 9th International Conference on Embedded Software and Systems.
[24] Vivek Sarkar,et al. JCUDA: A Programmer-Friendly Interface for Accelerating Java Programs with CUDA , 2009, Euro-Par.
[25] Vivek Sarkar,et al. Array SSA form and its use in parallelization , 1998, POPL '98.