[1] Matteo Frigo, et al. Reducers and other Cilk++ hyperobjects, 2009, SPAA '09.
[2] Vikram S. Adve, et al. LLVM: a compilation framework for lifelong program analysis & transformation, 2004, International Symposium on Code Generation and Optimization, 2004. CGO 2004.
[3] Paolo D'Alberto, et al. Multiple-Campaign Ad-Targeting Deployment: Parallel Response Modeling, Calibration and Scoring Without Personal User Information, 2015.
[4] Haichen Shen, et al. TVM: An Automated End-to-End Optimizing Compiler for Deep Learning, 2018.
[5] João M. P. Cardoso, et al. Optimizing OpenCL Code for Performance on FPGA: k-Means Case Study With Integer Data Sets, 2020, IEEE Access.
[6] Cedric Nugteren, et al. CLBlast: A Tuned OpenCL BLAS Library, 2017, IWOCL.
[7] She Muses. Spiral, 2021, Encyclopedic Dictionary of Archaeology.
[8] Steven G. Johnson, et al. FFTW: an adaptive software architecture for the FFT, 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).
[9] Ramachandra Achar, et al. A Comparative Study of MAGMA and cuBLAS Libraries for GPU based Vector Fitting, 2020, 2020 IEEE 11th Latin American Symposium on Circuits & Systems (LASCAS).
[10] P. D'Alberto, et al. xDNN: Inference for Deep Convolutional Neural Networks, 2022, ACM Trans. Reconfigurable Technol. Syst.
[11] Thomas Fahringer, et al. SYCL-Bench: A Versatile Cross-Platform Benchmark Suite for Heterogeneous Computing, 2020, Euro-Par.
[12] Milind Girkar, et al. On the exploitation of loop-level parallelism in embedded applications, 2009, TECS.
[13] Nazeeruddin Mohammad, et al. A review of CUDA optimization techniques and tools for structured grid computing, 2019, Computing.