BHive: A Benchmark Suite and Measurement Framework for Validating x86-64 Basic Block Performance Models
暂无分享,去创建一个
Eric Atkinson | Michael Carbin | Ondřej Sýkora | Ajay Brahmakshatriya | Saman Amarasinghe | Saman P. Amarasinghe | Charith Mendis | Alex Renda | Yishen Chen | Charith Mendis | Michael Carbin | O. Sýkora | Alex Renda | Yishen Chen | Ajay Brahmakshatriya | Eric Hamilton Atkinson
[1] M. Kendall. A NEW MEASURE OF RANK CORRELATION , 1938 .
[2] John Whaley,et al. A portable sampling-based profiler for Java virtual machines , 2000, JAVA '00.
[3] Derek Bruening,et al. An infrastructure for adaptive dynamic optimization , 2003, International Symposium on Code Generation and Optimization, 2003. CGO 2003..
[4] Michael I. Jordan,et al. Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..
[5] Vikram S. Adve,et al. LLVM: a compilation framework for lifelong program analysis & transformation , 2004, International Symposium on Code Generation and Optimization, 2004. CGO 2004..
[6] Andrey Gubarev,et al. Dremel : Interactive Analysis of Web-Scale Datasets , 2011 .
[7] Christopher Frost,et al. Spanner: Google's Globally-Distributed Database , 2012, OSDI.
[8] M. Pharr,et al. ispc: A SPMD compiler for high-performance CPU programming , 2012, 2012 Innovative Parallel Computing (InPar).
[9] Ingo Wald,et al. Embree: a kernel framework for efficient CPU ray tracing , 2014, ACM Trans. Graph..
[10] Yuan Yu,et al. TensorFlow: A system for large-scale machine learning , 2016, OSDI.
[11] Gerhard Wellein,et al. Automated Instruction Stream Throughput Prediction for Intel and AMD Microarchitectures , 2018, 2018 IEEE/ACM Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS).
[12] Saman P. Amarasinghe,et al. goSLP: globally optimized superword level parallelism framework , 2018, Proc. ACM Program. Lang..
[13] Aaas News,et al. Book Reviews , 1893, Buffalo Medical and Surgical Journal.
[14] Jan Reineke,et al. uops.info: Characterizing Latency, Throughput, and Port Usage of Instructions on Intel Microarchitectures , 2018, ASPLOS.
[15] Ben Juurlink,et al. Portable Cost Modeling for Auto-Vectorizers , 2019, 2019 IEEE 27th International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS).
[16] Michael Carbin,et al. Ithemal: Accurate, Portable and Fast Basic Block Throughput Estimation using Deep Neural Networks , 2018, ICML.