A Proposed Data Partitioning Approach on Heterogeneous HPC Platforms: Data Locality Perspective
暂无分享,去创建一个
[1] Viktor K. Prasanna,et al. Block‐cyclic redistribution over heterogeneous networks , 2004, Cluster Computing.
[2] Torsten Hoefler,et al. An Overview of Topology Mapping Algorithms and Techniques in High‐Performance Computing , 2014, HiPC 2014.
[3] Ziming Zhong,et al. Data Partitioning on Heterogeneous Multicore and Multi-GPU Systems Using Functional Performance Models of Data-Parallel Applications , 2012, 2012 IEEE International Conference on Cluster Computing.
[4] Alexey L. Lastovetsky,et al. Column-Based Matrix Partitioning for Parallel Matrix Multiplication on Heterogeneous Processors Based on Functional Performance Models , 2011, Euro-Par Workshops.
[5] Alexey L. Lastovetsky,et al. Data distribution for dense factorization on computers with memory heterogeneity , 2007, Parallel Comput..
[6] Brett A. Becker,et al. Partitioning for Parallel Matrix-Matrix Multiplication with Heterogeneous Processors: The Optimal Solution , 2012, 2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum.
[7] Satoshi Matsuoka,et al. An efficient, model-based CPU-GPU heterogeneous FFT library , 2008, 2008 IEEE International Symposium on Parallel and Distributed Processing.
[8] Alexey L. Lastovetsky,et al. Data Partitioning with a Functional Performance Model of Heterogeneous Processors , 2007, Int. J. High Perform. Comput. Appl..
[9] Onur Mutlu,et al. The Locality Descriptor: A Holistic Cross-Layer Abstraction to Express Data Locality In GPUs , 2018, 2018 ACM/IEEE 45th Annual International Symposium on Computer Architecture (ISCA).
[10] Yves Robert,et al. Matrix-matrix multiplication on heterogeneous platforms , 2000, Proceedings 2000 International Conference on Parallel Processing.
[11] Kai Lu,et al. Adaptive Optimization for Petascale Heterogeneous CPU/GPU Computing , 2010, 2010 IEEE International Conference on Cluster Computing.
[12] Alexey L. Lastovetsky,et al. Two-Dimensional Matrix Partitioning for Parallel Computing on Heterogeneous Processors Based on Their Functional Performance Models , 2009, Euro-Par Workshops.
[13] Siham Tabik,et al. A Data Partitioning Model for Highly Heterogeneous Systems , 2016, Euro-Par Workshops.
[14] Jesús Labarta,et al. Performance Modeling of HPC Applications , 2003, PARCO.
[15] John Shalf,et al. Programming Abstractions for Data Locality , 2014 .
[16] J. J. Collins,et al. An empirical study of data decomposition for software parallelization , 2017, J. Syst. Softw..
[17] Alexey Lastovetsky,et al. A Novel Data-Partitioning Algorithm for Performance Optimization of Data-Parallel Applications on Heterogeneous HPC Platforms , 2018, IEEE Transactions on Parallel and Distributed Systems.
[18] Peng Zhang,et al. A Survey of Homogeneous and Heterogeneous System Architectures in High Performance Computing , 2016, 2016 IEEE International Conference on Smart Cloud (SmartCloud).
[19] Mohammed J. Zaki,et al. Compile-Time Scheduling Algorithms for a Heterogeneous Network of Workstations , 1997, Comput. J..
[20] John Shalf,et al. Trends in Data Locality Abstractions for HPC Systems , 2017, IEEE Transactions on Parallel and Distributed Systems.
[21] Alexey L. Lastovetsky,et al. New Model-Based Methods and Algorithms for Performance and Energy Optimization of Data Parallel Applications on Homogeneous Multicore Clusters , 2017, IEEE Transactions on Parallel and Distributed Systems.
[22] Alexey L. Lastovetsky,et al. Heterogeneous Distribution of Computations Solving Linear Algebra Problems on Networks of Heterogeneous Computers , 2001, J. Parallel Distributed Comput..
[23] John Shalf,et al. Overlapping Data Transfers with Computation on GPU with Tiles , 2017, 2017 46th International Conference on Parallel Processing (ICPP).
[24] Brett A. Becker. High-Level Data Partitioning for Parallel Computing on Heterogeneous Hierarchical Computational Plat , 2010 .
[25] Alexey L. Lastovetsky,et al. Model-Based Optimization of EULAG Kernel on Intel Xeon Phi Through Load Imbalancing , 2017, IEEE Transactions on Parallel and Distributed Systems.
[26] Alexey Lastovetsky,et al. A Hierarchical Data-Partitioning Algorithm for Performance Optimization of Data-Parallel Applications on Heterogeneous Multi-Accelerator NUMA Nodes , 2020, IEEE Access.
[27] Alexey L. Lastovetsky,et al. Data partitioning with a realistic performance model of networks of heterogeneous computers with task size limits , 2004, Third International Symposium on Parallel and Distributed Computing/Third International Workshop on Algorithms, Models and Tools for Parallel Computing on Heterogeneous Networks.