Efficient implementation of sorting on multi-core SIMD CPU architecture
暂无分享,去创建一个
Pradeep Dubey | Victor W. Lee | Sanjeev Kumar | Yen-Kuang Chen | Anthony D. Nguyen | William Macy | Jatin Chhugani | Mostafa Hagog | Akram Baransi | P. Dubey | Sanjeev Kumar | J. Chhugani | A. Nguyen | V. Lee | Yen-kuang Chen | William Macy | Mostafa Hagog | A. Baransi
[1] Donald E. Knuth,et al. The art of computer programming, volume 3: (2nd ed.) sorting and searching , 1998 .
[2] Yao Zhang,et al. Scan primitives for GPU computing , 2007, GH '07.
[3] Kenneth E. Batcher,et al. Sorting networks and their applications , 1968, AFIPS Spring Joint Computing Conference.
[4] Alexandru Nicolau,et al. Adaptive Bitonic Sorting: An Optimal Parallel Algorithm for Shared-Memory Machines , 1989, SIAM J. Comput..
[5] Yi Zhang,et al. A simple, fast parallel implementation of Quicksort and its performance evaluation on SUN Enterprise 10000 , 2003, Eleventh Euromicro Conference on Parallel, Distributed and Network-Based Processing, 2003. Proceedings..
[6] Rhys S. Francis,et al. A Benchmark Parallel Sort for Shared Memory Multiprocessors , 1988, IEEE Trans. Computers.
[7] Saurabh Dighe,et al. An 80-Tile 1.28TFLOPS Network-on-Chip in 65nm CMOS , 2007, 2007 IEEE International Solid-State Circuits Conference. Digest of Technical Papers.
[8] Toshio Nakatani,et al. AA-Sort: A New Parallel Sorting Algorithm for Multi-Core SIMD Processors , 2007, 16th International Conference on Parallel Architecture and Compilation Techniques (PACT 2007).
[9] Pat Hanrahan,et al. Photon mapping on programmable graphics hardware , 2003, HWWS '03.
[10] Dinesh Manocha,et al. GPUTeraSort: high performance graphics co-processor sorting for large database management , 2006, SIGMOD Conference.
[11] Gabriel Zachmann,et al. GPU-ABiSort: optimal parallel sorting on stream architectures , 2006, Proceedings 20th IEEE International Parallel & Distributed Processing Symposium.
[12] Philip S. Yu,et al. CellSort: High Performance Sorting on the Cell Processor , 2007, VLDB.
[13] M. Chial,et al. in simple , 2003 .
[14] Y. Mukaigawa,et al. Large Deviations Estimates for Some Non-local Equations I. Fast Decaying Kernels and Explicit Bounds , 2022 .
[15] Rhys Francis,et al. A Fast, Simple Algorithm to Balance a Parallel Multiway Merge , 1993, PARLE.
[16] Donald Ervin Knuth,et al. The Art of Computer Programming , 1968 .
[17] Pradeep Dubey,et al. Larrabee: A Many-Core x86 Architecture for Visual Computing , 2009, IEEE Micro.
[18] Shing-Tsaan Huang,et al. K-Way Bitonic Sort , 1989, IEEE Trans. Computers.
[19] Ulf Assarsson,et al. Fast parallel GPU-sorting using a hybrid algorithm , 2008, J. Parallel Distributed Comput..
[20] Michael Garland,et al. Designing efficient sorting algorithms for manycore GPUs , 2009, 2009 IEEE International Symposium on Parallel & Distributed Processing.
[21] Donald E. Knuth,et al. The art of computer programming: sorting and searching (volume 3) , 1973 .
[22] Richard Box,et al. A fast, easy sort , 1991 .
[23] Michael Gschwind. Chip multiprocessing and the cell broadband engine , 2006, CF '06.