论文信息 - Scalable and Efficient Parallel Selection

Scalable and Efficient Parallel Selection

Selection algorithms find the \(k^{\mathrm {th}}\) smallest element from a set of elements. Although there are optimal parallel selection algorithms available for theoretical machines, these algorithms are not only difficult to implement but also inefficient in practice. Consequently, scalable applications can only use few special cases such as minimum and maximum, where efficient implementations exist. To overcome such limitations, we propose a general parallel selection algorithm that scales even on today’s largest supercomputers. Our approach is based on an efficient, unbiased median approximation method, recently introduced as median-of-3 reduction, and Hoare’s sequential QuickSelect idea from \(1961\). The resulting algorithm scales with a time complexity of \(\mathcal {O}(\log ^2 n)\) for \(n\) distributed elements while needing only \(\mathcal {O}(1)\) space. Furthermore, we prove it to be a practical solution by explaining implementation details and showing performance results for up to \(458,752\) processor cores.

Christian Siebert

[1] Mahmoud Fouz,et al. On Smoothed Analysis of Quicksort and Hoare’s Find , 2011, Algorithmica.

[2] Manuel Blum,et al. Time Bounds for Selection , 1973, J. Comput. Syst. Sci..

[3] H. Prodinger,et al. Analysis of Hoare's FIND algorithm with median-of-three partition , 1997 .

[4] C. A. R. Hoare,et al. Algorithm 64: Quicksort , 1961, Commun. ACM.

[5] W. Donald Frazer,et al. Samplesort: A Sampling Approach to Minimal Storage Tree Sorting , 1970, JACM.

[6] Jesper Larsson Träff,et al. Parallel Prefix (Scan) Algorithms for MPI , 2006, PVM/MPI.

[7] Yijie Han. Optimal parallel selection , 2003, SODA '03.

[8] William Gropp,et al. A Scalable MPI_Comm_split Algorithm for Exascale Computing , 2010, EuroMPI.

[9] Rolf Rabenseifner,et al. Optimization of Collective Reduction Operations , 2004, International Conference on Computational Science.

[10] Felix Wolf,et al. Parallel Sorting with Minimal Data , 2011, EuroMPI.

[11] C. A. R. Hoare. Algorithm 63: partition , 1961, CACM.