Paper information - Stable duplicate-key extraction with optimal time and space bounds

Summary: We consider the problem of transforming a list L of records sorted on some key into two sublists L1 and L2 where, for each distinct key in L, L1 contains the first record of L that possesses the key and L2 contains all records of L with duplicate keys. We desire that our duplicate-key extraction algorithm perform the transformation in place and be stable (that is, records within each sublist must obey the original order given by L). This operation is useful in database and related file-processing environments whenever only distinct keys need be considered. Moreover, stability in extraction ensures that L can be efficiently restored at a later time with a stable merge of L1 and L2. Any procedure for performing duplicate-key extraction on a list of size n must require at least O(n) time and O(1) extra space, although the obvious algorithm for achieving either bound alone violates the other bound. We design a stable algorithm, using block-rearrangement techniques, and show that it is optimal in the theoretical sense that it achieves both lower bounds simultaneously. We also prove that its worst-case number of key comparisons and record exchanges sum to no more than 6n, suggesting that the algorithm has practical application as well.
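To make the problem statement concrete, the following is a minimal sketch of the "obvious" linear-time approach the summary alludes to, together with the stable merge that restores L. It is not the paper's in-place block-rearrangement algorithm: it builds two new lists and so uses O(n) extra space rather than O(1). The function names `extract_duplicates` and `restore`, and the `key` parameter, are hypothetical and chosen only for illustration.

```python
from typing import Callable, List, Tuple, TypeVar

R = TypeVar("R")  # record type
K = TypeVar("K")  # key type (assumed comparable)


def extract_duplicates(
    records: List[R], key: Callable[[R], K]
) -> Tuple[List[R], List[R]]:
    """Split a key-sorted list into (first records, duplicates).

    Illustrative only: runs in O(n) time but allocates O(n) extra
    space, so it meets the time bound and violates the space bound.
    """
    firsts: List[R] = []  # plays the role of L1
    dups: List[R] = []    # plays the role of L2
    for i, rec in enumerate(records):
        # Because the input is sorted, a record starts a new key
        # exactly when its key differs from its predecessor's.
        if i == 0 or key(rec) != key(records[i - 1]):
            firsts.append(rec)
        else:
            dups.append(rec)
    return firsts, dups


def restore(firsts: List[R], dups: List[R], key: Callable[[R], K]) -> List[R]:
    """Stable merge of the two sublists, rebuilding the original order."""
    merged: List[R] = []
    i = j = 0
    while i < len(firsts) and j < len(dups):
        # On equal keys take from firsts: the first record with a key
        # preceded all of its duplicates in the original list.
        if key(firsts[i]) <= key(dups[j]):
            merged.append(firsts[i])
            i += 1
        else:
            merged.append(dups[j])
            j += 1
    merged.extend(firsts[i:])
    merged.extend(dups[j:])
    return merged


records = [(1, "a"), (1, "b"), (2, "c"), (3, "d"), (3, "e"), (3, "f")]
l1, l2 = extract_duplicates(records, key=lambda r: r[0])
```

Both sublists keep the relative order of the input, which is why the merge recovers L exactly. Achieving the same result with only constant extra space and linear time is the harder problem that the paper addresses.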

Michael A. Langston | Bing-Chao Huang
