Hardware sorters exploit inherent concurrency to improve the performance of sequential, softwarebased sorting algorithms. They are often based on Batcher’s odd-even or bitonic merging networks to attenuate the areagreedy hardware solutions. In this paper, a new hardware sorter architecture is presented. It is composed of smaller sorter circuits inspired on insertion sorting algorithms that contain as many data-slice cells as data to sort. Such sorters are easily scalable and require minimal control schemes. We outline the properties of the sorter modules in order to introduce some architectures that overcome their limitations. Such architectures are based on combining several of such sorters in parallel to speed up the computing of the final result. Different area and speed requirements lead to a variety of architectural considerations that produce optimal designs for each case. An application of such sorters designs is presented and implemented on FPGAs. The synthesis results are compared with other parallel sorting architectures.
[1]
Gilles Brassard,et al.
Fundamentals of Algorithmics
,
1995
.
[2]
Rajgopal Kannan.
A Pipelined Single-Bit Controlled Sorting Network with O ( N log 2 N ).
,
1997,
INFOCOM 1997.
[3]
Kenneth E. Batcher,et al.
Minimizing communication of a recirculating bitonic sorting network
,
1996,
Proceedings of the 1996 ICPP Workshop on Challenges for Parallel Processing.
[4]
Rajgopal Kannan.
A pipelined single-bit controlled sorting network with O(Nlog/sup 2/N) bit complexity
,
1997,
Proceedings of INFOCOM '97.
[5]
David J. DeWitt,et al.
A taxonomy of parallel sorting
,
1984,
CSUR.
[6]
Mary Sheeran,et al.
The Design and Verification of a Sorter Core
,
2001,
CHARME.
[7]
Kenneth E. Batcher,et al.
Minimizing Communication in the Bitonic Sort
,
2000,
IEEE Trans. Parallel Distributed Syst..