Fast Four-Way Parallel Radix Sorting on GPUs

No Thumbnail Available
Date
2009
Journal Title
Journal ISSN
Volume Title
Publisher
The Eurographics Association and Blackwell Publishing Ltd
Abstract
Efficient sorting is a key requirement for many computer science algorithms. Acceleration of existing techniques as well as developing new sorting approaches is crucial for many real-time graphics scenarios, database systems, and numerical simulations to name just a few. It is one of the most fundamental operations to organize and filter the ever growing massive amounts of data gathered on a daily basis. While optimal sorting models for serial execution on a single processor exist, efficient parallel sorting remains a challenge. In this paper, we present a hardware-optimized parallel implementation of the radix sort algorithm that results in a significant speed up over existing sorting implementations. We outperform all known General Processing Unit (GPU) based sorting systems by about a factor of two and eliminate restrictions on the sorting key space. This makes our algorithm not only the fastest, but also the first general GPU sorting solution.
Description

        
@article{
10.1111:j.1467-8659.2009.01542.x
, journal = {Computer Graphics Forum}, title = {{
Fast Four-Way Parallel Radix Sorting on GPUs
}}, author = {
Ha, Linh
and
Krueger, Jens
and
Silva, Claudio T.
}, year = {
2009
}, publisher = {
The Eurographics Association and Blackwell Publishing Ltd
}, ISSN = {
1467-8659
}, DOI = {
10.1111/j.1467-8659.2009.01542.x
} }
Citation
Collections