Improving SIMD Efficiency for Parallel Monte Carlo Light Transport on the GPU
Abstract
Monte Carlo Light Transport algorithms such as Path Tracing (PT), Bi-Directional Path Tracing (BDPT) and Metropolis Light Transport (MLT) make use of random walks to sample light transport paths. When parallelizing these algorithms on the GPU the stochastic termination of random walks results in an uneven workload between samples, which reduces SIMD efficiency. In this paper we propose to combine stream compaction and sample regeneration to keep SIMD efficiency high during random walk construction, in spite of stochastic termination. Furthermore, for BDPT and MLT, we propose to evaluate all bidirectional connections of a sample in parallel in order to balance the workload between GPU threads and improve SIMD efficiency during sample evaluation. We present efficient parallel GPU-only implementations for PT, BDPT, and MLT in CUDA.We show that our GPU implementations outperform similarCPU implementations by an order of magnitude.
Description
@inproceedings{10.1145:2018323.2018330,
booktitle = {Eurographics/ ACM SIGGRAPH Symposium on High Performance Graphics},
editor = {Carsten Dachsbacher and William Mark and Jacopo Pantaleoni},
title = {{Improving SIMD Efficiency for Parallel Monte Carlo Light Transport on the GPU}},
author = {Antwerpen, Dietger van},
year = {2011},
publisher = {ACM},
ISSN = {2079-8687},
ISBN = {978-1-4503-0896-0},
DOI = {10.1145/2018323.2018330}
}