CUB is a library of high-performance parallel primitives and other utilities for building CUDA software. CUB targets all layers of the CUDA programming model, greatly simplifying the development and maintenance of complex device-level, block-level, warp-level, and thread-level operations.
CUB is developed as an open-source project by NVIDIA Research. Our primary release, v1.3.2, is now available!
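
To give a feel for what a block-level primitive looks like in practice, here is a minimal sketch that sums 128 integers within one thread block using cub::BlockReduce. It is only an illustration: the kernel name BlockSumKernel, the host helper SumOneBlock, and the block size of 128 are made up for this example, and CUDA error checking is left out for brevity.

```cuda
#include <cub/cub.cuh>
#include <cuda_runtime.h>
#include <cstdio>
#include <vector>

// Illustrative choice: one block of 128 threads, one item per thread.
const int BLOCK_THREADS = 128;

// Hypothetical kernel: each block reduces BLOCK_THREADS integers to one sum.
__global__ void BlockSumKernel(const int *d_in, int *d_out)
{
    // Specialize BlockReduce for int items and this block size.
    typedef cub::BlockReduce<int, BLOCK_THREADS> BlockReduce;

    // Shared memory that the collective needs for its communication.
    __shared__ BlockReduce::TempStorage temp_storage;

    int item = d_in[blockIdx.x * BLOCK_THREADS + threadIdx.x];

    // The returned aggregate is only valid in thread 0 of the block.
    int block_sum = BlockReduce(temp_storage).Sum(item);

    if (threadIdx.x == 0)
        d_out[blockIdx.x] = block_sum;
}

// Hypothetical host helper: sums exactly BLOCK_THREADS values on the device.
// Returns -1 if the input has the wrong size.
int SumOneBlock(const std::vector<int> &h_in)
{
    if (h_in.size() != static_cast<size_t>(BLOCK_THREADS))
        return -1;

    int *d_in = NULL;
    int *d_out = NULL;
    cudaMalloc(&d_in, BLOCK_THREADS * sizeof(int));
    cudaMalloc(&d_out, sizeof(int));
    cudaMemcpy(d_in, h_in.data(), BLOCK_THREADS * sizeof(int),
               cudaMemcpyHostToDevice);

    BlockSumKernel<<<1, BLOCK_THREADS>>>(d_in, d_out);

    // This blocking copy also waits for the kernel to finish.
    int h_out = 0;
    cudaMemcpy(&h_out, d_out, sizeof(int), cudaMemcpyDeviceToHost);

    cudaFree(d_in);
    cudaFree(d_out);
    return h_out;
}

int main()
{
    std::vector<int> h_in(BLOCK_THREADS);
    int expected = 0;
    for (int i = 0; i < BLOCK_THREADS; ++i)
    {
        h_in[i] = i;
        expected += i;
    }

    int result = SumOneBlock(h_in);
    printf("device sum = %d, host sum = %d\n", result, expected);
    return (result == expected) ? 0 : 1;
}
```

Assuming the CUB headers are on the include path, something like "nvcc -I<path-to-cub> example.cu" should build it. The same pattern (specialize the collective, allocate its TempStorage, call the operation) applies to the other block-level and warp-level primitives.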



Cheers,
Duane