A Fast and Generic GPU-Based Parallel Reduction Implementation | IEEE Conference Publication