Only Buffer When You Need To: Reducing On-chip GPU Traffic with Reconfigurable Local Atomic Buffers | IEEE Conference Publication | IEEE Xplore