Loading [MathJax]/extensions/MathMenu.js
Performance evaluation of Intel® Transactional Synchronization Extensions for high-performance computing | IEEE Conference Publication | IEEE Xplore

Performance evaluation of Intel® Transactional Synchronization Extensions for high-performance computing


Abstract:

Intel has recently introduced Intel® Transactional Synchronization Extensions (Intel® TSX) in the Intel 4th Generation Core™ Processors. With Intel TSX, a processor can d...Show More

Abstract:

Intel has recently introduced Intel® Transactional Synchronization Extensions (Intel® TSX) in the Intel 4th Generation Core™ Processors. With Intel TSX, a processor can dynamically determine whether threads need to serialize through lock-protected critical sections. In this paper, we evaluate the first hardware implementation of Intel TSX using a set of high-performance computing (HPC) workloads, and demonstrate that applying Intel TSX to these workloads can provide significant performance improvements. On a set of real-world HPC workloads, applying Intel TSX provides an average speedup of 1.41x. When applied to a parallel user-level TCP/IP stack, Intel TSX provides 1.31x average bandwidth improvement on network intensive applications. We also demonstrate the ease with which we were able to apply Intel TSX to the various workloads.
Date of Conference: 17-22 November 2013
Date Added to IEEE Xplore: 14 August 2014
CD:978-1-4503-2378-9

ISSN Information:

Conference Location: Denver, CO, USA

1 Introduction

Due to limits in technology scaling, software developers have come to rely on thread-level parallelism to obtain sustainable performance improvement. However, except for the case where the computation is massively parallel (e.g., data-parallel applications), performance of threaded applications is often limited by how inter-thread synchronization is per-formed. For example, using coarse-grained locks can limit scalability, since the execution of lock-guarded critical sections is inherently serialized. Using fine-grained locks, in contrast, may provide good scalability, but increases locking overheads, and can often lead to subtle bugs.

Contact IEEE to Subscribe

References

References is not available for this document.