Conferences >2016 ACM/IEEE 43rd Annual Int...

Asymmetry-Aware Work-Stealing Runtimes

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

Amdahl's law provides architects a compelling reason to introduce system asymmetry to optimize for both serial and parallel regions of execution. Asymmetry in a multicore...Show More

Metadata

Abstract:

Amdahl's law provides architects a compelling reason to introduce system asymmetry to optimize for both serial and parallel regions of execution. Asymmetry in a multicore processor can arise statically (e.g., from core microarchitecture) or dynamically (e.g., applying dynamic voltage/frequency scaling). Work stealing is an increasingly popular approach to task distribution that elegantly balances task-based parallelism across multiple worker threads. In this paper, we propose asymmetry-aware work-stealing (AAWS) runtimes, which are carefully designed to exploit both the static and dynamic asymmetry in modern systems. AAWS runtimes use three key hardware/software techniques: work-pacing, work-sprinting, and work-mugging. Work-pacing and work-sprinting are novel techniques that combine a marginal-utility-based approach with integrated voltage regulators to improve performance and energy efficiency in high-and low-parallel regions. Work-mugging is a previously proposed technique that enables a waiting big core to preemptively migrate work from a busy little core. We propose a simple implementation of work-mugging based on lightweight user-level interrupts. We use a vertically integrated research methodology spanning software, architecture, and VLSI to make the case that holistically combining static asymmetry, dynamic asymmetry, and work-stealing runtimes can improve both performance and energy efficiency in future multicore systems.

Published in: 2016 ACM/IEEE 43rd Annual International Symposium on Computer Architecture (ISCA)

Date of Conference: 18-22 June 2016

Date Added to IEEE Xplore: 25 August 2016

ISBN Information:

Print ISSN: 1063-6897

DOI: 10.1109/ISCA.2016.14

Conference Location: Seoul, Korea (South)

Contents

I. Introduction

Work stealing is a well-known approach to task distribution that elegantly balances task-based parallelism across multiple worker threads [10], [30]. In a work-stealing runtime, each worker thread enqueues and dequeues tasks onto the tail of its task queue. When a worker finds its queue empty, it attempts to steal a task from the head of another worker thread's task queue. Work stealing has been shown to have good performance, space requirements, and communication overhead in both theory [8] and practice [7], [22]. Optimizing work-stealing runtimes remains a rich research area [4], [12], [13], [15], [18], [4]–7, and work stealing is a critical component in many popular concurrency platforms including Intel's Cilk++, Intel's C++ Threading Building Blocks (TBB), Microsoft's. NET Task Parallel Library, Java's Fork/Join Framework, X10, and OpenMP. Most of the past research and current implementations use asymmetry-oblivious work-stealing runtimes. In this work, we propose asymmetry-aware work-stealing (AAWS) runtimes, which exploit both static asymmetry (e.g., different core microarchitectures) and dynamic asymmetry (e.g., per-core dynamic voltage/frequency scaling) to improve the performance and energy efficiency of multicore processors.

References is not available for this document.

Asymmetry-Aware Work-Stealing Runtimes

Abstract:

Metadata

Abstract:

I. Introduction

References

IEEE Account

Purchase Details

Profile Information

Need Help?

Asymmetry-Aware Work-Stealing Runtimes

Alerts

Abstract:

Metadata

Abstract:

I. Introduction

Authors

Figures

References

Citations

Keywords

Metrics

Footnotes

References