Conferences >2023 IEEE International Paral...

A Novel Triangular Space-Filling Curve for Cache-Oblivious In-Place Transposition of Square Matrices

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

This paper proposes a novel cache-oblivious blocking scheme based on a new triangular space-filling curve which preserves data locality. The proposed blocking-scheme redu...Show More

Metadata

Abstract:

This paper proposes a novel cache-oblivious blocking scheme based on a new triangular space-filling curve which preserves data locality. The proposed blocking-scheme reduces the movement of data within the host memory hierarchy for triangular matrix traversals, which inherently exhibit poor data locality, such as the in-place transposition of square matrices. We show that our cache-oblivious blocking-scheme can be generated iteratively in linear time and constant memory with regard to the number of entries present in the lower, or upper, triangle of the input matrix. In contrast to classical recursive cache-oblivious solutions, the iterative nature of our blocking-scheme does not inhibit other essential optimizations such as software prefetching. In order to assess the viability of our blocking-scheme as a cache-oblivious strategy, we applied it to the in-place transposition of square matrices. Extensive experiments show that our cache-oblivious transposition algorithm generally outperforms the cache-aware state-of-the-art algorithm in terms of throughput and energy efficiency in sequential as well as parallel environments.

Published in: 2023 IEEE International Parallel and Distributed Processing Symposium (IPDPS)

Date of Conference: 15-19 May 2023

Date Added to IEEE Xplore: 18 July 2023

ISBN Information:

ISSN Information:

DOI: 10.1109/IPDPS54959.2023.00045

Conference Location: St. Petersburg, FL, USA

Funding Agency:

Contents

I. Introduction

The in-place transposition of square matrices is a well-explored problem in scientific computing [1]–[3]. In-place transposition routines are widely used to optimize the memory access patterns of different FFT methods. In some cases, such as in the "six-step" FFT variant, the transposition steps are found to represent the most significant performance bottle-neck [4]. The in-place transposition of square matrices is also an important building block for methods that compute the in-place transposition of rectangular matrices, where the input matrix is partitioned into square sub-matrices that must be transposed in-place, such as the Euclid's GCD method proposed by Gustavson et al. [5].

References is not available for this document.

MIT Libraries

MIT Libraries

A Novel Triangular Space-Filling Curve for Cache-Oblivious In-Place Transposition of Square Matrices

Abstract:

Metadata

Abstract:

ISSN Information:

Funding Agency:

I. Introduction

References

IEEE Account

Purchase Details

Profile Information

Need Help?

MIT Libraries

MIT Libraries

A Novel Triangular Space-Filling Curve for Cache-Oblivious In-Place Transposition of Square Matrices

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

Funding Agency:

I. Introduction

References