Conferences >2013 IEEE 43rd International ...

The Impact of Address Arithmetic on the GPU Implementation of Fast Algorithms for the Vilenkin-Chrestenson Transform

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

This paper considers the impact of address arithmetic in the Cooley-Tukey and the constant geometry fast algorithms for the Vilenkin-Chrestenson transform on their implem...Show More

Metadata

Abstract:

This paper considers the impact of address arithmetic in the Cooley-Tukey and the constant geometry fast algorithms for the Vilenkin-Chrestenson transform on their implementation for the graphics processing unit (GPU). We consider issues such as using different transform radices and analyze the number of GPU instructions and register usage in the OpenCL implementations of the considered algorithms. Further, we compare the program running times on the GPU and on the central processing unit (CPU). Experiments show that the GPU implementations are from 10 to 22 times faster than the C/C++ CPU implementations, depending on the transform radix and the number of variables in the processed function. The OpenCL implementation of the constant geometry algorithm translates into a lower number of GPU arithmetic and fetch instructions and uses less registers. This implementation requires up to 21% shorter processing times than the corresponding Cooley-Tukey algorithm implementation.

Published in: 2013 IEEE 43rd International Symposium on Multiple-Valued Logic

Date of Conference: 22-24 May 2013

Date Added to IEEE Xplore: 10 June 2013

ISBN Information:

ISSN Information:

DOI: 10.1109/ISMVL.2013.59

Conference Location: Toyama, Japan

Contents

I. Introduction

The Vilenkin-Chrestenson transform can be viewed as the generalization of the Walsh transform from binary to multiple-valued logic (MVL) functions [4], [20]. It has applications analogous to that of the Walsh transform in binary logic [9], [11]. In spite of the existence of fast algorithms, time needed for computing the Vilenkin-Chrestenson spectrum of a - valued function is a restrictive parameter in many applications. Therefore, accelerating the computation by using devices such as graphics processing units (GPUs) can be of practical importance. The existing algorithms are based on different factorizations of the Vilenkin-Chrestenson transform matrix which are tailored for the implementation on central processing units (CPUs). Mapping of the Vilenkin-Chrestenson transform to GPUs requires careful selection of a particular fast algorithm since different underlying factorizations have significant implications on the performance.

References is not available for this document.

MIT Libraries

MIT Libraries

The Impact of Address Arithmetic on the GPU Implementation of Fast Algorithms for the Vilenkin-Chrestenson Transform

Abstract:

Metadata

Abstract:

ISSN Information:

I. Introduction

References

IEEE Account

Purchase Details

Profile Information

Need Help?

MIT Libraries

MIT Libraries

The Impact of Address Arithmetic on the GPU Implementation of Fast Algorithms for the Vilenkin-Chrestenson Transform

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

I. Introduction

References