I. Introduction
The discrete cosine transform (DCT) [1], [2] is a robust approximation of the optimal Karhunen-Loève transform (KLT) for a first-order Markov source with large correlation coefficient. It has satisfactory performance in terms of energy compaction capability, and many fast DCT algorithms with efficient hardware and software implementations have been proposed. The DCT has found wide applications in image/video processing and other fields. It has become the heart of many international standards such as JPEG, H.26x, and the MPEG family [3] [4] [5].