4-D Wavelet-Based Multiview Video Coding | IEEE Journals & Magazine | IEEE Xplore

4-D Wavelet-Based Multiview Video Coding


Abstract:

The conventional multiview video coding (MVC) schemes, utilizing both neighboring temporal frames and view frames as possible references, have only shown a slight gain ov...Show More

Abstract:

The conventional multiview video coding (MVC) schemes, utilizing both neighboring temporal frames and view frames as possible references, have only shown a slight gain over those using temporal frames alone in terms of coding efficiency. The reason for this is that the neighboring temporal frames exhibit stronger correlation with the current frame and the view frames often fail to be selected as references. This paper proposes an elegant MVC framework using high dimensional wavelet, which rightly matches the inherent high dimension property of multiview video. It also makes a better usage of both temporal and view correlations thanks to the hierarchical decomposition. Besides the proposed framework, this paper also investigates MVC coding from the following aspects. First, a disparity-compensated view filter (DCVF) with pixel alignment is proposed, which can accommodate both global and local view disparities among view frames. The proposed DCVF and the existing motion-compensated temporal filter (MCTF) unify the view and temporal decompositions as a generic lifting transform. Second, an adaptive decomposition structure based on the analysis of the temporal and view correlations is proposed. A Lagrangian cost function is derived to determine the optimum decomposition structure. Third, the major components of the proposed MVC coding are figured out, including macroblock type design, subband coefficient coding, and rate allocation. Extensive experiments are carried out on the MPEG 3DAV test sequences and the superior performance of the proposed MVC coding is demonstrated. In addition, the proposed MVC framework can easily support temporal, spatial, SNR, as well as view scalabilities
Page(s): 1385 - 1396
Date of Publication: 13 November 2006

ISSN Information:


I. Introduction

A wide variety of future interactive multimedia applications, including free-viewpoint video or free-viewpoint television, 3-D television, immersive teleconference, and surveillance, are emerging. The existing and emerging technologies for modeling, coding, and rendering of real-world scenes for these applications have been reviewed at the convergence point of computer graphics, computer vision, and classical media [1]. It has been widely recognized that multiview video coding (MVC) is one of the key technologies for interactive multimedia applications [2]. Because of the huge data volume, efficient compression becomes more important in storage and delivery of multiview video than any other data. In most cases, multiview video is captured by multiple cameras from different angles and locations at the same time instant, and significant correlations may exist between the adjacent views.

Contact IEEE to Subscribe

References

References is not available for this document.