Loading [MathJax]/extensions/MathZoom.js
Inter-View Direct Mode for Multiview Video Coding | IEEE Journals & Magazine | IEEE Xplore

Inter-View Direct Mode for Multiview Video Coding


Abstract:

Global disparity between views is usually caused by the displacement between cameras, which can be accurately represented by a global geometric transformation. In this pa...Show More

Abstract:

Global disparity between views is usually caused by the displacement between cameras, which can be accurately represented by a global geometric transformation. In this paper, we first propose an inter-view motion model in terms of the global geometric transformation to represent the motion correlation between two adjacent views. Specifically, the motion vector of a pixel in one view may be directly derived from that in another view according to the inter-view motion model. Further, we propose an inter-view direct mode to signal the decoder that the motion of a macroblock (MB) can be achieved from the coded view without any coding bits. The proposed inter-view direct mode is further incorporated in the existing multiview video coding (MVC) schemes (i.e., AVC-based MVC and 4-D wavelet-based MVC), working together with the other classical coding modes. The mode selection at each MB is accomplished with the rate-distortion optimization technique. The proposed inter-view direct mode can significantly reduce bits to code motion vectors especially at low bit rates, thus improving the coding efficiency
Page(s): 1527 - 1532
Date of Publication: 30 November 2006

ISSN Information:

References is not available for this document.

I. Introduction

In modern video coding schemes, motion prediction is becoming more and more accurate for better coding efficiency. For example in MPEG-4 AVC/H.264 standard [1], a macroblock (MB) in P-picture may have up to 16 different motion vectors, while an MB in B-picture may even have up to 32 different motion vectors in the case of bidirectional prediction. However, it also implied that many bits have to be assigned to code these motion vectors. Although motion vectors are relatively efficiently coded considering that they are differentially coded compared to a motion vector predictor, taken as the median value of the motion vectors of the spatially adjacent blocks, MPEG-4 AVC/H.264 standard still introduces two direct modes: SKIP and DIRECT within P- and B-pictures respectively, for a further reduction to the amount of bits for coding motion vectors [2].

Select All
1.
T. Wiegand, G. J. Sullivan, G. Bjontegaard and A. Luthra, "Overview of the H.264/AVC video coding standard", IEEE Trans. Circuits Syst. Video Technol., vol. 13, no. 7, pp. 560-576, Jul. 2003.
2.
A. M. Tourapis, F. Wu and S. Li, "Direct mode coding for bipredictive slices in the H.264standard", IEEE Trans. Circuits Syst. Video Technol., vol. 15, no. 1, pp. 119-126, Jan. 2005.
3.
A. Vetro, W. Matusik, H. Pfister and J. Xin, "Codingapproaches for end-to-end 3-D TV systems", Picture Coding Symp., 2004-Dec.
4.
X. Guo and Q. Huang, "Multiviewvideo coding based on global motion model", Lecture Notes Comput. Sci., vol. 3333, pp. 665-672, Dec. 2004.
5.
G. Li and Y. He, "Anovel multiview video coding scheme based on H.264", Proc. ICICS-PCM 2003, pp. 493-497, 2003-Dec.
6.
K. Mueller, P. Merkle, A. Smolic and T. Wiegand, "Multiview coding using AVC", MPEG2006/M12945 75th MPEG Meeting, 2006-Jan.
7.
Y.-L. Lee, J.-H. Hur, D.-Y. Kim and Y.-K. Lee, "H.264/MPEG-4 AVC-based multiview video coding", MPEG2006/M12871 75th MPEG Meeting, 2006-Jan.
8.
J.-R. Ohm, "Submissions received in CfP on multiview video coding", MPEG2006/M12969 75th MPEG Meeting, 2006-Jan.
9.
W. Yang, Y. Lu, F. Wu, J. Cai, K. Ngan and S. Li, "4-D wavelet-based multiview video coding", IEEE Trans. Circuits Syst. Video Technol., vol. 16, Nov. 11.
10.
M. Siegel, S. Sethuraman, J. McVeigh and A. Jordan, "Compression and interpolation of 3-D stereoscopic and multiviewvideo", Proc. SPIE Stereoscop. Displ. Virtual Reality Syst., vol. 3012, pp. 227-238, 1997.
11.
B. L. Tseng and D. Anastassiou, "Multiviewpoint video coding with MPEG-2compatibility", IEEE Trans. Circuits Syst. Video Technol., vol. 6, no. 4, pp. 414-419, Jul. 1996.
12.
D. V. Papadimitrion and T. J. Dennis, "Three dimensional parameter estimation from stereo imagesequences from model-based image coding", Signal Process.: Image Commun., vol. 7, pp. 471-487, 1995.
13.
R. S. Wang and Y. Wang, "Multiview video sequence analysis compression and virtualviewpoint synthesis", IEEE Trans. Circuits Syst. Video Technol., vol. 10, pp. 397-410, Apr. 2000.
14.
F. Dufaux and J. Konrad, "Efficient robust and fast global motionestimation for video coding", IEEE Trans. Image Process., vol. 9, no. 3, pp. 497-501, Mar. 2000.
15.
T. Wiegand and B. Girod, "Lagrange multiplier selection in hybrid video coder control", Proc. Int. Conf. Image Process., vol. 3, pp. 542-545, 2001.
16.
J. Xu, Z. Xiong, S. Li and Y.-Q. Zhang, "Three-dimensional embeddedsubband coding with optimized truncation (3-D Escot)", J. Appl. Comput. Harmonic Anal., vol. 10, pp. 290-315, May 2001.
17.
R. Xiong, F. Wu, J. Xu, S. Li and Y.-Q. Zhang, "Barbell lifting wavelet transform for highlyscalable video coding", Picture Coding Symp. 2004, 2004-Dec.
18.
R. Kawada, "KDDI multiview video sequences for MPEG 3-DAV use", MPEG2004/M10533 68th MPEG Meeting, 2004-Mar.
Contact IEEE to Subscribe

References

References is not available for this document.