Efficient Hierarchical Entropy Model for Learned Point Cloud Compression | IEEE Conference Publication | IEEE Xplore

Efficient Hierarchical Entropy Model for Learned Point Cloud Compression


Abstract:

Learning an accurate entropy model is a fundamental way to remove the redundancy in point cloud compression. Recently, the octree-based auto-regressive entropy model whic...Show More

Abstract:

Learning an accurate entropy model is a fundamental way to remove the redundancy in point cloud compression. Recently, the octree-based auto-regressive entropy model which adopts the self-attention mechanism to explore dependencies in a large-scale context is proved to be promising. However, heavy global attention computations and auto-regressive contexts are inefficient for practical applications. To improve the efficiency of the attention model, we propose a hierarchical attention structure that has a linear complexity to the context scale and maintains the global receptive field. Furthermore, we present a grouped context structure to address the serial decoding issue caused by the auto-regression while preserving the compression performance. Experiments demonstrate that the proposed entropy model achieves superior rate-distortion performance and significant decoding latency reduction compared with the state-of-the-art large-scale auto-regressive entropy model.
Date of Conference: 17-24 June 2023
Date Added to IEEE Xplore: 22 August 2023
ISBN Information:

ISSN Information:

Conference Location: Vancouver, BC, Canada
References is not available for this document.

1. Introduction

Point cloud is a fundamental data structure to represent 3D scenes. It has been widely applied in 3D vision systems such as autonomous driving and immersive applications. The large-scale point cloud typically contains millions of points [36]. It is challenging to store and transmit such massive data. Hence, efficient point cloud compression that reduces memory footprints and transmission bandwidth is necessary to develop practical point cloud applications.

Select All
1.
Johannes Balle, Valero Laparra and P Simoncelli Eero, "End-to-end optimized image compression", International Con-ference on Learning Representations, 2017.
2.
Johannes Balle, David Minnen, Saurabh Singh, Sung Jin Hwang and Nick Johnston, "Variational image compression with a scale hyperprior", International Conference on Learning Representations, 2018.
3.
Jens Behley, Martin Garbade, Andres Milioto, Jan Quen-zel, Sven Behnke, Cyrill Stachniss, et al., "Se-mantickitti: A dataset for semantic scene understanding of lidar sequences", Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 9297-9307, 2019.
4.
Sourav Biswas, Jerry Liu, Kelvin Wong, Shenlong Wang and Raquel Urtasun, "Muscle: Multi sweep compression of lidar using deep entropy models", Advances in Neural In-formation Processing Systems, pp. 22170-22181, 2020.
5.
Zhengxue Cheng, Heming Sun, Masaru Takeuchi and Jiro Katto, "Learned image compression with discretized gaussian mixture likelihoods and attention modules", Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 7939-7948, 2020.
6.
"MPEG 3D Graphics Coding", Common test conditions for G-PCC. ISO/IEC JTC1/SC29/WG7 N00106, 2021.
7.
"MPEG 3D Graphics Coding", Preliminary dataset for ai-based point cloud experiments. ISO/IEC JTC1/SC29IWG7 W21570, 2022.
8.
Haoqi Fan, Bo Xiong, Karttikeya Mangalam, Yanghao Li, Zhicheng Yan, Jitendra Malik, et al., "Multiscale vision transformers", Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 6824-6835, 2021.
9.
Guangchi Fang, Qingyong Hu, Hanyun Wang, Yiling Xu and Yulan Guo, "3dac: Learning attribute compression for point clouds", Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 14819-14828, 2022.
10.
Chunyang Fu, Ge Li, Rui Song, Wei Gao and Shan Liu, "Oc-tattention: Octree-based large-scale contexts model for point cloud compression", Proceedings of the AAAI Conference on Artificial Intelligence, 2022.
11.
Diogo C Garcia, Tiago A Fonseca, Renan U Ferreira and Ricardo L De Queiroz, "Geometry coding for dynamic vox-elized point clouds using octrees and multiple contexts", IEEE Transactions on Image Processing, vol. 29, pp. 313-322, 2019.
12.
MPEG Group. Mpeg G-PCC tmc13, 2021, [online] Available: https://github.com/MPEGGroup/mpeG-PCC-tmc13.
13.
Dailan He, Ziming Yang, Weikun Peng, Rui Ma, Hongwei Qin and Yan Wang, "Elic: Efficient learned image compression with unevenly grouped space-channel contextual adaptive coding", Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 5718-5727, 2022.
14.
Dailan He, Yaoyan Zheng, Baocheng Sun, Yan Wang and Hongwei Qin, "Checkerboard context model for efficient learned image compression", Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 14771-14780, 2021.
15.
Yun He, Xinlin Ren, Danhang Tang, Yinda Zhang, Xi-angyang Xue and Yanwei Fu, "Density-preserving deep point cloud compression", Proceedings of the IEEE/CVF Con-ference on Computer Vision and Pattern Recognition, pp. 2333-2342, 2022.
16.
Lila Huang, Shenlong Wang, Kelvin Wong, Jerry Liu and Raquel Urtasun, "Octsqueeze: Octreestructured en-tropy model for lidar compression", Proceedings of the IEEE/CVF conference on Computer Vision and Pattern Recognition, pp. 1313-1323, 2020.
17.
Tianxin Huang and Yong Liu, "3d point cloud geometry compression on deep learning", Proceedings of the 27th ACM international conference on multimedia, pp. 890-898, 2019.
18.
Emre Can Kaya and Ioan Tabus, "Neural network modeling of probabilities for coding the octree representation of point clouds", 2021 IEEE 23rd International Workshop on Mul-timedia Signal Processing, 2021.
19.
Jun-Hyuk Kim, Byeongho Heo and Jong-Seok Lee, "Joint global and local hierarchical priors for learned image compression", Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 5982-5991, 2022.
20.
Diederik P Kingma and Jimmy Ba, "Adam: A method for stochastic optimization", International Conference on Learning Representations, 2015.
21.
Jooyoung Lee, Seunghyun Cho and Seung-Kwon Beack, "Context-adaptive entropy model for end-to-end optimized image compression", International Conference on Learning Representations, 2019.
22.
Ze Liu, Han Hu, Yutong Lin, Zhuliang Yao, Zhenda Xie, Yixuan Wei, Jia Ning, Yue Cao, Zheng Zhang, Li Dong et al., "Swin transformer v2: Scaling up capacity and resolution", Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 12009-12019, 2022.
23.
Ze Liu, Yutong Lin, Yue Cao, Han Hu, Yixuan Wei, Zheng Zhang, et al., "Swin transformer: Hierarchical vision transformer using shifted windows", Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 10012-10022, 2021.
24.
Donald Meagher, "Geometric modeling using octree encoding", Computer graphics and image processing, vol. 19, no. 2, pp. 129-147, 1982.
25.
Rufael Mekuria, Kees Blom and Pablo Cesar, "Design im-plementation and evaluation of a point cloud codec for tele-immersive video", IEEE Transactions on Circuits and Systemsfor Video Technology, vol. 27, no. 4, pp. 828-842, 2016.
26.
Fabian Mentzer, Eirikur Agustsson, Michael Tschannen, Radu Timofte and Luc Van Gool, "Conditional probability models for deep image compression", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4394-4402, 2018.
27.
Fabian Mentzer, George D Toderici, Michael Tschannen and Eirikur Agustsson, "High-fidelity generative image compression", Advances in Neural Information Processing Systems, pp. 11913-11924, 2020.
28.
David Minnen, Johannes Balle and George D Toderici, "Joint autoregressive and hierarchical priors for learned image compression", Advances in neural information processing systems, vol. 31, 2018.
29.
David Minnen and Saurabh Singh, "Channel-wise autoregres-sive entropy models for learned image compression", 2020 IEEE International Conference on Image Processing, pp. 3339-3343, 2020.
30.
Dat Thanh Nguyen and Andre Kaup, Learning-based loss-less point cloud geometry coding using sparse representations, 2022.
Contact IEEE to Subscribe

References

References is not available for this document.