Conferences >ICASSP 2024 - 2024 IEEE Inter...

Volumetric 3d Point Cloud Attribute Compression: Learned Polynomial Bilateral Filter for Prediction

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

We extend a previous study on 3D point cloud attribute compression scheme that uses a volumetric approach: given a target volumetric attribute function f : ℝ3 ↦ ℝ, we qua...Show More

Metadata

Abstract:

We extend a previous study on 3D point cloud attribute compression scheme that uses a volumetric approach: given a target volumetric attribute function f : ℝ³ ↦ ℝ, we quantize and encode parameters θ that characterize f at the encoder, for reconstruction

${f_{\hat \theta }}({\mathbf{x}})$ at known 3D points x at the decoder. Specifically, parameters

$\hat \theta$ are quantized coefficients of B-spline basis vectors Φl (for order p ≥ 2) that span the function space

$\mathcal{F}_l^{(p)}$ at a particular resolution l, which are coded from coarse to fine resolutions for scalability. In this work, we focus on the prediction of finer-grained coefficients given coarser-grained ones by learning parameters of a polynomial bilateral filter (PBF) from data. PBF is a pseudo-linear filter that is signal-dependent with a graph spectral interpretation common in the graph signal processing (GSP) field. We demonstrate PBF’s predictive performance over a linear predictor inspired by MPEG standardization over a wide range of point cloud datasets.

Published in: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Date of Conference: 14-19 April 2024

Date Added to IEEE Xplore: 18 March 2024

ISBN Information:

ISSN Information:

DOI: 10.1109/ICASSP48485.2024.10445884

Conference Location: Seoul, Korea, Republic of

Contents

1. INTRODUCTION

We study lossy compression of 3D point cloud attributes from a volumetric function approach. Specifically, assuming point cloud geometry is first encoded, we encode point cloud attributes, given that encoded geometry is known both at encoder and decoder—this is the dominant approach both in research [1] – [7] and in MPEG geometry-based point cloud compression (G-PCC) standard [8] – [11]. Mathematically, given known 3D locations x_i ∈ ℝ³ both at the encoder and decoder, we encode quantized parameters for a target volumentric attribute function f : ℝ³ ↦ ℝ from coarse to fine resolutions for scalability, so that it can be evaluated as at x_i at the decoder for signal reconstruction. [3], [12] proposed such a framework using volumetric B-spline basis functions Φ_l of order p = 1 that span a nested sequence of function spaces —called Region Adaptive Hierarchical Transform (RAHT(1))—and now forms the core in MPEG G-PCC [6]. Recently, [13] extended this framework to B-spline basis functions of order p ≥ 2(RAHT(p)) and demonstrated state-of-the-art (SOTA) coding performance. Moreover, the feedforward network obtained by unrolling a finite-term Taylor’s series of a matrix inverse to compute orthonormalized RAHT(p) coefficients is amenable to end-to-end data-driven parameter tuning. The goal of this paper is to further improve performance by designing a predictor for finer-grained unnormalized coefficients given coarser-grained coefficients , and training.

References is not available for this document.

Volumetric 3d Point Cloud Attribute Compression: Learned Polynomial Bilateral Filter for Prediction

Abstract:

Metadata

Abstract:

ISSN Information:

1. INTRODUCTION

References

IEEE Account

Purchase Details

Profile Information

Need Help?

Volumetric 3d Point Cloud Attribute Compression: Learned Polynomial Bilateral Filter for Prediction

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

1. INTRODUCTION

References