Abstract:
The visual quality of point clouds plays a crucial role in the development and broadcasting of immersive media. Therefore, investigating point cloud quality assessment (P...Show MoreMetadata
Abstract:
The visual quality of point clouds plays a crucial role in the development and broadcasting of immersive media. Therefore, investigating point cloud quality assessment (PCQA) is instrumental in facilitating immersive media applications, including virtual reality and augmented reality applications. Considering reference point clouds are not available in many cases, no-reference (NR) metrics have become a research hotspot. Existing NR methods suffer from difficult training. To address this shortcoming, we propose a novel NR-PCQA method, Point Cloud Quality Assessment via Domain-relevance Degradation Description (D3-PCQA). First, we demonstrate our model’s interpretability by deriving the function of each module using a kernelized ridge regression model. Specifically, quality assessment can be characterized as a leap from the scattered perceptual domain (reflecting subjective perception) to the ordered quality domain (reflecting mean opinion score). Second, to reduce the significant domain discrepancy, we establish an intermediate domain, the description domain, based on insights from the human visual system (HVS), by considering the domain relevance among samples located in the perception domain and learning a structured latent space. The anchor features derived from the learned latent space are generated as cross-domain auxiliary information to promote domain transformation. Furthermore, the newly established description domain decomposes the NR-PCQA problem into two relevant stages. These stages include a classification stage that gives the degradation descriptions to point clouds and a regression stage to determine the confidence degrees of descriptions, providing a semantic explanation for the predicted quality scores. Experimental results demonstrate that D3-PCQA exhibits robust performance and outstanding generalization on several publicly available datasets.
Published in: IEEE Transactions on Broadcasting ( Early Access )