Loading [MathJax]/extensions/MathMenu.js
Recurrent Diffusion for 3D Point Cloud Generation From a Single Image | IEEE Journals & Magazine | IEEE Xplore

Recurrent Diffusion for 3D Point Cloud Generation From a Single Image


Abstract:

Single-image 3D shape reconstruction has attracted significant attention with the advance of generative models. Recent studies have utilized diffusion models to achieve u...Show More

Abstract:

Single-image 3D shape reconstruction has attracted significant attention with the advance of generative models. Recent studies have utilized diffusion models to achieve unprecedented shape reconstruction quality. However, these methods, in each sampling step, perform denoising in a single forward pass, leading to cumulative errors that severely impact the geometric consistency of the generated shapes with the input targets and face difficulties in reconstructing rich details of complex 3D shapes. Moreover, the performance of current works suffers significant degradation due to limited information when only a single image is used as input during testing, further affecting the quality of 3D shape generation. In this paper, we present a recurrent diffusion framework, aiming to improve generation performance during single image-to-shape inference. Diverging from denoising in a single forward pass, we recursively refine the noise prediction in a self-rectified manner with the explicit guidance of the input target, thereby markedly suppressing cumulative errors and improving detail modeling. To enhance the geometric perception ability of the network during single-image inference, we further introduce a multi-view training scheme equipped with a view-robust conditional generation mechanism, which effectively promotes generation quality even when only a single image is available during inference. The effectiveness of our method is demonstrated through extensive evaluations on two public 3D shape datasets, where it surpasses state-of-the-art methods both qualitatively and quantitatively.
Published in: IEEE Transactions on Image Processing ( Volume: 34)
Page(s): 1753 - 1765
Date of Publication: 27 February 2025

ISSN Information:

PubMed ID: 40031553

Funding Agency:


I. Introduction

Single-IMAGE 3D point cloud generation from a single image is one of the most critical fields in computer vision, which has a wide range of applications such as autonomous driving, robot navigation, and augmented/virtual reality. Its goal is to recover plausible 3D geometric shapes with the limited information from the single-view observation. This task is challenging due to the inherent lack of crucial 3D information, such as depth and viewpoint [1]. Additionally, issues like blurriness and occlusions further compound the difficulty of this task.

Contact IEEE to Subscribe

References

References is not available for this document.