Loading [MathJax]/extensions/MathMenu.js
A Two-Stage Personalized Virtual Try-On Framework With Shape Control and Texture Guidance | IEEE Journals & Magazine | IEEE Xplore

A Two-Stage Personalized Virtual Try-On Framework With Shape Control and Texture Guidance


Abstract:

The Diffusion model has a strong ability to generate wild images. However, the model can just generate inaccurate images with the guidance of text, which makes it very ch...Show More

Abstract:

The Diffusion model has a strong ability to generate wild images. However, the model can just generate inaccurate images with the guidance of text, which makes it very challenging to directly apply the text-guided generative model for virtual try-on scenarios. Taking images as guiding conditions of the diffusion model, this paper proposes a brand new personalized virtual try-on model (PE-VITON), which uses the two stages (shape control and texture guidance) to decouple the clothing attributes. Specifically, the proposed model adaptively matches the clothing to human body parts through the Shape Control Module (SCM) to mitigate the misalignment of the clothing and the human body parts. The semantic information of the input clothing is parsed by the Texture Guided Module (TGM), and the corresponding texture is generated by directional guidance. Therefore, this model can effectively solve the problems of weak reduction of clothing folds, poor generation effect under complex human posture, blurred edges of clothing, and unclear texture styles in traditional try-on methods. Meanwhile, the model can automatically enhance the generated clothing folds and textures according to the human posture, and improve the authenticity of the virtual try-on. In this paper, qualitative and quantitative experiments are carried out on high-resolution paired and unpaired datasets, the results show that the proposed model outperforms the state-of-the-art model.
Published in: IEEE Transactions on Multimedia ( Volume: 26)
Page(s): 10225 - 10236
Date of Publication: 31 May 2024

ISSN Information:


I. Introduction

Recently, Denoising Diffusion Probabilistic Model (DDPM) [1] and score-based generative models [2], [3] have shown great capabilities in image generation tasks (Fig. 1). Compared with traditional generative models based on GAN [4], [5], [6], [7] and VAE [8], these models can achieve the generation of a series of realistic style images, which have been widely recognized in various fields [1], [9].

Contact IEEE to Subscribe

References

References is not available for this document.