1. Introduction
Virtual try-on, which lets users try on clothes virtually to check their size or style, has substantial commercial value and has attracted extensive attention in computer vision. Many virtual try-on systems [13, 38] have been proposed and achieve promising results when the pose is fixed. However, these approaches usually learn to synthesize the image conditioned on the clothes only. When given a different pose, they tend to synthesize blurry images that lose most of the details and style, as shown in Figure 4.
Some results of our model. The clothing and pose images are shown in the first row, while the person images are shown in the first column. The results, manipulated by both clothes and pose, are shown in the remaining columns.