I. Introduction
In computer vision, face frontalization technology plays an increasingly important role in today's society [1], such as facial recognition, expression synthesis, style migration, etc., and it has always been a hotspot of research. Although its technology has made remarkable progress, effectively dealing with the variation of large gestures of faces is still a significant challenge within the industry [2]. For example, traditional face recognition methods are usually performed in experimental environments that rely on frontal face images. In real-world environments, however, faces often present multiple angles due to different head postures, and the texture information for their facial recognition is not as recognizable as frontal facial texture information [3], along with problems such as facial deformation of the face and displacement of the five views. This directly affects the accuracy and robustness of the recognition system.