I. Introduction
Recent generative models are developed with a growing emphasis on universality, aiming to enhance the real-world applicability in solving complex challenges [1], [2], [3]. Significant advances have been made in visual domain translation [4], [5], [6], [7], [8], which harnesses the transformation of images by exploiting inherent content correlations across disparate realms. The defining feature of universality for domain translators is their ability to seamlessly convert images from any real-world source domain to a chosen target domain. This pursuit of universality in visual domain translation erodes the barriers segregating different domains and provides invaluable technological support for a wide range of applications. These span from artistic creations such as anthropomorphic or skeuomorphic designs to the entertainment industry, including customized effect generation on various platforms.