I. Introduction
In recent years, underwater robots have been widely used in underwater measurement, manipulation, and observation, with the deepening of ocean exploration [1]. However, to achieve autonomous operation, a crucial requirement is to enhance the perception capabilities of underwater robots in the underwater environment. In the field of computer vision, most visual perception algorithms heavily rely on extensive datasets, especially with the emergence of deep learning-based methods in recent years. Unfortunately, due to the unique characteristics of the underwater environment, collecting underwater data is costly, resulting in a limited availability of effective underwater datasets. Consequently, visual perception of the underwater environment remains a challenging problem that requires further attention and research.