I. Introduction
Medical image segmentation plays a crucial role in assisting clinicians with the quantitative analysis of pathological features. While deep learning has demonstrated effectiveness in image segmentation, its application in the medical field presents many challenges. The annotation in medical images typically requires specialized knowledge, making it difficult to obtain a large amount of labeled data. Many efforts have been made to reduce the dependence on labeled data. Techniques like semi-supervised learning [1], [2] and foundation models [3], [4] have attracted much attention and can improve segmentation performance when annotated data is scarce.