I. Introduction
Semantic segmentation is a crucial task in computer vision, aiming to assign predefined semantic labels to each pixel in an image and providing a higher-level representation of the image [1]. Traditional segmentation methods based only on RGB images are sensitive to lighting conditions and lack robustness in challenging low-light environments. Consequently, researchers have proposed fusion methods for thermal (T) and RGB images, utilizing complementary information to enhance segmentation performance in challenging low-light scenarios. The proposal of the RGB-T semantic segmentation task also promotes further development in instrument measurement tasks.