Abstract:
Recent advancements in weakly supervised semantic segmentation (WSSS) methods have focused on incorporating contextual knowledge to enhance the completeness of class acti...Show MoreMetadata
Abstract:
Recent advancements in weakly supervised semantic segmentation (WSSS) methods have focused on incorporating contextual knowledge to enhance the completeness of class activation maps (CAMs). Inspired by prototype learning, we introduce a novel approach that capitalizes on the diverse and fine-grained features of instances, as well as the non-discriminative features of background context. However, traditional clustering methods, commonly employed in prototype-based techniques to generate feature centroids, often lack sensitivity to noisy features. To address the limitation, that CAMs predominantly emphasize discriminative features, we propose a Gaussian Mixture Model Bootstrap Prototype Aware Learning (GMM-BPA) strategy. This enriches instance comprehension by integrating instance attributes and semantic context through a Bootstrap Prototype Aware CAM (BPACAM). The core of our method lies in capturing instance knowledge through attributes, which are learned from class support banks and background support banks modeled as Gaussian distributions. Extensive experiments conducted on two standard WSSS benchmarks, PASCAL VOC and MS COCO, demonstrate that GMM-BPA outperforms existing methods and achieves state-of-the-art performance.
Published in: ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Date of Conference: 06-11 April 2025
Date Added to IEEE Xplore: 07 March 2025
ISBN Information:
ISSN Information:
SenseTime Research, SenseTime Group Inc, Shanghai, China
SenseTime Research, SenseTime Group Inc, Shanghai, China