I. Introduction
Weakly supervised semantic segmentation (WSSS) approaches attempt to learn pixel-level classification labels from weak data annotations. Instead of training an end-to-end model from scratch, WSSS approaches typically refine the outputs of a strong, pretrained classifier (post hoc). As shown in Fig. 1, post hoc approaches for WSSS typically operate in four stages.