I. Introduction
Different from the task of object localization under full supervision, weakly supervised object localization (WSOL) aims to infer the position information of the target in an image by utilizing image-level annotation. Since category-related annotations are easier to obtain, WSOL can be implemented without intensive annotations, reducing the high cost of labeling and gaining much attention.