I. Introduction
Background subtraction (BgS), or change detection, is a fundamental branch of semantic segmentation that handles the identification of changing or moving entities in the field of view of a visual camera sensor. In particular, models in BgS seek to classify regions of semantically meaningful changes (e.g., walking people and moving vehicles) as foreground (FG), and classify static regions (e.g., buildings and roads) or nonsemantic motions (e.g., wavy water and rain droplets) as background (BG). In practice, BgS methods have been utilized as an important step in intelligent surveillance systems such as traffic analyses, motion segmentation, or behavior analysis [1], [2]. Hence, accuracy and generalization are stressed as essential properties for BgS adoption across various visual challenges, such as dynamic backgrounds, camera motion, illumination changes, dual-camera sensors, camouflaged objects, and especially on unseen scenarios (or scene-agnostic evaluation). Because the processed results of BgS are typically pipelined toward higher-level analytic components (e.g., tracking and activity recognition), any erroneous predictions would significantly limit practical applicability.