I. Introduction
Visual object tracking is a fundamental task in computer vision. Given a video or an image sequence and a region of interest, usually specified by a rectangular bounding box (bbox), the tracker must predict the position of the target in the subsequent frames. A tracker is usually one component of a large-scale vision system, so the target can be arbitrary; that is, the category of the target is not known a priori to the tracking algorithm.
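To make the task definition concrete, the following minimal sketch illustrates the input/output contract of a class-agnostic tracker. It is an illustration only, not a method from this paper: the names `StaticTracker` and `run_tracker`, the `(x, y, width, height)` box convention, and the assumption that the region of interest is given on the first frame are all hypothetical choices made for the example.

```python
from typing import Iterable, List, Tuple

# (x, y, width, height) in pixels; this box convention is an assumption.
BBox = Tuple[float, float, float, float]


class StaticTracker:
    """Hypothetical placeholder tracker: always returns the initial box.

    A real tracker would build an appearance model of the region in
    `init` and search for that region in `update`.
    """

    def init(self, frame, bbox: BBox) -> None:
        # Only a box is supplied; no category label is given.
        self._bbox = bbox

    def update(self, frame) -> BBox:
        # Predict the target position in a subsequent frame.
        return self._bbox


def run_tracker(tracker, frames: Iterable, init_bbox: BBox) -> List[BBox]:
    """Initialise on the first frame, then predict one box per later frame."""
    frames = iter(frames)
    first = next(frames)
    tracker.init(first, init_bbox)
    return [tracker.update(frame) for frame in frames]
```

Note that the interface never receives a class label: the tracker sees only pixels and one box, which is what allows the target to be arbitrary.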