I. Introduction
Human-centered pose estimation, such as gaze estimation, hand pose estimation, and human body posture estimation, has attracted considerable research interest [1]–[5], as it can greatly facilitate many computer vision tasks such as human-computer interaction [6], action recognition [7], and person re-identification [8]. In this paper, we concentrate on the challenging problem of multi-person pose estimation, which aims to recover the 2D pose of every person in an image, with each pose represented by a set of anatomical keypoints (or body joints) [9], [10].