I. Introduction
Person Re-identification (ReID) aims to search people across non-overlapping surveillance camera views deployed at different locations by matching person images. It has been widely studied in computer vision and pattern recognition communities due to its importance in many safety-critical applications, such as automated video surveillance and forensics including multi-camera tracking, crowded counting and multi-camera activity analysis [1]. Some early approaches used low-level features such as colors, shapes and local descriptors for person appearance representation and employed distance metric learning models as matching functions. With the great success convolutional neural networks (CNN) achieved in many computer vision tasks [2]–[5], deep learning methods have dominated this community, with convincing superiority against hand-crafted competitors [6].