I. Introduction
Vehicle re-identification (Re-ID) has very important applications in video surveillance and intelligent transportation, aiming to match the vehicle images of the same identity across non-overlapped camera views [1], [2], [3], [4], [5], [6], [7], [8]. Compared with person re-identification, it is more challenging due to the small inter-class similarity and large intra-class variation of vehicles [9]. Currently, vehicle re-identification methods based on deep learning have made great advances, but most of them focused on RGB single modality alone [10], [11], [12], [13]. In environments lacking ambient light, it is difficult to obtain effective information only with RGB cameras, which largely limits the scalability of Re-ID models in real-world scenarios. Fig. 1 shows the comparison of vehicle images of three modalities. Generally, Near Infrared (NIR) and Thermal Infrared (TIR) images are robust to lighting variations and provide complementary information to RGB images. With the complementary of NIR and TIR to RGB, we can effectively solve the problem of insufficient lighting. In addition, as the cost of capturing NIR and TIR images has decreased, it is easy to obtain NIR and TIR modality information. However, most current vehicle re-identification methods are unable to handle other modality information except RGB, and thus are insufficient to solve the RGB, NIR, and TIR modalities i.e. multi-modality vehicle re-identification task. Therefore, it is necessary to study the multi-modality vehicle re-identification.
Comparison of vehicle images of different modalities. (a) RGB modality. (b) NIR modality. (c) TIR modality. (d) mixed modality.