I. Introduction
Fine-grained vehicle recognition is an important subject in computer vision field, which indicates that different subtypes in the same large class and recognize the category information of vehicles should be recognized, such as vehicle make, vehicle model or manufacture year. For example, we need to retrieve “Rolls-Royce Phantom” from numerous images of different vehicle models, as Fig. 1 shown. There are two difficulties of fine-grained vehicle recognition. One is the subtle granularity of images. Another is the huge differences in morphology, posture, color and background in fine-grained images. With the continuous improvement of the living standard of modern society and the rapid growth of vehicles' number, traffic supervision is facing great challenges. As an important point of traffic supervision, video surveillance system has been widely used in various fields of modern traffic [1], [2]. However, the traditional way of relying on manual interpretation has been unable to meet the needs of mass traffic video processing nowadays. Building intelligent recognition system to automatically process all kinds of traffic video information is an inevitable trend. The identification of vehicle types in traffic video images, as one of the key technologies in the construction, has been widely noted by researchers at home and abroad for a long time.
Fine-grained vehicle image.