Self and Channel Attention Network for Person Re-Identification | IEEE Conference Publication | IEEE Xplore

Self and Channel Attention Network for Person Re-Identification


Abstract:

Recent research has shown promising results for person re-identification by focusing on several trends. One is designing efficient metric learning loss functions such as ...Show More

Abstract:

Recent research has shown promising results for person re-identification by focusing on several trends. One is designing efficient metric learning loss functions such as triplet loss family to learn the most discriminative representations. The other is learning local features by designing part based architectures to form an informative descriptor from semantically coherent parts. Some efforts adjust distant outliers to their most similar positions by using soft attention and learn the relationship between distant similar features. However, only a few prior efforts focus on channel-wise dependencies and learn non-local sharp similar part features directly for the degraded data in the person re-identification task. In this paper, we propose a novel Self and Channel Attention Network (SCAN) to model long-range dependencies between channels and feature maps. We add multiple classifiers to learn discriminative global features by using classification loss. Self Attention (SA) module and Channel Attention (CA) module are introduced to model non-local and channel-wise dependencies in the learned features. Spectral normalization is applied to the whole network to stabilize the training process. Experimental results on the person re-identification benchmarks show the proposed components achieve significant improvement with respect to the baseline.
Date of Conference: 10-15 January 2021
Date Added to IEEE Xplore: 05 May 2021
ISBN Information:
Print on Demand(PoD) ISSN: 1051-4651
Conference Location: Milan, Italy

Funding Agency:

References is not available for this document.

I. Introduction

Person re-identification (re-id) is a task to retrieve the same person images from gallery sets of non overlapping cameras, given an image of a person of interest from another camera. The task of re-id is gaining increasing importance as it is an essential component of intelligent surveillance systems [1], [2]. The variations like illumination, human pose, view angle, resolution, occlusions, clothing and background in the images make re-id a very challenging task. With the advancement of deep learning and neural networks, ConvNets [3]–[4], [5], well designed for image classification tasks, are performing well in re- id as they provide impressive feature representations of person images. Due to their discriminative representation capability, they outperform the traditional handcrafted low-level features by a large margin. The difference between re- id and image classification tasks is that the training and testing classes (i.e. person identities) are different in re-id. Therefore, the re-id task requires more discriminative feature representations to distinguish unseen similar images.

Select All
1.
C. Micheloni, P. Remagnino, H.-L. Eng and J. Geng, "Intelligent monitoring of complex environments", IEEE Intelligent Systems, 2010.
2.
J. Chen, K. Li, Q. Deng, K. Li and S. Y. Philip, "Distributed deep learning model for intelligent video surveillance systems with edge computing", IEEE Transactions on Industrial Informatics, 2019.
3.
A. Krizhevsky, I. Sutskever and G. E. Hinton, "Imagenet classification with deep convolutional neural networks", NIPS, 2012.
4.
K. Simonyan and A. Zisserman, Very deep convolutional networks for large-scale image recognition, 2014.
5.
K. He, X. Zhang, S. Ren and J. Sun, "Deep residual learning for image recognition", CVPR, 2016.
6.
N. Martinel, G. Luca Foresti and C. Micheloni, "Aggregating deep pyramidal representations for person re-identification", CVPRW, 2019.
7.
A. Hermans, L. Beyer and B. Leibe, In defense of the triplet loss for person re-identification, 2017.
8.
W. Chen, X. Chen, J. Zhang and K. Huang, "Beyond triplet loss: a deep quadruplet network for person re-identification", CVPR, 2017.
9.
Y. Zhai, X. Guo, Y. Lu and H. Li, "In defense of the classification loss for person re-identification", CVPRW, 2019.
10.
Z. Zheng, L. Zheng and Y. Yang, "Unlabeled samples generated by gan improve the person re-identification baseline in vitro", ICCV, 2017.
11.
A. Munir, N. Martinel and C. Micheloni, "Multi branch siamese network for person re-identification", ICIP, 2020.
12.
H. Zhao, M. Tian, S. Sun, J. Shao, J. Yan, S. Yi, et al., "Spindle net: Person re-identification with human body region guided feature decomposition and fusion", CVPR, 2017.
13.
L. Zhao, X. Li, Y. Zhuang and J. Wang, "Deeply-learned part-aligned representations for person re-identification", ICCV, 2017.
14.
Y. Sun, L. Zheng, Y. Yang, Q. Tian and S. Wang, "Beyond part models: Person retrieval with refined part pooling (and a strong convolutional baseline)", ECCV, 2018.
15.
X. Zhang, H. Luo, X. Fan, W. Xiang, Y. Sun, Q. Xiao, et al., Alignedreid: Surpassing human-level performance in person re-identification, 2017.
16.
X. Liu, H. Zhao, M. Tian, L. Sheng, J. Shao, S. Yi, et al., "Hydraplus-net: Attentive deep features for pedestrian analysis", ICCV, 2017.
17.
J. Xu, R. Zhao, F. Zhu, H. Wang and W. Ouyang, "Attention-aware compositional network for person re- identification", CVPR, 2018.
18.
W. Li, X. Zhu and S. Gong, "Harmonious attention network for person re-identification", CVPR, 2018.
19.
M. Zheng, S. Karanam, Z. Wu and R. J. Radke, "Re-identification with consistent attentive siamese networks", CVPR, 2019.
20.
A. Munir and C. Micheloni, "Self attention based multi branch network for person re-identification", SpliTech, 2020.
21.
H. Zhang, I. Goodfellow, D. Metaxas and A. Odena, "Self-attention generative adversarial networks", ICML, 2019.
22.
L. Bazzani, M. Cristani, A. Perina, M. Farenzena and V. Murino, "Multiple-shot person re-identification by hpe signature", ICPR, 2010.
23.
A. Das, A. Chakraborty and A. K. Roy-Chowdhury, "Consistent reidentification in a camera network", ECCV, 2014.
24.
S. Pedagadi, J. Orwell, S. Velastin and B. Boghossian, "Local fisher discriminant analysis for pedestrian re-identification", CVPR, 2013.
25.
S. Ding, L. Lin, G. Wang and H. Chao, "Deep feature learning with relative distance comparison for person re-identification", Pattern Recognition, vol. 48, no. 10, pp. 2993-3003, 2015.
26.
R. Yu, Z. Dou, S. Bai, Z. Zhang, Y. Xu and X. Bai, "Hard-aware point-to-set deep metric for person re-identification", ECCV, 2018.
27.
E. Ahmed, M. Jones and T. K. Marks, "An improved deep learning architecture for person re-identification", CVPR, 2015.
28.
W. Li, R. Zhao, T. Xiao and X. Wang, "Deepreid: Deep filter pairing neural network for person re-identification", CVPR, 2014.
29.
R. R. Varior, B. Shuai, J. Lu, D. Xu and G. Wang, "A siamese long short-term memory architecture for human re-identification", ECCV, 2016.
30.
J. Dai, Y. Li, K. He and J. Sun, "R-fcn: Object detection via region-based fully convolutional networks", NIPS, 2016.
Contact IEEE to Subscribe

References

References is not available for this document.