Loading [a11y]/accessibility-menu.js
Weakly-Supervised Domain Adaptation of Deep Regression Trackers via Reinforced Knowledge Distillation | IEEE Journals & Magazine | IEEE Xplore

Weakly-Supervised Domain Adaptation of Deep Regression Trackers via Reinforced Knowledge Distillation


Abstract:

Deep regression trackers are among the fastest tracking algorithms available, and therefore suitable for real-time robotic applications. However, their accuracy is inadeq...Show More

Abstract:

Deep regression trackers are among the fastest tracking algorithms available, and therefore suitable for real-time robotic applications. However, their accuracy is inadequate in many domains due to distribution shift and overfitting. In this letter we overcome such limitations by presenting the first methodology for domain adaption of such a class of trackers. To reduce the labeling effort we propose a weakly-supervised adaptation strategy, in which reinforcement learning is used to express weak supervision as a scalar application-dependent and temporally-delayed feedback. At the same time, knowledge distillation is employed to guarantee learning stability and to compress and transfer knowledge from more powerful but slower trackers. Extensive experiments on five different robotic vision domains demonstrate the relevance of our methodology. Real-time speed is achieved on embedded devices and on machines without GPUs, while accuracy reaches significant results.
Published in: IEEE Robotics and Automation Letters ( Volume: 6, Issue: 3, July 2021)
Page(s): 5016 - 5023
Date of Publication: 02 April 2021

ISSN Information:

Funding Agency:

Citations are not available for this document.

I. Introduction

Real-time visual object tracking is a key module in many robotic perception systems [1]–[6]. Recently, deep regression trackers [7]–[9] (DRTs) have been proposed in the robotics community [7] because of their efficiency and generality. Thanks to their simple architecture, DRTs achieve processing speeds that surpass 100 FPS, making them suitable even for low-resource robots. Moreover, with the availability of large-scale computer vision datasets [10], these trackers can learn to track a large variety of targets without relying on particular assumptions, thus simplifying the development of tracking pipelines. However, acquiring thousands of videos for training these systems is not realistic in many real-world robotic application domains. Additionally, many domains offer particular scenarios that differ much from the examples which DRTs are trained on. For example, drone [11] and driving [3], [12] applications require tracking objects from particular camera views. Underwater robots offer uncommon targets and settings [4], [13]. Other robotics systems can use different imaging modalities [2]. Robotic manipulation configurations need the tracking of atypical objects [14]. As shown in Fig. 1, these situations cause DRTs’ accuracy to be very low. This is due to their deep learning architecture that is subject to overfitting if trained directly on small application datasets, and suffers from the shift between training and test data distributions when trained for large-scale generic object tracking.

Cites in Papers - |

Cites in Papers - IEEE (7)

Select All
1.
Chensheng Peng, Zhaoyu Zeng, Jinling Gao, Jundong Zhou, Masayoshi Tomizuka, Xinbing Wang, Chenghu Zhou, Nanyang Ye, "PNAS-MOT: Multi-Modal Object Tracking With Pareto Neural Architecture Search", IEEE Robotics and Automation Letters, vol.9, no.5, pp.4377-4384, 2024.
2.
Matteo Dunnhofer, Luca Sordi, Niki Martinel, Christian Micheloni, "Tracking Skiers from the Top to the Bottom", 2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), pp.8496-8506, 2024.
3.
Matteo Dunnhofer, Luca Sordi, Christian Micheloni, "Visualizing Skiers' Trajectories in Monocular Videos", 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp.5188-5198, 2023.
4.
Xie Chen, Dawei Zhang, Zhonglong Zheng, Yiran He, "Deep Regression Tracking with Graph Attention", 2022 International Conference on Image Processing, Computer Vision and Machine Learning (ICICML), pp.352-358, 2022.
5.
Matteo Dunnhofer, Christian Micheloni, "CoCoLoT: Combining Complementary Trackers in Long-Term Visual Tracking", 2022 26th International Conference on Pattern Recognition (ICPR), pp.5132-5139, 2022.
6.
Han Sun, Yongqiang Bai, Wenbo Zhang, "Distilling Siamese Trackers with Attention Mask", 2022 41st Chinese Control Conference (CCC), pp.6622-6627, 2022.
7.
Matteo Dunnhofer, Antonino Furnari, Giovanni Maria Farinella, Christian Micheloni, "Is First Person Vision Challenging for Object Tracking?", 2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), pp.2698-2710, 2021.

Cites in Papers - Other Publishers (9)

1.
David J. Barrientos R, Marie Chantelle C. Medina, Bruno J.T. Fernandes, Pablo V.A. Barros, "The use of reinforcement learning algorithms in object tracking: A systematic literature review", Neurocomputing, pp.127954, 2024.
2.
Muhammad Hassan Tanveer, Zainab Fatima, Shehnila Zardari, David Guerra-Zubiaga, "An In-Depth Analysis of Domain Adaptation in Computer and Robotic Vision", Applied Sciences, vol.13, no.23, pp.12823, 2023.
3.
Asif Hussain Khan, Rao Muhammad Umer, Matteo Dunnhofer, Christian Micheloni, Niki Martinel, "LBKENet:Lightweight Blur Kernel Estimation Network for Blind Image Super-Resolution", Image Analysis and Processing ? ICIAP 2023, vol.14234, pp.209, 2023.
4.
Yan Zhu, Peijun Hu, Xiang Li, Yu Tian, Xueli Bai, Tingbo Liang, Jingsong Li, "An End-to-End Data-Adaptive Pancreas Segmentation System with an Image Quality Control Toolbox", Journal of Healthcare Engineering, vol.2023, pp.1, 2023.
5.
Xingmei Wang, Guohao Nie, Boquan Li, Yilin Zhao, Minyang Kang, Bo Liu, "Hierarchical memory-guided long-term tracking with meta transformer inquiry network", Knowledge-Based Systems, vol.269, pp.110504, 2023.
6.
Bo Ju, Zhikang Zou, Xiaoqing Ye, Minyue Jiang, Xiao Tan, Errui Ding, Jingdong Wang, "Paint and Distill", Proceedings of the 30th ACM International Conference on Multimedia, pp.5639, 2022.
7.
Matteo Dunnhofer, Antonino Furnari, Giovanni Maria Farinella, Christian Micheloni, "Visual Object Tracking in First Person Vision", International Journal of Computer Vision, 2022.
8.
Matteo Dunnhofer, Kristian Simonato, Christian Micheloni, "Combining complementary trackers for enhanced long-term visual object tracking", Image and Vision Computing, vol.122, pp.104448, 2022.
9.
Matteo Miani, Matteo Dunnhofer, Christian Micheloni, Andrea Marini, Nicola Baldo, "Surrogate Safety Measures Prediction at Multiple Timescales in V2P Conflicts Based on Gated Recurrent Unit", Sustainability, vol.13, no.17, pp.9681, 2021.
Contact IEEE to Subscribe

References

References is not available for this document.