Loading [MathJax]/extensions/MathZoom.js
TTA-COPE: Test-Time Adaptation for Category-Level Object Pose Estimation | IEEE Conference Publication | IEEE Xplore

TTA-COPE: Test-Time Adaptation for Category-Level Object Pose Estimation


Abstract:

Test-time adaptation methods have been gaining attention recently as a practical solution for addressing source-to-target domain gaps by gradually updating the model with...Show More

Abstract:

Test-time adaptation methods have been gaining attention recently as a practical solution for addressing source-to-target domain gaps by gradually updating the model without requiring labels on the target data. In this paper, we propose a method of test-time adaptation for category-level object pose estimation called TTA-COPE. We design a pose ensemble approach with a self-training loss using pose-aware confidence. Unlike previous unsupervised domain adaptation methods for category-level object pose estimation, our approach processes the test data in a sequential, online manner, and it does not require access to the source domain at runtime. Extensive experimental results demonstrate that the proposed pose ensemble and the self-training loss improve category-level object pose performance during test time under both semi-supervised and unsupervised settings.
Date of Conference: 17-24 June 2023
Date Added to IEEE Xplore: 22 August 2023
ISBN Information:

ISSN Information:

Conference Location: Vancouver, BC, Canada

Funding Agency:

References is not available for this document.

1. Introduction

Object pose estimation is a crucial problem in computer vision and robotics. Advanced methods that focus on diverse variations of object 6D pose estimation have been introduced, such as known 3D objects (instance-level) [28], [38], category-level [18], [36], [43], few-shot [52], and zero-shot pose estimation [13], [47]. These techniques are useful for downstream applications requiring an online operation, such as robotic manipulation [6], [25], [48] and augmented reality [23], [24], [32]. Our paper focuses on the category-level object pose estimation problem since it is more broadly applicable than the instance-level problem.

Select All
1.
Adel Ahmadyan, Liangkai Zhang, Artsiom Ablavatski, Jianing Wei and Matthias Grundmann, "Objectron: A large scale dataset of object-centric videos in the wild with pose annotations", Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 7822-7831, 2021.
2.
Dengsheng Chen, Jun Li, Zheng Wang and Kai Xu, "Learning canonical shape space for category-level 6D object pose and size estimation", Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 11973-11982, 2020.
3.
Kai Chen and Qi Dou, "SGPA: Structure-guided prior adaptation for category-level 6D object pose estimation", Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pp. 2773-2782, 2021.
4.
Christopher Choy, JunYoung Gwak and Silvio Savarese, "4d spatiotemporal convnets: Minkowski convolutional neural networks", Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3075-3084, 2019.
5.
Yang Fu and Xiaolong Wang, "Category-level 6d object pose estimation in the wild: A semi-supervised learning approach and a new dataset", arXiv preprint, 2022.
6.
Wei Gao and Russ Tedrake, "kpam 2.0: Feedback control for category-level robotic manipulation", IEEE Robotics and Automation Letters (RA-L), vol. 6, no. 2, pp. 2962-2969, 2021.
7.
Richard Hartley and Andrew Zisserman, Multiple View Geometry in Computer Vision, Cambridge University Press, 2003.
8.
Kaiming He, Georgia Gkioxari, Piotr Dollár and Ross Girshick, "Mask r-cnn", Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pp. 2961-2969, 2017.
9.
Kaiming He, Xiangyu Zhang, Shaoqing Ren and Jian Sun, "Deep residual learning for image recognition", Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770-778, 2016.
10.
Sergey Ioffe and Christian Szegedy, "Batch normalization: Accelerating deep network training by reducing internal covariate shift", International Conference on Machine Learning (ICML), pp. 448-456, 2015.
11.
Muhammad Zubair Irshad, Sergey Zakharov, Rares Ambrus, Thomas Kollar, Zsolt Kira and Adrien Gaidon, "ShAPO: Implicit representations for multi-object shape appearance and pose optimization", arXiv preprint, 2022.
12.
Maximilian Jaritz, Tuan-Hung Vu, Raoul de Charette, Emilie Wirbel and Patrick Pérez, "xmuda: Cross-modal unsupervised domain adaptation for 3d semantic segmentation", Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 12605-12614, 2020.
13.
Yann Labbé, Lucas Manuelli, Arsalan Mousavian, Stephen Tyree, Stan Birchfield, Jonathan Tremblay, et al., "MegaPose: 6d pose estimation of novel objects via render & compare", 6th Annual Conference on Robot Learning (CoRL), 2022.
14.
Taeyeop Lee, Byeong-Uk Lee, Myungchul Kim and In So Kweon, "Category-level metric scale object shape and pose estimation", IEEE Robotics and Automation Letters (RA-L), vol. 6, no. 4, pp. 8575-8582, 2021.
15.
Taeyeop Lee, Byeong-Uk Lee, Inkyu Shin, Jaesung Choe, Ukcheol Shin, In So Kweon, et al., "UDA-COPE: Unsupervised domain adaptation for category-level object pose estimation", Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 14891-14900, 2022.
16.
Jiehong Lin, Zewei Wei, Changxing Ding and Kui Jia, "Category-level 6d object pose and size estimation using self-supervised deep prior deformation networks", Proceedings of the European Conference on Computer Vision (ECCV), pp. 19-34, 2022.
17.
Jiehong Lin, Zewei Wei, Zhihao Li, Songcen Xu, Kui Jia and Yuanqing Li, "DualPoseNet: Category-level 6D object pose and size estimation using dual pose network with refined learning of pose consistency", Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pp. 3560-3569, 2021.
18.
Yunzhi Lin, Jonathan Tremblay, Stephen Tyree, Patricio A. Vela and Stan Birchfield, "Single-stage keypoint-based category-level object pose estimation from an rgb image", IEEE International Conference on Robotics and Automation (ICRA), 2022.
19.
Shichen Liu, Tianye Li, Weikai Chen and Hao Li, "Soft rasterizer: A differentiable renderer for image-based 3d reasoning", Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pp. 7708-7717, 2019.
20.
Shaohui Liu, Yinda Zhang, Songyou Peng, Boxin Shi, Marc Pollefeys and Zhaopeng Cui, "DIST: Rendering deep implicit signed distance function with differentiable sphere tracing", Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2019-2028, 2020.
21.
Yuang Liu, Wei Zhang and Jun Wang, "Source-free domain adaptation for semantic segmentation", Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1215-1224, 2021.
22.
Massimiliano Mancini, Hakan Karaoguz, Elisa Ricci, Patric Jensfelt and Barbara Caputo, "Kitting in the wild through online domain adaptation", IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 1103-1109, 2018.
23.
Eric Marchand, Hideaki Uchiyama and Fabien Spindler, "Pose estimation for augmented reality: A hands-on survey", IEEE Transactions on Visualization and Computer Graphics (TVCG), vol. 22, no. 12, pp. 2633-2651, 2015.
24.
Eitan Marder-Eppstein, "Project Tango", ACM SIGGRAPH Real-Time Live, pp. 25, 2016.
25.
Arsalan Mousavian, Clemens Eppner and Dieter Fox, "6-DoF GraspNet: Variational grasp generation for object manipulation", Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pp. 2901-2910, 2019.
26.
Shuaicheng Niu, Jiaxiang Wu, Yifan Zhang, Yaofo Chen, Shijian Zheng, Peilin Zhao, et al., "Efficient test-time model adaptation without forgetting", International Conference on Machine Learning (ICML), pp. 16888-16905, 2022.
27.
Kiru Park, Timothy Patten and Markus Vincze, "Pix2Pose: Pixel-wise coordinate regression of objects for 6D pose estimation", Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pp. 7668-7677, 2019.
28.
Sida Peng, Yuan Liu, Qixing Huang, Xiaowei Zhou and Hujun Bao, "PVNet: Pixel-wise voting network for 6DoF pose estimation", Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 4561-4570, 2019.
29.
Wanli Peng, Jianhang Yan, Hongtao Wen and Yi Sun, "Self-supervised category-level 6D object pose estimation with deep implicit shape representation", Proceedings of the Association for the Advancement of Artificial Intelligence (AAAI), vol. 36, pp. 2082-2090, 2022.
30.
Luis Pineda, Taosha Fan, Maurizio Monge, Shobha Venkataraman, Paloma Sodhi, Ricky T. Q. Chen, et al., "Theseus: A library for differentiable nonlinear optimization", Advances in Neural Information Processing Systems, 2022.

Contact IEEE to Subscribe

References

References is not available for this document.