Loading [MathJax]/extensions/MathMenu.js
Fast and Accurate Single-Image Depth Estimation on Mobile Devices, Mobile AI 2021 Challenge: Report | IEEE Conference Publication | IEEE Xplore

Fast and Accurate Single-Image Depth Estimation on Mobile Devices, Mobile AI 2021 Challenge: Report


Abstract:

Depth estimation is an important computer vision problem with many practical applications to mobile devices. While many solutions have been proposed for this task, they a...Show More

Abstract:

Depth estimation is an important computer vision problem with many practical applications to mobile devices. While many solutions have been proposed for this task, they are usually very computationally expensive and thus are not applicable for on-device inference. To address this problem, we introduce the first Mobile AI challenge, where the target is to develop an end-to-end deep learning-based depth estimation solutions that can demonstrate a nearly real-time performance on smartphones and IoT platforms. For this, the participants were provided with a new large-scale dataset containing RGB-depth image pairs obtained with a dedicated stereo ZED camera producing high-resolution depth maps for objects located at up to 50 meters. The run-time of all models was evaluated on the popular Raspberry Pi 4 platform with a mobile ARM-based Broadcom chipset. The proposed solutions can generate VGA resolution depth maps at up to 10 FPS on the Raspberry Pi 4 while achieving high fidelity results, and are compatible with any Android or Linux-based mobile devices. A detailed description of all models developed in the challenge is provided in this paper.
Date of Conference: 19-25 June 2021
Date Added to IEEE Xplore: 01 September 2021
ISBN Information:

ISSN Information:

Conference Location: Nashville, TN, USA
Computer Vision Lab, ETH, Zurich, Switzerland
AI Witchlabs, Switzerland
Raspberry Pi (Trading) Ltd
Computer Vision Lab, ETH, Zurich, Switzerland
Computer Vision Lab, ETH, Zurich, Switzerland
Computer Vision Lab, ETH, Zurich, Switzerland
AI Witchlabs, Switzerland
Tencent GY-Lab, China
Tencent GY-Lab, China
Tencent GY-Lab, China
Tencent GY-Lab, China
Tencent GY-Lab, China
Tencent GY-Lab, China
Key Laboratory of Image Processing and Intelligent Control, Ministry of Education, School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, China
Key Laboratory of Image Processing and Intelligent Control, Ministry of Education, School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, China
Key Laboratory of Image Processing and Intelligent Control, Ministry of Education, School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, China
Key Laboratory of Image Processing and Intelligent Control, Ministry of Education, School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, China
Key Laboratory of Image Processing and Intelligent Control, Ministry of Education, School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, China
Nanjing Artificial Intelligence Chip Research, Institute of Automation, China
Nanjing Artificial Intelligence Chip Research, Institute of Automation, China
Nanjing Artificial Intelligence Chip Research, Institute of Automation, China
Black Sesame Technologies Inc, Singapore
Black Sesame Technologies Inc, Singapore
Black Sesame Technologies Inc, Singapore
Visual Media Lab, KAIST, South Korea
Visual Media Lab, KAIST, South Korea
Harbin Institute of Technology, China Peng Cheng Laboratory, China
Harbin Institute of Technology, China Peng Cheng Laboratory, China
Harbin Institute of Technology, China Peng Cheng Laboratory, China
Harbin Institute of Technology, China Peng Cheng Laboratory, China
Multimedia and Computer Vision Laboratory, National Cheng Kung University, Taiwan
Samsung Research, UK, United Kingdom
OPPO Research Institute, China
OPPO Research Institute, China
OPPO Research Institute, China
OPPO Research Institute, China
OPPO Research Institute, China
OPPO Research Institute, China
OPPO Research Institute, China
ETH, Zurich, Switzerland

1. Introduction

A wide spread of various depth-guided problems related to augmented reality, gesture recognition, object segmentation, autonomous driving and bokeh effect rendering tasks has created a strong demand for fast and efficient single-image depth estimation approaches that can run on portable low-power hardware. While many accurate deep learning-based solutions have been proposed for this problem in the past [46], [16], [14], [47], [48], [42], [15], [10], they were optimized for high fidelity results only while not taking into account computational efficiency and mobile-related constraints, which is essential for tasks related to image processing [23], [24], [37] on mobile devices. This results in solutions requiring powerful high-end GPUs and consuming gigabytes of RAM when processing even low-resolution input data, thus being incompatible with resource-constrained mobile hardware. In this challenge, we change the current depth estimation benchmarking paradigm by using a new depth estimation dataset collected in the wild and by imposing additional efficiency-related constraints on the designed solutions.

Computer Vision Lab, ETH, Zurich, Switzerland
AI Witchlabs, Switzerland
Raspberry Pi (Trading) Ltd
Computer Vision Lab, ETH, Zurich, Switzerland
Computer Vision Lab, ETH, Zurich, Switzerland
Computer Vision Lab, ETH, Zurich, Switzerland
AI Witchlabs, Switzerland
Tencent GY-Lab, China
Tencent GY-Lab, China
Tencent GY-Lab, China
Tencent GY-Lab, China
Tencent GY-Lab, China
Tencent GY-Lab, China
Key Laboratory of Image Processing and Intelligent Control, Ministry of Education, School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, China
Key Laboratory of Image Processing and Intelligent Control, Ministry of Education, School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, China
Key Laboratory of Image Processing and Intelligent Control, Ministry of Education, School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, China
Key Laboratory of Image Processing and Intelligent Control, Ministry of Education, School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, China
Key Laboratory of Image Processing and Intelligent Control, Ministry of Education, School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, China
Nanjing Artificial Intelligence Chip Research, Institute of Automation, China
Nanjing Artificial Intelligence Chip Research, Institute of Automation, China
Nanjing Artificial Intelligence Chip Research, Institute of Automation, China
Black Sesame Technologies Inc, Singapore
Black Sesame Technologies Inc, Singapore
Black Sesame Technologies Inc, Singapore
Visual Media Lab, KAIST, South Korea
Visual Media Lab, KAIST, South Korea
Harbin Institute of Technology, China Peng Cheng Laboratory, China
Harbin Institute of Technology, China Peng Cheng Laboratory, China
Harbin Institute of Technology, China Peng Cheng Laboratory, China
Harbin Institute of Technology, China Peng Cheng Laboratory, China
Multimedia and Computer Vision Laboratory, National Cheng Kung University, Taiwan
Samsung Research, UK, United Kingdom
OPPO Research Institute, China
OPPO Research Institute, China
OPPO Research Institute, China
OPPO Research Institute, China
OPPO Research Institute, China
OPPO Research Institute, China
OPPO Research Institute, China
ETH, Zurich, Switzerland
Contact IEEE to Subscribe

References

References is not available for this document.