Loading [MathJax]/extensions/MathMenu.js
Large-Scale Person Detection and Localization using Overhead Fisheye Cameras | IEEE Conference Publication | IEEE Xplore

Large-Scale Person Detection and Localization using Overhead Fisheye Cameras


Abstract:

Location determination finds wide applications in daily life. Instead of existing efforts devoted to localizing tourist photos captured by perspective cameras, in this ar...Show More

Abstract:

Location determination finds wide applications in daily life. Instead of existing efforts devoted to localizing tourist photos captured by perspective cameras, in this article, we focus on devising person positioning solutions using overhead fisheye cameras. Such solutions are advantageous in large field of view (FOV), low cost, anti-occlusion, and unaggressive work mode (without the necessity of cameras carried by persons). However, related studies are quite scarce, due to the paucity of data. To stimulate research in this exciting area, we present LOAF, the first large-scale overhead fisheye dataset for person detection and localization. LOAF is built with many essential features, e.g., i) the data cover abundant diversities in scenes, human pose, density, and location; ii) it contains currently the largest number of annotated pedestrian, i.e., 457K bounding boxes with groundtruth location information; iii) the body-boxes are labeled as radius-aligned so as to fully address the positioning challenge. To approach localization, we build a fisheye person detection network, which exploits the fisheye distortions by a rotation-equivariant training strategy and predict radius-aligned human boxes end-to-end. Then, the actual locations of the detected persons are calculated by a numerical solution on the fisheye model and camera altitude data. Extensive experiments on LOAF validate the superiority of our fisheye detector w.r.t. previous methods, and show that our whole fisheye positioning solution is able to locate all persons in FOV with an accuracy of0.5 m, within 0.1 s.
Date of Conference: 01-06 October 2023
Date Added to IEEE Xplore: 15 January 2024
ISBN Information:

ISSN Information:

Conference Location: Paris, France

Funding Agency:

References is not available for this document.

1. Introduction

Accurate position finding of persons attracts growing interest from both research and industrial communities, since it plays a crucial role in numerous location-sensitive application scenarios (e.g., surveillance, smart home, public health). Nevertheless, due to the line-of-sight (LOS) issue, GPS is unreliable in interior spaces and urban canyon. To overcome such limitation, various alternative solutions are investigated. Signal based solutions, including Bluetooth [13] and Wi-Fi [74], are popular, but they are easily interfered by changing environments and nearby human bodies [73]. A complementary stream of work is vision based; they typically make use of traditional cameras, RGBD cameras, or in-built smartphone cameras, and enjoy the advantage of reliable services. To get location information, visual positioning solutions usually refer to a pre-acquired 3D map or a geo-tagged database as the scene representation [63], or directly utilize the captured image to estimate the camera pose [39].

Select All
1.
Abdulrahman Alarifi, AbdulMalik Al-Salman, Mansour Alsaleh, Ahmad Alnafessah, Suheer Al-Hadhrami, Mai A Al-Ammar, et al., "Ultra wideband indoor positioning technologies: Analysis and recent advances", Sensors, vol. 16, no. 5, pp. 707, 2016.
2.
Relja Arandjelovic, Petr Gronat, Akihiko Torii, Tomas Pajdla and Josef Sivic, "Netvlad: Cnn architecture for weakly supervised place recognition", CVPR, 2016.
3.
Vassileios Balntas, Shuda Li and Victor Prisacariu, "Relocnet: Continuous metric learning relocalisation using neural nets", ECCV, 2018.
4.
Federico Boniardi, Abhinav Valada, Rohit Mohan, Tim Caselitz and Wolfram Burgard, "Robot localization in floor plans using a room layout edge extraction network", IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019.
5.
Eric Brachmann, Alexander Krull, Sebastian Nowozin, Jamie Shotton, Frank Michel, Stefan Gumhold, et al., "Dsac-differentiable ransac for camera localization", CVPR, 2017.
6.
Eric Brachmann and Carsten Rother, "Expert sample consensus applied to camera re-localization", ICCV, 2019.
7.
Guillaume Bresson, Li Yu, Cyril Joly and Fabien Moutarde, "Urban localization with street views using a convolutional neural network for end-to-end camera pose regression", IEEE Intelligent Vehicles Symposium, 2019.
8.
Jiale Cao, Yanwei Pang, Jin Xie, Fahad Shahbaz Khan and Ling Shao, "From handcrafted to deep features for pedestrian detection: a survey", IEEE TPAMI, 2021.
9.
Nicolas Carion, Francisco Massa, Gabriel Synnaeve, Nicolas Usunier, Alexander Kirillov and Sergey Zagoruyko, "End-to-end object detection with transformers", ECCV, 2020.
10.
Bao Xin Chen and John K Tsotsos, "Fast visual object tracking with rotated bounding boxes", 2019.
11.
David M Chen, Georges Baatz, Kevin Köser, Sam S Tsai, Ramakrishna Vedantham, Timo Pylvänäinen, Kimmo Roimela, Xin Chen, Jeff Bach, Marc Pollefeys et al., "City-scale landmark identification on mobile devices", CVPR, 2011.
12.
Long Chen, Shaobo Lin, Xiankai Lu, Dongpu Cao, Hangbin Wu, Chi Guo, et al., "Deep neural network based vehicle and pedestrian detection for autonomous driving: A survey", IEEE TITS, 2021.
13.
Liang Chen, Ling Pei, Heidi Kuusniemi, Yuwei Chen, Tuomo Kröger and Ruizhi Chen, "Bayesian fusion for indoor positioning using bluetooth fingerprints", Wireless Personal Communications, 2013.
14.
An-Ti Chiang and Yao Wang, "Human detection in fish-eye images using hog-based detectors over rotated windows", ICME Workshop, 2014.
15.
Sheng-Ho Chiang, Tsaipei Wang and Yi-Fu Chen, "Efficient pedestrian detection in top-view fisheye images using compositions of perspective view patches", Image and Vision Computing, 2021.
16.
Shin-Fang Ch'ng, Naoya Sogi, Pulak Purkait, Tat-Jun Chin and Kazuhiro Fukui, "Resolving marker pose ambiguity by robust rotation averaging with clique constraints", ICRA, 2020.
17.
Ibrahim Cinaroglu and Yalin Bastanlar, "A direct approach for human detection with catadioptric omnidirectional cameras", Signal Processing and Communications Applications Conference, 2014.
18.
Taco Cohen and Max Welling, "Group equivariant convolutional networks", ICLR, 2016.
19.
"Attribution-noncommercial-sharealike 4.0 international", [online] Available: https://creativecommons.org/licenses/by-nc-sa/4.0/.
20.
Joseph DeGol, Timothy Bretl and Derek Hoiem, "Chromatag: A colored marker and fast detection algorithm", ICCV, 2017.
21.
Carlos R del Blanco, Pablo Carballeira, Fernando Jaureguizar and Narciso García, "Robust people indoor localization with omnidirectional cameras using a grid of spatial-aware classifiers", Signal Processing: Image Communication, 2021.
22.
Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li and Li Fei-Fei, "Imagenet: A large-scale hierarchical image database", CVPR, 2009.
23.
Mingyu Ding, Zhe Wang, Jiankai Sun, Jianping Shi and Ping Luo, "Camnet: Coarse-to-fine retrieval for camera relocalization", ICCV, 2019.
24.
Zhihao Duan, Ozan Tezcan, Hayato Nakamura, Prakash Ishwar and Janusz Konrad, "Rapid: rotation-aware people detection in overhead fisheye images", CVPR Workshop, 2020.
25.
Carlos Esteves, Christine Allen-Blanchette, Ameesh Makadia and Kostas Daniilidis, "Learning SO(3) equivariant representations with spherical cnns", ECCV, 2018.
26.
Zahid Farid, Rosdiadee Nordin and Mahamod Ismail, "Recent advances in wireless indoor localization techniques and system", Journal of Computer Networks and Communications, 2013.
27.
Mark Fiala, "Artag a fiducial marker system using digital techniques", CVPR, 2005.
28.
Martin A Fischler and Robert C Bolles, "Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography", Communications of the ACM, vol. 24, no. 6, pp. 381-395, 1981.
29.
Hugo Germain, Vincent Lepetit and Guillaume Bourmaud, "Neural reprojection error: Merging feature learning and camera pose estimation", CVPR, 2021.
30.
Yanying Gu, Anthony Lo and Ignas Niemegeers, "A survey of indoor positioning systems for wireless personal networks", IEEE Communications surveys & tutorials, 2009.
Contact IEEE to Subscribe

References

References is not available for this document.