Loading [MathJax]/extensions/MathMenu.js
High Throughput Hardware/Software Heterogeneous System for RRPN-Based Scene Text Detection | IEEE Journals & Magazine | IEEE Xplore

High Throughput Hardware/Software Heterogeneous System for RRPN-Based Scene Text Detection


Abstract:

Rotation Region Proposal Networks (RRPN) are used to generate rotated proposals with the information of text angle for arbitrary oriented scene text detection (STD). Howe...Show More

Abstract:

Rotation Region Proposal Networks (RRPN) are used to generate rotated proposals with the information of text angle for arbitrary oriented scene text detection (STD). However, the computational complexity of RRPN inference is relatively high compared with other methods, which makes it difficult for massive deployment. In this paper, the first full-stack FPGA-CPU heterogeneous system design of RRPN-based STD algorithm is proposed. A hardware/software partition method is presented to analyze and split the tasks to enhance the computation efficiency of hardware. The fast 2D Winograd algorithm and block floating point are utilized to reduce computation complexity while maintaining a relatively high precision. The implementation results show that the peak performance of MAC arrays in the proposed architecture reaches 655.4 GOPS and the energy efficiency achieves 64.9 GOPS/W. By fully exploiting the parallel and pipelined merits in the algorithms, the first hardware architectures for skew non-maximum suppression (S-NMS) layer and rotation region-of-interest (RRoI) polling layer are proposed. The throughput of the proposed hardware/software heterogeneous system achieves 40 times and 1.4 times improvements compared with CPU and GPU, respectively. Moreover, the comprehensive operating expense ratio of pure CPU, GPU, and the proposed system is 80.7:2.5:1, which indicates that it is suitable for massive deployment.
Published in: IEEE Transactions on Computers ( Volume: 71, Issue: 7, 01 July 2022)
Page(s): 1507 - 1521
Date of Publication: 24 June 2021

ISSN Information:

Funding Agency:

Citations are not available for this document.

1 Introduction

Scene text detection (STD) and recognition from natural scene images are important research topics in computer vision [1]. Current text detection and recognition techniques have been deeply applied in many industries such as finance, insurance, medical care, transportation, education, etc. The scenarios involving pictures or videos include e-commerce text translation, user-made content review, content/advertising recommendation distribution, and so on. While these business scenarios need to process tens of billions of data every day, the number of requests for these algorithms is still increasing significantly.

Cites in Papers - |

Cites in Papers - IEEE (3)

Select All
1.
Yishuo Meng, Junfeng Wu, Siwei Xiang, Jianfei Wang, Jia Hou, Zhijie Lin, Chen Yang, "A High-Throughput and Flexible CNN Accelerator Based on Mixed-Radix FFT Method", IEEE Transactions on Circuits and Systems I: Regular Papers, vol.72, no.2, pp.816-829, 2025.
2.
Yalan Wu, Jigang Wu, Mianyang Yao, Bosheng Liu, Long Chen, Siew Kei Lam, "Two-Level Scheduling Algorithms for Deep Neural Network Inference in Vehicular Networks", IEEE Transactions on Intelligent Transportation Systems, vol.24, no.9, pp.9324-9343, 2023.
3.
Yao Xin, Guoming Tang, Donglong Chen, Rumin Zhang, Teng Liang, Ray C. C. Cheung, Çetin Kaya Koç, "A Versatility-Performance Balanced Hardware Architecture for Scene Text Detection", 2022 IEEE Smartworld, Ubiquitous Intelligence & Computing, Scalable Computing & Communications, Digital Twin, Privacy Computing, Metaverse, Autonomous & Trusted Vehicles (SmartWorld/UIC/ScalCom/DigitalTwin/PriComp/Meta), pp.540-549, 2022.
Contact IEEE to Subscribe

References

References is not available for this document.