Abstract:
Scene text detection attracts much attention in computer vision because it can be used in many applications, such as real-time text translation, automatic information entry, assistance for blind people, and robot sensing. Although many methods have been proposed for horizontal and oriented text, detecting irregularly shaped text such as curved text remains a challenging problem. To address it, we propose a robust scene text detection method with adaptive text region representation. Given an input image, a text region proposal network is first used to extract text proposals. These proposals are then verified and refined by a refinement network. For text region refinement, we propose a recurrent neural network based adaptive text region representation, in which a pair of boundary points is predicted at each time step until no new points are found. In this way, text regions of arbitrary shape are detected and represented with an adaptive number of boundary points, which gives a more accurate description of text regions. Experimental results on five benchmarks, namely CTW1500, TotalText, ICDAR2013, ICDAR2015 and MSRA-TD500, show that the proposed method achieves state-of-the-art performance in scene text detection.
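The adaptive representation described in the abstract can be illustrated with a minimal Python sketch. It is not the paper's implementation: the recurrent predictor is replaced by a hypothetical stand-in (`toy_step`), and the names `decode_boundary` and `pairs_to_polygon`, the stop flag, the step limit, and the top/bottom ordering of each pair are all assumptions made for illustration. The sketch only shows how a variable number of boundary point pairs might be collected and turned into a closed polygon.

```python
from typing import Callable, List, Tuple

Point = Tuple[float, float]
Pair = Tuple[Point, Point]  # assumed layout: (top point, bottom point)


def decode_boundary(step: Callable[[int], Tuple[Pair, bool]],
                    max_steps: int = 32) -> List[Pair]:
    """Collect one boundary point pair per time step until a stop signal."""
    pairs: List[Pair] = []
    for t in range(max_steps):
        pair, is_stop = step(t)
        if is_stop:  # no new points found: end of the text region
            break
        pairs.append(pair)
    return pairs


def pairs_to_polygon(pairs: List[Pair]) -> List[Point]:
    """Walk along the top points, then back along the bottom points."""
    top = [p[0] for p in pairs]
    bottom = [p[1] for p in pairs]
    return top + bottom[::-1]


def toy_step(t: int) -> Tuple[Pair, bool]:
    """Hypothetical stand-in for the recurrent predictor: three pairs, then stop."""
    xs = [0.0, 10.0, 20.0]
    if t >= len(xs):
        return ((0.0, 0.0), (0.0, 0.0)), True
    return ((xs[t], 0.0), (xs[t], 5.0)), False


polygon = pairs_to_polygon(decode_boundary(toy_step))
```

With this toy predictor the region is described by three pairs, giving a six-vertex polygon; a longer or more strongly curved region would simply yield more pairs before the stop signal.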
Date of Conference: 15-20 June 2019
Date Added to IEEE Xplore: 09 January 2020