###
计算机系统应用英文版:2021,30(8):179-185
本文二维码信息
码上扫一扫!
基于深度学习的场景文本检测与识别
(1.中国石油大学(华东) 计算机科学与技术学院, 青岛 266580;2.山东电子职业技术学院 教务处, 济南 250200)
Scene Text Detection and Recognition Based on Deep Learning
(1.College of Computer Science and Technology, China University of Petroleum, Qingdao 266580, China;2.Academic Affairs Office, Shandong College of Electronic Technology, Jinan 250200, China)
摘要
图/表
参考文献
相似文献
本文已被:浏览 771次   下载 1470
Received:November 19, 2020    Revised:December 21, 2020
中文摘要: 针对复杂场景下文本识别流程复杂繁琐、适应性差、准确度低等缺点, 本文提出一种复杂场景下文本检测和识别的新方法. 该方法由文本区域检测网络及文本识别网络构成, 文本区域检测网络为改进的PSENet, 将PSENet的骨干网络改为ResNeXt-101, 在特征提取过程中加入可微二值化操作来优化分割网络, 不仅简化了后处理, 而且提高了文本检测的性能. 将卷积神经网络和加入聚合交叉熵损失的长短时记忆网络组成文本识别网络, 聚合交叉熵的引入提高了文本识别的准确性. 本文在两个数据集上进行验证, 实验结果表明, 两个网络模型融合后准确率最高达到95.6%, 优于改进之前的方法. 该方法能有效地检测和识别任意文本实例, 具有很好的实用性.
Abstract:This study proposes a new method for text detection and recognition in complex scenes to eliminate the shortcomings of a complicated text recognition process, poor adaptability, and low accuracy. This method is composed of a text area detection network and a text recognition network. The text area detection network is an improved PSENet. The backbone network of PSENet is changed to ResNeXt-101, and a differentiable binarization operation is added to optimize the segmentation network in the feature extraction process, which not only simplifies post-processing but also improves text detection. The text recognition network is formed by combining a convolutional neural network with a long short-term memory network with aggregate cross-entropy loss. The introduction of aggregate cross-entropy improves the accuracy of text recognition. Furthermore, experimental verification is carried out on two data sets, and the results show that the new method has accuracy as high as 95.6%, which is better than the previous methods. This method can effectively detect and recognize any text instances and has good practicability.
文章编号:     中图分类号:    文献标志码:
基金项目:科技部创新方法工作专项(2015IM010300)
引用文本:
宫法明,刘芳华,李厥瑾,宫文娟.基于深度学习的场景文本检测与识别.计算机系统应用,2021,30(8):179-185
GONG Fa-Ming,LIU Fang-Hua,LI Jue-Jin,GONG Wen-Juan.Scene Text Detection and Recognition Based on Deep Learning.COMPUTER SYSTEMS APPLICATIONS,2021,30(8):179-185