THERMAL SCIENCE
International Scientific Journal
COMBINED WITH THE RESIDUAL AND MULTI-SCALE METHOD FOR CHINESE THERMAL POWER SYSTEM RECORD TEXT RECOGNITION
ABSTRACT
Aiming at the problem that the recognition accuracy based on convolutional neural network of thermal power system record text is not high, a method of thermal power system record text recognition based on residual and multi-scale feature combination was proposed and implemented. Combined with the residual, a new network is designed to replace the traditional convolutional neural network and improve the feature extraction ability of the network. The 1 × 1 convolution core was used to increase the network depth and reduce the parameters instead of the 3 × 3 convolution core. The network order of each layer in residual block was adjusted so that the network representation ability can be improved. Combining feature information of different scales and retaining more vertical feature information, the classification accuracy of the network is improved. Experiments on the self-built image data set of thermal power system records show that the proposed network model improves the accuracy by 11% compared with convolutional recurrent neural network, and has better robustness to image distortion and blurring.
KEYWORDS
PAPER SUBMITTED: 2018-11-28
PAPER REVISED: 2019-01-25
PAPER ACCEPTED: 2019-01-05
PUBLISHED ONLINE: 2019-05-18
THERMAL SCIENCE YEAR
2019, VOLUME
23, ISSUE
Issue 5, PAGES [2631 - 2640]
- Mori S, et al., Historical review of OCR research and development. Proceedings of the IEEE, 80(1992), 7, pp. 1029-1058.
- Zhang Y, et al., Scene text recognition with deeper convolutional neural networks
- Yao C, et al., Strokelets: A Learned Multi-scale Representation for Scene Text Recognition
- Heutte L, et al., A structural/statistical feature based vector for handwritten character recognition
- Das N, et al., A statistical-topological feature combination for recognition of handwritten numerals
- YunTao W, et al., End-to-End Text Recognition with Convolutional Neural Networks
- Jaderberg M, et al., Reading Text in the Wild with Convolutional Neural Networks
- Ahranjany S S, et al., A very high accuracy handwritten character recognition system for Farsi/Arabic digits using Convolutional Neural Networks
- Wang Z, et al., Trilateral constrained sparse representation for Kinect depth hole filling. Pattern Recognit Lett 65 (2015), pp.95-102
- Liu H, et al., Distributed source localization under anchor position uncertainty. Chin J Electron 23 (2014), 1, pp. 93-96
- Zhong L, et al., OHRank: an algorithm integrating mentality and influence of opinion holder for opinion mining. Chin J Electron 22 (2013), 4, pp. 655-660
- Peng, Li , et al. A robust method for estimating image geometry with local structure constraint. IEEE Access 99(2018), PP. 1-10.
- Lu, Tao , et al. Robust Face Super-Resolution via Locality-constrained Low-rank Representation. IEEE Access (2017), pp. 1-10.
- Wang Z, et al., Trilateral constrained sparse representation for Kinect depth hole filling
- YunTao W, et al., Utilizing Principal Singular Vectors for 2D DOA Estimation in Single Snapshot Case with Uniform Rectangular Array
- YunTao W, et al., HOSVD-Based Subspace Algorithm for Multidimensional Frequency Estimation Without Pairing Parameters, CHINESE JOURNAL OF ELECTRONICS, 23 (2014), 4, pp. 729-734
- Jiao L, et al., Offline handwritten English character recognition based on convolutional neural network
- Yu N, et al., Handwritten digits recognition base on improved LeNet5
- Yann LeCun, Learning Invariant Feature Hierarchies, in Fusiello, Andrea and Murino, Vittorio and Cucchiara, Rita (Eds), European Conference on Computer Vision (ECCV 2012), 7583 (2012), pp.496-505
- T. Wang, et al., End-to-end text recognition with convolutional neural networks, Proceedings of the 21st International Conference on Pattern Recognition(ICPR 2012. Tsukuba Science City, JAPAN), (2012)
- Shi B, et al., An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition
- He K, et al., Deep Residual Learning for Image Recognition
- Ioffe S, et al., Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
- Hochreiter S, et al., Long short-term memory.Neural Computation, 9 (1997), 8, pp. 1735-1780.
- Graves A, Gomez F. Connectionist temporal classification:labelling unsegmented sequence data with recurrent neural networks
- Kingma D,Ba J.Adam:A method for stochastic optimization