Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3573942.3574075acmotherconferencesArticle/Chapter ViewAbstractPublication PagesaiprConference Proceedingsconference-collections
research-article

Handwritten Text Recognition of Imagery Acquisition Sheet in Tobacco Industry Using an Improved CRNN Network

Published: 16 May 2023 Publication History

Abstract

In the tobacco industry, the acquisition sheet is a kind of document used by the tobacco monopoly administration to record the information about the acquisition of fake and illicit cigarettes when the tobacco crime is seized by the "Internet Plus Express". To automatically identify the information helps to realize the automation of information management of tobacco monopoly administration department, and then build delivering for smoke crime management and prediction system, thus to deliver smoke involved crime effectively, and realize the wisdom of the tobacco industry monopoly, so to handwriting recognition research of the acquisition sheet image is very necessary. In this paper, an improved CRNN network text recognition method is proposed. Firstly, the acquisition sheet image with interference such as shadow and seal is preprocessed based on color channel separation method. Then, in the presence of printed text, the YoloX model is used to detect the handwritten text in the image, and the handwritten text line dataset is obtained. Then, the CNN network is improved by adding Swish activation function and attention mechanism to obtain the depth image features of handwritten text line data. BiLSTM network predicts the input feature sequence and finally realizes the recognition of handwritten text. Through the algorithm in this paper, the detection of handwritten text in the acquisition sheet image reaches an average accuracy of 0.983. The experimental results show that, compared with the CRNN model, the improved CRNN model in this paper has a better effect on the recognition of handwritten text in the acquisition sheet image.

References

[1]
Q X Ye, D Doermann. Text Detection and Recognition in Imagery:A Survey[J]. In: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 37(7): 1480-1500.
[2]
X C Yin, Z Y Zuo, S Tian. Text Detection, Tracking and Recognition in Video:A Comprehensive Survey[J]. In: IEEE Transactions on Image Processing, 2016, 25(6): 2752-2773.
[3]
R W Dai, C L Liu, B H Xiao. Chinese character recognition:history, status and prospects[J]. In: Front. Comput. Sci. China, 2007, 1(2): 126-136.
[4]
Zhi T, Huang W, Tong H, Detecting Text in Natural Image with Connectionist Text Proposal Network[C]// European Conference on Computer Vision. Springer, Cham, 2016.
[5]
Shi B, Bai X, Belongie S . Detecting Oriented Text in Natural Images by Linking Segments[C]// IEEE Computer Society. IEEE Computer Society, 2017.
[6]
Zhou X, Yao C, Wen H, EAST: An Efficient and Accurate Scene Text Detector[C]// 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, 2017.
[7]
Zhang C, Liang B, Huang Z, Look More Than Once: An Accurate Detector for Text of Arbitrary Shapes[C]// 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, 2019.
[8]
He K, Zhang X, Ren S, Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition[J]. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2014, 37(9):1904-16.
[9]
Ren S, He K, Girshick R, Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks[J]. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2017, 39(6):1137-1149.
[10]
Liu W, Anguelov D, Erhan D, SSD: Single Shot MultiBox Detector[J]. 2015.
[11]
Redmon J, Farhadi A . YOLOv3: An Incremental Improvement[J]. arXiv e-prints, 2018.
[12]
Z Q Zhao, P Zheng, S T Xu, X D Wu. Object Detection With Deep Learning: A Review[J]. In: IEEE Transactions on Neural Networks and Learning Systems, 2019, 30(11): 3212-3232.
[13]
Ge Z, Liu S, Wang F, YOLOX: Exceeding YOLO Series in 2021[J]. 2021.
[14]
Bluche T, Louradour, Jérme, Messina R . Scan, Attend and Read: End-to-End Handwritten Paragraph Recognition with MDLSTM Attention[J]. IEEE, 2018.
[15]
Sueiras, Jorge, Ruiz, Offline continuous handwriting recognition using sequence to sequence neural networks[J]. Neurocomputing, 2018.
[16]
Lei K, Toledo J I, Riba P, Convolve, Attend and Spell: An Attention-based Sequence-to-Sequence Model for Handwritten Word Recognition[C]// German Conference on Pattern Recognition. Springer, Cham, 2018.
[17]
Graves A, S Fernández, Gomez F . Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks[C]// International Conference on Machine Learning. ACM, 2006.

Index Terms

  1. Handwritten Text Recognition of Imagery Acquisition Sheet in Tobacco Industry Using an Improved CRNN Network

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    AIPR '22: Proceedings of the 2022 5th International Conference on Artificial Intelligence and Pattern Recognition
    September 2022
    1221 pages
    ISBN:9781450396899
    DOI:10.1145/3573942
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 16 May 2023

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. BiLSTM
    2. CRNN network
    3. Handwritten text recognition
    4. YoloX model

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    AIPR 2022

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 23
      Total Downloads
    • Downloads (Last 12 months)9
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 25 Oct 2024

    Other Metrics

    Citations

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media