Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1007/978-3-030-86549-8_30guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Document Dewarping with Control Points

Published: 05 September 2021 Publication History

Abstract

Document images are now widely captured by handheld devices such as mobile phones. The OCR performance on these images are largely affected due to geometric distortion of the document paper, diverse camera positions and complex backgrounds. In this paper, we propose a simple yet effective approach to rectify distorted document image by estimating control points and reference points. After that, we use interpolation method between control points and reference points to convert sparse mappings to backward mapping, and remap the original distorted document image to the rectified image. Furthermore, control points are controllable to facilitate interaction or subsequent adjustment. We can flexibly select post-processing methods and the number of vertices according to different application scenarios. Experiments show that our approach can rectify document images with various distortion types, and yield state-of-the-art performance on real-world dataset. This paper also provides a training dataset based on control points for document dewarping. Both the code and the dataset are released at https://github.com/gwxie/Document-Dewarping-with-Control-Points.

References

[1]
Brown MS and Tsoi YC Geometric and shading correction for images of printed materials using boundary IEEE Trans. Image Process. 2006 15 6 1544-1554
[2]
Courteille F, Crouzil A, Durou JD, and Gurdjos P Shape from shading for the digitization of curved documents Mach. Vision Appl. 2007 18 5 301-316
[3]
Das, S., Ma, K., Shu, Z., Samaras, D., Shilkrot, R.: DewarpNet: single-image document unwarping with stacked 3D and 2D regression networks. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 131–140 (2019)
[4]
He, Y., Pan, P., Xie, S., Sun, J., Naoi, S.: A book dewarping system by boundary-based 3D surface reconstruction. In: 2013 12th International Conference on Document Analysis and Recognition, pp. 403–407. IEEE (2013)
[5]
Li X, Zhang B, Liao J, and Sander PV Document rectification and illumination correction using a patch-based CNN ACM Trans. Graphics 2019 38 6 1-11
[6]
Liu C, Zhang Y, Wang B, and Ding X Restoring camera-captured distorted document images Int. J. Doc. Anal. Recogn. 2015 18 2 111-124
[7]
Liu W et al. Leibe B, Matas J, Sebe N, Welling M, et al. SSD: single shot multibox detector Computer Vision – ECCV 2016 2016 Cham Springer 21-37
[8]
Liu, X., Meng, G., Fan, B., Xiang, S., Pan, C.: Geometric rectification of document images using adversarial gated unwarping network. Pattern Recogn. 108, 107576 (2020)
[9]
Ma, K., Shu, Z., Bai, X., Wang, J., Samaras, D.: DocUNet: document image unwarping via a stacked U-Net. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4700–4709 (2018)
[10]
Markovitz A, Lavi I, Perel O, Mazor S, and Litman R Vedaldi A, Bischof H, Brox T, and Frahm J-M Can you read me now? Content aware rectification using angle supervision Computer Vision – ECCV 2020 2020 Cham Springer 208-223
[11]
Meijering E A chronology of interpolation: from ancient astronomy to modern signal and image processing Proc. IEEE 2002 90 3 319-342
[12]
Ramanna, V., Bukhari, S.S., Dengel, A.: Document image dewarping using deep learning. In: International Conference on Pattern Recognition Applications and Methods (2019)
[13]
Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. In: Neural Information Processing Systems (2015)
[14]
Sorkine, O.: Laplacian mesh processing. In: Eurographics (STARs), p. 29 (2005)
[15]
Stamatopoulos N, Gatos B, Pratikakis I, and Perantonis SJ Goal-oriented rectification of camera-based document images IEEE Trans. Image Process. 2010 20 4 910-920
[16]
Tian, Y., Narasimhan, S.G.: Rectification and 3D reconstruction of curved document images. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 377–384. IEEE (2011)
[17]
Tsoi, Y.C., Brown, M.S.: Multi-view document rectification using boundary. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–8. IEEE (2007)
[18]
Wada T, Ukida H, and Matsuyama T Shape from shading with interreflections under a proximal light source: Distortion-free copying of an unfolded book Int. J. Comput. Vision 1997 24 2 125-135
[19]
Wang N, Zhang Y, Li Z, Fu Y, Liu W, and Jiang Y-G Ferrari V, Hebert M, Sminchisescu C, and Weiss Y Pixel2Mesh: generating 3D mesh models from single RGB images Computer Vision – ECCV 2018 2018 Cham Springer 55-71
[20]
Wang, Z., Simoncelli, E.P., Bovik, A.C.: Multiscale structural similarity for image quality assessment. In: The Thrity-Seventh Asilomar Conference on Signals, Systems & Computers, 2003, vol. 2, pp. 1398–1402. IEEE (2003)
[21]
Xie G-W, Yin F, Zhang X-Y, and Liu C-L Bai X, Karatzas D, and Lopresti D Dewarping document image by displacement flow estimation with fully convolutional network Document Analysis Systems 2020 Cham Springer 131-144
[22]
You S, Matsushita Y, Sinha S, Bou Y, and Ikeuchi K Multiview rectification of folded documents IEEE Trans. Pattern Anal. Mach. Intell. 2017 40 2 505-511
[23]
Zhang L, Yip AM, Brown MS, and Tan CL A unified framework for document restoration using inpainting and shape-from-shading Pattern Recogn. 2009 42 11 2961-2978

Cited By

View all
  • (2024)Towards unified multi-granularity text detection with interactive attentionProceedings of the 41st International Conference on Machine Learning10.5555/3692070.3694116(50012-50025)Online publication date: 21-Jul-2024
  • (2024)Handheld Video Document Scanning: A Robust On-Device Model for Multi-Page Document ScanningProceedings of the ACM Symposium on Document Engineering 202410.1145/3685650.3685662(1-9)Online publication date: 20-Aug-2024
  • (2024)Table image dewarping with key element segmentationInternational Journal on Document Analysis and Recognition10.1007/s10032-024-00480-z27:3(349-362)Online publication date: 1-Sep-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Guide Proceedings
Document Analysis and Recognition – ICDAR 2021: 16th International Conference, Lausanne, Switzerland, September 5–10, 2021, Proceedings, Part I
Sep 2021
652 pages
ISBN:978-3-030-86548-1
DOI:10.1007/978-3-030-86549-8

Publisher

Springer-Verlag

Berlin, Heidelberg

Publication History

Published: 05 September 2021

Author Tags

  1. Dewarping document image
  2. Control points
  3. Deep learning

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 17 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Towards unified multi-granularity text detection with interactive attentionProceedings of the 41st International Conference on Machine Learning10.5555/3692070.3694116(50012-50025)Online publication date: 21-Jul-2024
  • (2024)Handheld Video Document Scanning: A Robust On-Device Model for Multi-Page Document ScanningProceedings of the ACM Symposium on Document Engineering 202410.1145/3685650.3685662(1-9)Online publication date: 20-Aug-2024
  • (2024)Table image dewarping with key element segmentationInternational Journal on Document Analysis and Recognition10.1007/s10032-024-00480-z27:3(349-362)Online publication date: 1-Sep-2024
  • (2024)Coarse-to-Fine Document Image Registration for DewarpingDocument Analysis and Recognition - ICDAR 202410.1007/978-3-031-70546-5_20(343-358)Online publication date: 30-Aug-2024
  • (2023)Layout-aware Single-image Document FlatteningACM Transactions on Graphics10.1145/362781843:1(1-17)Online publication date: 2-Nov-2023
  • (2023)UVDoc: Neural Grid-based Document UnwarpingSIGGRAPH Asia 2023 Conference Papers10.1145/3610548.3618174(1-11)Online publication date: 10-Dec-2023
  • (2023)Dewarping Document Image in Complex Scene by Geometric Control PointsPattern Recognition10.1007/978-3-031-47665-5_22(265-278)Online publication date: 5-Nov-2023
  • (2022)Geometric Representation Learning for Document Image RectificationComputer Vision – ECCV 202210.1007/978-3-031-19836-6_27(475-492)Online publication date: 23-Oct-2022

View Options

View options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media