Abstract
Identifying Tattoo is an integral part of forensic investigation and crime identification. Tattoo text detection is challenging because of its freestyle handwriting over the skin region with a variety of decorations. This paper introduces Deformable Convolution and Inception based Neural Network (DCINN) for detecting tattoo text. Before tattoo text detection, the proposed approach detects skin regions in the tattoo images based on color models. This results in skin regions containing Tattoo text, which reduces the background complexity of the tattoo text detection problem. For detecting tattoo text in the skin regions, we explore a DCINN, which generates binary maps from the final feature maps using differential binarization technique. Finally, polygonal bounding boxes are generated from the binary map for any orientation of text. Experiments on our Tattoo-Text dataset and two standard datasets of natural scene text images, namely, Total-Text, CTW1500 show that the proposed method is effective in detecting Tattoo text as well as natural scene text in the images. Furthermore, the proposed method outperforms the existing text detection methods in several criteria.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Nandanwar, L., et al.: Forged text detection in video, scene, and document images. IET Image Process. 14(17), 4744–4755 (2021)
Nag, S., et al.: A new unified method for detecting text from marathon runners and sports players in video (PR-D-19-01078R2). Pattern Recogn. 107, 107476 (2020)
Roy, S., Shivakumara, P., Pal, U., Lu, T., Kumar, G.H.: Delaunay triangulation based text detection from multi-view images of natural scene. Pattern Recogn. Lett. 129, 92–100 (2020)
Long, S., Ruan, J., Zhang, W., He, X., Wu, W., Yao, C.: TextSnake: a flexible representation for detecting text of arbitrary shapes. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11206, pp. 19–35. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01216-8_2
Sun, Z.H., Baumes, J., Tunison, P., Turek, M., Hoogs, A.: Tattoo detection and localization using region based deep learning. In: Proceedings of the ICPR, pp. 3055–3060 (2016)
Han, H., Li, J., Jain, A.K., Shan, S., Chen, X.: Tattoo image search at scale: joint detection and compact representation learning. IEEE Trans. Pattern Anal. Mach. Intell. 41(10), 2333–2348 (2019)
Molloy, K., Wagstaff, D.: Effects of gender, self-rated attractiveness, and mate value on perceptions tattoos. Personality Individ. Differ 168, 110382 (2021)
Di, X., Patel, V.M.: Deep tattoo recognition. In: Proceedings of the ICVPRW, pp. 119–126 (2016)
Baek, Y., Lee, B., Han, D., Yun, S., Lee, H.: Character region awareness for text detection. In: Proceedings of the CVPR, pp. 9365–9374 (2019)
Liao, M., Wan, Z., Yao, C., Chen, K., Bai, X.: Real-time scene text detection with differentiable binarization. In: Proceedings of the AAAI (2020)
Wang, W., et al.: Shape robust text detection with progressive scale expansion network. In: Proceedings of the CVPR, pp. 9328–9337 (2019)
Ma, J., et al.: Arbitrary-oriented scene text detection via rotation proposals. IEEE Trans. Multimedia 20, 3111–3122 (2018)
Feng, W., He, W., Yin, F., Zhang, X.Y., Liu, C.L.: TextDragon: an end-to-end framework for arbitrary shaped text spotting. In: Proceedings of the ICCV, pp. 9076–9084 (2019)
Liao, M., Shi, B., Bai, X.: TextBoxes++: a single-shot oriented scene text detector. IEEE Trans. Image Process. 27(8), 3676–3690 (2018)
Raghunandan, K.S., Shivakumara, P., Roy, S., Kumar, G.H., Pal, U., Lu, T.: Multi-script-oriented text detection and recognition in video/scene/born digital images. IEEE Trans. Circuits Syst. Video Technol. 29, 1145–1162 (2019)
Xu, Y., Wang, Y., Zhou, W., Wang, Y., Yang, Z., Bai, X.: TextField: learning a deep direction field for irregular scene text detection. IEEE Trans. Image Process. 28, 5566–5579 (2019)
Cai, Y., Wang, W., Chen, Y., Ye, Q.: IOS-Net: an inside-to-outside supervision network for scale robust text detection in the wild. Pattern Recogn. 103, 107304 (2020)
He, K., Gkioxari, G., Dollár, P., Girshick, R.: Mask R-CNN. In: ICCV, pp. 2980–2988 (2017)
Huang, Z., Zhong, Z., Sun, L., Huo, Q.: Mask R-CNN with pyramid attention network for scene text detection. In: Proceedings of the ICCV, pp. 764–772 (2019)
Lyu, P., Liao, M., Yao, C., Wu, W., Bai, X.: Mask TextSpotter: an end-to-end trainable neural network for spotting text with arbitrary shapes. In: Proceedings of the ECCV, pp. 71–78 (2018)
Wang, S., Liu, Y., He, Z., Wang, Y., Tang, Z.: A quadrilateral scene text detector with two-stage network architecture. Pattern Recogn. 102, 107230 (2020)
Liu, Y., Chen, H., Shen, C., He, T., Jin, L., Wang, L.: ABCNet: real-time scene text spotting with adaptive Bezier curve network. In: Proceedings of the CVPR (2020)
Wang, C., Fu, H., Yang, L., Cao, X.: Text co-detection in multi-view scene. IEEE Trans. Image Process. 29, 4627–4642 (2020)
Ami, I.B., Basha, T., Avidan, S.: Racing bib number recognition. In: Proceedings of the BMCV, pp. 1–12 (2012)
Shivakumara, P., Raghavendra, R., Qin, L., Raja, K.B., Lu, T., Pal, U.: A new multi-modal approach to bib number/text detection and recognition in Marathon images. Pattern Recogn. 61, 479–491 (2017)
Nag, S., Ramachandra, R., Shivakumara, P., Pal, U., Lu, T., Kankanhalli, M.: CRNN based jersey number/text recognition in sports and marathon images. ICDAR, pp. 1149–1156 (2019)
Kamlesh, Xu, P., Yang, Y., Xu, Y.: Person re-identification with end-to-end scene text recognition. In: Proceedings of the CCCV, pp. 363–374 (2017)
Paracchini, M., Marcon, M., Villa, F., Tubaro, S.: Deep skin detection on low resolution grayscale images. Pattern Recogn. Lett. 131, 322–328 (2020)
Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S.: Going deeper with convolutions. In: Proceedings of the CVPR, pp. 1–9 (2015)
Zhu, X., Hu, H., Lin, S., Dai, J.:Deformable convnets v2: more deformable, better results. In: Proceedings of the CVPR 2019, pp. 9300–9308 (2019)
Yuliang, L., Lianwen, J., Shuaitao, Z., Sheng, Z.: Detecting curve text in the wild: new dataset and new solution. arXiv:1712.02170 (2017)
Chng, C.K., Chan, C.S.: Total-text: a comprehensive dataset for scene text detection and recognition. In: Proceedings of the ICDAR, pp. 935–942 (2017)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Chowdhury, T., Shivakumara, P., Pal, U., Lu, T., Raghavendra, R., Chanda, S. (2021). DCINN: Deformable Convolution and Inception Based Neural Network for Tattoo Text Detection Through Skin Region. In: Lladós, J., Lopresti, D., Uchida, S. (eds) Document Analysis and Recognition – ICDAR 2021. ICDAR 2021. Lecture Notes in Computer Science(), vol 12822. Springer, Cham. https://doi.org/10.1007/978-3-030-86331-9_22
Download citation
DOI: https://doi.org/10.1007/978-3-030-86331-9_22
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-86330-2
Online ISBN: 978-3-030-86331-9
eBook Packages: Computer ScienceComputer Science (R0)