DCINN: Deformable Convolution and Inception Based Neural Network for Tattoo Text Detection Through Skin Region

Chowdhury, Tamal; Shivakumara, Palaiahnakote; Pal, Umapada; Lu, Tong; Raghavendra, Ramachandra; Chanda, Sukalpa

doi:10.1007/978-3-030-86331-9_22

DCINN: Deformable Convolution and Inception Based Neural Network for Tattoo Text Detection Through Skin Region

Tamal Chowdhury¹¹,
Palaiahnakote Shivakumara¹²,
Umapada Pal¹¹,
Tong Lu¹³,
Ramachandra Raghavendra¹⁴ &
…
Sukalpa Chanda¹⁵

Conference paper
First Online: 02 September 2021

3513 Accesses
4 Citations

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 12822))

Abstract

Identifying Tattoo is an integral part of forensic investigation and crime identification. Tattoo text detection is challenging because of its freestyle handwriting over the skin region with a variety of decorations. This paper introduces Deformable Convolution and Inception based Neural Network (DCINN) for detecting tattoo text. Before tattoo text detection, the proposed approach detects skin regions in the tattoo images based on color models. This results in skin regions containing Tattoo text, which reduces the background complexity of the tattoo text detection problem. For detecting tattoo text in the skin regions, we explore a DCINN, which generates binary maps from the final feature maps using differential binarization technique. Finally, polygonal bounding boxes are generated from the binary map for any orientation of text. Experiments on our Tattoo-Text dataset and two standard datasets of natural scene text images, namely, Total-Text, CTW1500 show that the proposed method is effective in detecting Tattoo text as well as natural scene text in the images. Furthermore, the proposed method outperforms the existing text detection methods in several criteria.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Nandanwar, L., et al.: Forged text detection in video, scene, and document images. IET Image Process. 14(17), 4744–4755 (2021)
Article Google Scholar
Nag, S., et al.: A new unified method for detecting text from marathon runners and sports players in video (PR-D-19-01078R2). Pattern Recogn. 107, 107476 (2020)
Article Google Scholar
Roy, S., Shivakumara, P., Pal, U., Lu, T., Kumar, G.H.: Delaunay triangulation based text detection from multi-view images of natural scene. Pattern Recogn. Lett. 129, 92–100 (2020)
Article Google Scholar
Long, S., Ruan, J., Zhang, W., He, X., Wu, W., Yao, C.: TextSnake: a flexible representation for detecting text of arbitrary shapes. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11206, pp. 19–35. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01216-8_2
Chapter Google Scholar
Sun, Z.H., Baumes, J., Tunison, P., Turek, M., Hoogs, A.: Tattoo detection and localization using region based deep learning. In: Proceedings of the ICPR, pp. 3055–3060 (2016)
Google Scholar
Han, H., Li, J., Jain, A.K., Shan, S., Chen, X.: Tattoo image search at scale: joint detection and compact representation learning. IEEE Trans. Pattern Anal. Mach. Intell. 41(10), 2333–2348 (2019)
Article Google Scholar
Molloy, K., Wagstaff, D.: Effects of gender, self-rated attractiveness, and mate value on perceptions tattoos. Personality Individ. Differ 168, 110382 (2021)
Article Google Scholar
Di, X., Patel, V.M.: Deep tattoo recognition. In: Proceedings of the ICVPRW, pp. 119–126 (2016)
Google Scholar
Baek, Y., Lee, B., Han, D., Yun, S., Lee, H.: Character region awareness for text detection. In: Proceedings of the CVPR, pp. 9365–9374 (2019)
Google Scholar
Liao, M., Wan, Z., Yao, C., Chen, K., Bai, X.: Real-time scene text detection with differentiable binarization. In: Proceedings of the AAAI (2020)
Google Scholar
Wang, W., et al.: Shape robust text detection with progressive scale expansion network. In: Proceedings of the CVPR, pp. 9328–9337 (2019)
Google Scholar
Ma, J., et al.: Arbitrary-oriented scene text detection via rotation proposals. IEEE Trans. Multimedia 20, 3111–3122 (2018)
Article Google Scholar
Feng, W., He, W., Yin, F., Zhang, X.Y., Liu, C.L.: TextDragon: an end-to-end framework for arbitrary shaped text spotting. In: Proceedings of the ICCV, pp. 9076–9084 (2019)
Google Scholar
Liao, M., Shi, B., Bai, X.: TextBoxes++: a single-shot oriented scene text detector. IEEE Trans. Image Process. 27(8), 3676–3690 (2018)
Article MathSciNet Google Scholar
Raghunandan, K.S., Shivakumara, P., Roy, S., Kumar, G.H., Pal, U., Lu, T.: Multi-script-oriented text detection and recognition in video/scene/born digital images. IEEE Trans. Circuits Syst. Video Technol. 29, 1145–1162 (2019)
Article Google Scholar
Xu, Y., Wang, Y., Zhou, W., Wang, Y., Yang, Z., Bai, X.: TextField: learning a deep direction field for irregular scene text detection. IEEE Trans. Image Process. 28, 5566–5579 (2019)
Article MathSciNet Google Scholar
Cai, Y., Wang, W., Chen, Y., Ye, Q.: IOS-Net: an inside-to-outside supervision network for scale robust text detection in the wild. Pattern Recogn. 103, 107304 (2020)
Article Google Scholar
He, K., Gkioxari, G., Dollár, P., Girshick, R.: Mask R-CNN. In: ICCV, pp. 2980–2988 (2017)
Google Scholar
Huang, Z., Zhong, Z., Sun, L., Huo, Q.: Mask R-CNN with pyramid attention network for scene text detection. In: Proceedings of the ICCV, pp. 764–772 (2019)
Google Scholar
Lyu, P., Liao, M., Yao, C., Wu, W., Bai, X.: Mask TextSpotter: an end-to-end trainable neural network for spotting text with arbitrary shapes. In: Proceedings of the ECCV, pp. 71–78 (2018)
Google Scholar
Wang, S., Liu, Y., He, Z., Wang, Y., Tang, Z.: A quadrilateral scene text detector with two-stage network architecture. Pattern Recogn. 102, 107230 (2020)
Article Google Scholar
Liu, Y., Chen, H., Shen, C., He, T., Jin, L., Wang, L.: ABCNet: real-time scene text spotting with adaptive Bezier curve network. In: Proceedings of the CVPR (2020)
Google Scholar
Wang, C., Fu, H., Yang, L., Cao, X.: Text co-detection in multi-view scene. IEEE Trans. Image Process. 29, 4627–4642 (2020)
Article MathSciNet Google Scholar
Ami, I.B., Basha, T., Avidan, S.: Racing bib number recognition. In: Proceedings of the BMCV, pp. 1–12 (2012)
Google Scholar
Shivakumara, P., Raghavendra, R., Qin, L., Raja, K.B., Lu, T., Pal, U.: A new multi-modal approach to bib number/text detection and recognition in Marathon images. Pattern Recogn. 61, 479–491 (2017)
Article Google Scholar
Nag, S., Ramachandra, R., Shivakumara, P., Pal, U., Lu, T., Kankanhalli, M.: CRNN based jersey number/text recognition in sports and marathon images. ICDAR, pp. 1149–1156 (2019)
Google Scholar
Kamlesh, Xu, P., Yang, Y., Xu, Y.: Person re-identification with end-to-end scene text recognition. In: Proceedings of the CCCV, pp. 363–374 (2017)
Google Scholar
Paracchini, M., Marcon, M., Villa, F., Tubaro, S.: Deep skin detection on low resolution grayscale images. Pattern Recogn. Lett. 131, 322–328 (2020)
Article Google Scholar
Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S.: Going deeper with convolutions. In: Proceedings of the CVPR, pp. 1–9 (2015)
Google Scholar
Zhu, X., Hu, H., Lin, S., Dai, J.:Deformable convnets v2: more deformable, better results. In: Proceedings of the CVPR 2019, pp. 9300–9308 (2019)
Google Scholar
Yuliang, L., Lianwen, J., Shuaitao, Z., Sheng, Z.: Detecting curve text in the wild: new dataset and new solution. arXiv:1712.02170 (2017)
Chng, C.K., Chan, C.S.: Total-text: a comprehensive dataset for scene text detection and recognition. In: Proceedings of the ICDAR, pp. 935–942 (2017)
Google Scholar

Download references

Author information

Authors and Affiliations

Computer Vision and Pattern Recognition Unit, Indian Statistical Institute, Kolkata, India
Tamal Chowdhury & Umapada Pal
Faculty of Computer Science and Information Technology, University of Malaya, Kuala Lumpur, Malaysia
Palaiahnakote Shivakumara
National Key Lab for Novel Software Technology, Nanjing University, Nanjing, China
Tong Lu
Faculty of Information Technology and Electrical Engineering, IIK, NTNU, Trondheim, Norway
Ramachandra Raghavendra
Department of Information Technology, Østfold University College, Halden, Norway
Sukalpa Chanda

Authors

Tamal Chowdhury
View author publications
You can also search for this author in PubMed Google Scholar
Palaiahnakote Shivakumara
View author publications
You can also search for this author in PubMed Google Scholar
Umapada Pal
View author publications
You can also search for this author in PubMed Google Scholar
Tong Lu
View author publications
You can also search for this author in PubMed Google Scholar
Ramachandra Raghavendra
View author publications
You can also search for this author in PubMed Google Scholar
Sukalpa Chanda
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ramachandra Raghavendra .

Editor information

Editors and Affiliations

Universitat Autònoma de Barcelona, Barcelona, Spain
Josep Lladós
Lehigh University, Bethlehem, PA, USA
Daniel Lopresti
Kyushu University, Fukuoka-shi, Japan
Seiichi Uchida

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chowdhury, T., Shivakumara, P., Pal, U., Lu, T., Raghavendra, R., Chanda, S. (2021). DCINN: Deformable Convolution and Inception Based Neural Network for Tattoo Text Detection Through Skin Region. In: Lladós, J., Lopresti, D., Uchida, S. (eds) Document Analysis and Recognition – ICDAR 2021. ICDAR 2021. Lecture Notes in Computer Science(), vol 12822. Springer, Cham. https://doi.org/10.1007/978-3-030-86331-9_22

Download citation

DOI: https://doi.org/10.1007/978-3-030-86331-9_22
Published: 02 September 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-86330-2
Online ISBN: 978-3-030-86331-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The International Association for Pattern Recognition (opens in a new tab)