Print-scan invariant text image watermarking for hardcopy document authentication

Multimedia Tools and Applications

Abstract

This paper proposes a contour feature-based text image watermarking scheme that is robust to the print-scan process. A multiplicative transformation model is used to approximate a geometric invariant feature that survives a variety of attacks during printing and scanning, so that it can serve as a reference point for both watermark embedding and extraction. Based on this print-scan invariant, boundary points of each character are flipped using Fourier descriptors that take visual perception characteristics into account, so that the watermark is embedded in visually nonsignificant points. When the print-scan invariant is computed, a designated text line serves as the benchmark line, so no additional characters need to be set aside for watermark adjustment, which greatly improves the hiding capacity. For detection, noise reduction and deskewing are applied first to compensate for the distortions introduced by the hardcopy. The watermark is then extracted by a parity check on the invariant feature of the connected components, for soft authentication. Experimental results show that the proposed approach is not limited to a particular language and achieves better robustness, watermark transparency, and hiding capacity than some existing methods.
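As a rough illustration of the parity-check idea described in the abstract, the following Python sketch embeds and extracts one bit per connected component. It is not the authors' algorithm: the feature (a black-pixel count per component), the quantization step `step`, and the function names are all assumptions made for this example. The only elements taken from the abstract are the multiplicative print-scan model, the benchmark line, and extraction by parity.

```python
# Hypothetical sketch: parity-check watermarking on a ratio feature.
# Assumption: print-scan scales every feature by roughly the same factor,
# so feature / mean(benchmark line features) is approximately invariant.

def invariant(feature, benchmark):
    """Ratio of a component feature to the mean feature of the benchmark line."""
    ref = sum(benchmark) / len(benchmark)
    return feature / ref

def embed_bit(feature, benchmark, bit, step=0.05):
    """Return an adjusted feature whose quantized invariant has parity == bit.

    In a real scheme the adjustment would be made by flipping boundary
    pixels; here the feature value is simply changed numerically.
    """
    ref = sum(benchmark) / len(benchmark)
    x = feature / ref / step
    q = round(x)
    if q % 2 != bit:
        # Move to the neighbouring quantization level nearest the original value.
        q += 1 if x >= q else -1
    return q * step * ref

def extract_bit(feature, benchmark, step=0.05):
    """Recover the bit as the parity of the quantized invariant."""
    return round(invariant(feature, benchmark) / step) % 2

# Made-up feature values for a benchmark line and four components.
benchmark = [400.0, 420.0, 380.0, 410.0]
features = [395.0, 433.0, 371.0, 502.0]
bits = [1, 0, 1, 1]

marked = [embed_bit(f, benchmark, b) for f, b in zip(features, bits)]

# Simulated print-scan: common scale factor plus small per-component noise.
scale = 1.07
noise = [1.5, -2.0, 0.8, -1.1]
scanned = [scale * m + n for m, n in zip(marked, noise)]
scanned_benchmark = [scale * v for v in benchmark]

extracted = [extract_bit(s, scanned_benchmark) for s in scanned]
```

In this sketch the common scale factor cancels in the ratio, and noise smaller than half a quantization step leaves the parity unchanged. A larger `step` tolerates more noise at the cost of a larger change to each component.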

Acknowledgments

This work was supported by the National Natural Science Foundation of China (Grants No. 61472136, 61772196, 61471170), the Scientific Research Project of Hunan Provincial Education Department for the Excellent Youth Scholars (Grant No. 16B142), the Key Project of Scientific Research Fund of Hunan Provincial Education Department (Grants No. 17A113, 16A114), and the Hunan Provincial Natural Science Foundation of China (Grant No. 2016JJ2070). The authors gratefully acknowledge the financial support provided by the Key Laboratory of Hunan Province for New Retail Virtual Reality Technology (2017TP1026). The authors would also like to thank the reviewers for their insightful comments, which greatly helped to improve the quality of this paper.

Author information

Corresponding author

Correspondence to Kai Hu.

Cite this article

Tan, L., Hu, K., Zhou, X. et al. Print-scan invariant text image watermarking for hardcopy document authentication. Multimed Tools Appl 78, 13189–13211 (2019). https://doi.org/10.1007/s11042-018-5771-5

  • DOI: https://doi.org/10.1007/s11042-018-5771-5