Abstract
In this paper, a novel contour feature-based text image watermarking scheme against print and scan processes is proposed. We employ a mathematical multiplicative transformation model to approximate the geometric invariant feature that can survive a variety of attacks during the print-scan process and thus serve as reference points for both watermark embedding and extraction. Based on the print-scan invariant, the boundary points of each character are flipped using Fourier descriptors with visual perception characteristics, so that the watermarks are embedded into the visually nonsignificant points. In the calculation process of the print-scan invariant, a certain text line serves as the benchmark line without affording additional characters for watermark adjustment. Thus, the hiding capacity is greatly improved. For the data detection, noise reduction and deskewing mechanisms are performed previously to compensate for the distortions caused by hardcopy. The watermark is then extracted by parity check of the invariant feature of connected components for soft authentication. The experimental results show that the proposed approach is not limited to a particular language, and has better robustness, watermark transparency as well as hiding capacity compared with some existing methods.
Similar content being viewed by others
References
Alotaibi RA, Elrefaei LA (2017) Improved capacity arabic text watermarking methods based on open word space. J King Saud Univ Comput Inf Sci. https://doi.org/10.1016/j.jksuci.2016.12.007
Amiri SH, Jamzad M (2014) Robust watermarking against print and scan attack through efficient modeling algorithm. Elsevier Sci Inc 29(10):1181–1196
Chen Z, Ngo C, Zhang W, Cao J, Jiang Y (2014) Name-face association in web videos: a large-scale dataset, baselines, and open issues. J Comput Sci Technol 29(5):785–798
Culnane C, Treharne H, Ho ATS (2008) Improving multi-set formatted binary text watermarking using continuous line embedding. ICICIC, Kumamoto
Daraee F, Mozaffari S (2014) Watermarking in binary document images using fractal codes. Pattern Recogn Lett 35(1):120–129
Fang S, Xie H, Chen Z, Zhu S, Gu X, Gao X (2017) Detecting uyghur text in complex background images with convolutional neural network. Multimed Tools Appl 76(13):1–21
González-Lee M, Nakano-Miyatake M, Pérez-Meana H, Sánchez-Pérez G (2015) Script format document authentication scheme based on watermarking techniques. J Appl Res Technol 13(3):435–442
Guo Y, Au OC, Zhou J, Tang K, Fan X (2016) Halftone image watermarking via optimization. Signal Process Image Commun 41(C):85–100
Huang D, Yan H (2001) Interword distance changes represented by sine waves for watermarking text images. IEEE Trans Circuits Syst Video Technol 11(12):1237–1245
Jung KH, Yoo KY (2014) Data hiding method in binary images based on block masking for key authentication. Inf Sci 277(2):188–196
Kim YW, Oh IS (2004) Watermarking text document images using edge direction histograms. Pattern Recogn Lett 25(11):1243–1251
Li RJ, Chang LW (2006) Data hiding in binary images for annotation by parity check. International Symposium on Intelligent Signal Processing and Communications (pp.764–767). IEEE
Li CM, Hu P, Lau WC (2015) AuthPaper: Protecting paper-based documents and credentials using Authenticated 2D barcodes. IEEE International Conference on Communications (pp.7400–7406). IEEE
Liu N, Yu X, Wang C, Li C, Ma L, Lei J (2017) Energy-sharing model with price-based demand response for microgrids of peer-to-peer prosumers. IEEE Trans Power Syst 32(5):3569–3583
Lu H, Kot AC, Shi YQ (2004) Distance-reciprocal distortion measure for binary document images. IEEE Signal Process Lett 11(2):228–231
Qi W-F, Xiao-Long LI, Yang B, Cheng DF (2008) Document watermarking scheme for information tracking. J Commun 29(10):183–190
Sang J, Xu C (2012) Right buddy makes the difference: an early exploration of social relation analysis in multimedia applications. ACM International Conference on Multimedia (pp.19–28). ACM
Sang J, Xu C, Liu J (2012) User-aware image tag refinement via ternary semantic analysis. IEEE Trans Multimed 14(3):883–895
Sang J, Fang Q, Xu C (2017) Exploiting Social-Mobile Information for Location Visualization. ACM Transactions on Intelligent Systems and Technology, 8(3), Article No.:39
Smith EB, Qiu X (2003) Statistical image differences, degradation features, and character distance metrics. Doc Anal Recognit 6(3):146–153
Solachidis V, Pitas I (2004) Watermarking polygonal lines using fourier descriptors. IEEE Comput Graph Appl 24(3):44–51
Song Y, Chen J, Xie H, Chen Z, Gao X, Chen X (2017) Robust and parallel uyghur text localization in complex background images. Mach Vis Appl 9:1–15
Sugai T, Shimmyo U, Ito H, Suzuki M (2008) Development of Watermarking Method for Printed Documents. 70th Convention of the Information Processing Society of Japan, Tsukuba, Japan
Tan L, Sun X, Sun G (2012) Print-scan resilient text image watermarking based on stroke direction modulation for chinese document authentication. Radioengineering 21(1):170–181
Varna AL, Rane S, Vetro A (2009) Data hiding in hard-copy text documents robust to print, scan and photocopy operations. IEEE International Conference on Acoustics, Speech and Signal Processing (pp.1397–1400). IEEE
Villán R, Voloshynovskiy S, Koval O, Vila J, Topak E, Deguillaume F, et al (2006) Text data-hiding for digital and printed documents: theoretical and practical considerations. Spie-is&t Electronic Imaging, Security, Steganography, and Watermarking of Multimedia Contents VIII (Vol.6072, pp.607212–607212-11), CA, United States
Wang CC, Chang YF, Chang CC, Jan JK, Lin CC (2014) A high capacity data hiding scheme for binary images based on block patterns. J Syst Softw 93(2):152–162
Wu NI, Hwang MS (2017) Development of a data hiding scheme based on combination theory for lowering the visual noise in binary images. Displays 49:116–123
Wu M, Liu B (2004) Data hiding in binary image for authentication and annotation. IEEE Trans Multimed 6(4):528–538
Xie H, Zhang Y, Gao K, Tang S, Xu K, Guo L et al (2013) Robust common visual pattern discovery using graph matching. J Vis Commun Image Represent 24(5):635–646
Yang H, Kot AC, Rahardja S (2008) Orthogonal data embedding for binary images in morphological transform domain - a high-capacity approach. IEEE Trans Multimed 10(3):339–351
Acknowledgments
This work was supported by the National Natural Science Foundation of China (Grants No. 61472136, 61772196,61471170), the Scientific Research Project of Hunan Provincial Education Department for the Excellent Youth Scholars (Grant No. 16B142), the Key Project of Scientific Research Fund of Hunan Provincial Education Department (Grants No. 17A113, 16A114), and the Hunan Provincial Natural Science Foundation of China (Grant No. 2016JJ2070). The authors would like to thank the financial support provided by the Key Laboratory of Hunan Province for New Retail Virtual Reality Technology (2017TP1026). The authors would also like to thank the reviewers for their insightful comments, which have greatly helped to improve the quality of this paper.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Tan, L., Hu, K., Zhou, X. et al. Print-scan invariant text image watermarking for hardcopy document authentication. Multimed Tools Appl 78, 13189–13211 (2019). https://doi.org/10.1007/s11042-018-5771-5
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-018-5771-5