Document Filter for Writer Identification

Pignelli, Fabio; Oliveira, Luiz S.; Britto, Alceu S.; Costa, Yandre M. G.; Bertolini, Diego

doi:10.1007/978-3-030-96878-6_16

Conference paper
First Online: 02 March 2022

Part of the book series: Communications in Computer and Information Science (CCIS, volume 1527)

Abstract

Handwriting can serve as an important biometric modality for unequivocally identifying an individual. This is possible because the handwriting of two different people differs in ways that can be exploited either through graphometric properties or by treating the manuscript as a digital image and applying image processing techniques that capture its visual attributes (e.g., texture). In this work, we study in detail whether a dataset in which some writers contribute only a single sample may skew the results obtained in the experimental protocol. To this end, we propose what we call the “Document Filter” (DF). The Document Filter protocol is intended as a preprocessing step that places all data taken from fragments of the same document either in the training set or in the test set, never in both. The rationale is that the classifier must capture features of the writer, not particularities that could affect the writing in a specific document (e.g., the emotional state of the writer, the pen used, or the paper type). The literature contains several works on the writer identification problem. However, the performance of writer identification systems must also be evaluated taking into account the writer volunteers who contributed a single sample when the manuscript databases were created. To address this open issue, a comprehensive set of experiments was performed on the IAM, CVL and BFL databases. They show that, in the most extreme case, the recognition rate obtained using the DF protocol drops by 30.94 percentage points.
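The document-level constraint described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function name `document_filter_split`, the `(writer, document, fragment)` record layout, and the `n_test_docs` parameter are hypothetical, and setting aside writers who have too few documents is an assumption made here only to keep the example small.

```python
from collections import defaultdict


def document_filter_split(fragments, n_test_docs=1):
    """Split (writer, document, fragment) records at document level.

    All fragments of a given document go to exactly one side of the split.
    Assumption (not from the paper): writers with too few documents to
    appear on both sides are returned separately instead of being split.
    """
    if n_test_docs < 1:
        raise ValueError("n_test_docs must be at least 1")

    # writer -> document -> list of fragments
    docs_by_writer = defaultdict(lambda: defaultdict(list))
    for writer, doc, frag in fragments:
        docs_by_writer[writer][doc].append(frag)

    train, test, skipped_writers = [], [], []
    for writer, docs in docs_by_writer.items():
        doc_ids = sorted(docs)
        if len(doc_ids) <= n_test_docs:
            # e.g. a writer who contributed a single sample
            skipped_writers.append(writer)
            continue
        test_ids = set(doc_ids[-n_test_docs:])  # hold out the last documents
        for doc in doc_ids:
            target = test if doc in test_ids else train
            target.extend((writer, doc, frag) for frag in docs[doc])
    return train, test, skipped_writers


# Toy data: writer w1 has two documents, writer w2 only one.
fragments = [
    ("w1", "d1", "w1_d1_f0"), ("w1", "d1", "w1_d1_f1"),
    ("w1", "d2", "w1_d2_f0"), ("w1", "d2", "w1_d2_f1"),
    ("w2", "d3", "w2_d3_f0"), ("w2", "d3", "w2_d3_f1"),
]
train, test, skipped = document_filter_split(fragments)
```

In this toy run, both fragments of `d1` end up in `train`, both fragments of `d2` in `test`, and `w2` is reported in `skipped`. A fragment-level random split of the same data could instead place `w1_d1_f0` in training and `w1_d1_f1` in testing, which is the situation the DF protocol is meant to rule out.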

Acknowledgment

We thank the Brazilian research support agencies Coordination for the Improvement of Higher Education Personnel (CAPES) and National Council for Scientific and Technological Development (CNPq) for their financial support.

Author information

Authors and Affiliations

State University of Maringá, Maringá, PR, Brazil
Fabio Pignelli, Yandre M. G. Costa & Diego Bertolini
Federal University of Paraná, Curitiba, PR, Brazil
Luiz S. Oliveira
Pontifical Catholic University of Paraná, Curitiba, PR, Brazil
Alceu S. Britto Jr.
Federal Technological University of Paraná, Campo Mourão, PR, Brazil
Diego Bertolini

Editor information

Editors and Affiliations

Slovak University of Technology in Bratislava, Bratislava, Slovakia
Gregor Rozinaj
Slovak University of Technology in Bratislava, Bratislava, Slovakia
Radoslav Vargic

Cite this paper

Pignelli, F., Oliveira, L.S., Britto, A.S., Costa, Y.M.G., Bertolini, D. (2022). Document Filter for Writer Identification. In: Rozinaj, G., Vargic, R. (eds) Systems, Signals and Image Processing. IWSSIP 2021. Communications in Computer and Information Science, vol 1527. Springer, Cham. https://doi.org/10.1007/978-3-030-96878-6_16

DOI: https://doi.org/10.1007/978-3-030-96878-6_16
Published: 02 March 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-96877-9
Online ISBN: 978-3-030-96878-6
eBook Packages: Computer Science, Computer Science (R0)
