Document Filter for Writer Identification

  • Conference paper

Part of the book series: Communications in Computer and Information Science (CCIS, volume 1527)

Abstract

Handwriting is an important biometric modality that can be used to identify an individual unequivocally. This is possible because the handwriting of two different people presents differences that can be explored either through graphometric properties or by treating the manuscript as a digital image and applying image processing techniques that capture its visual attributes (e.g. texture). In this work, we study in detail whether a dataset in which some writers contributed only a single sample may skew the results obtained under the experimental protocol. To this end, we propose what we call the “Document Filter” (DF). The Document Filter protocol is meant to be used as a preprocessing step: all data taken from fragments of the same document must be placed either in the training set or in the test set, never in both. The rationale is that the classifier must capture features of the writer, not features tied to particularities that could affect the writing in a specific document (e.g. the emotional state of the writer, the pen used, or the paper type). The literature contains several works on the writer identification problem. However, the performance of writer identification systems must also be evaluated taking into account the writers who contributed only a single sample when the manuscript databases were created. To address this open issue, a comprehensive set of experiments was performed on the IAM, CVL and BFL databases. They show that, in the most extreme case, the recognition rate obtained using the DF protocol drops by 30.94 percentage points.
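The constraint described in the abstract, that fragments of one document never appear on both sides of the split, can be illustrated with a minimal Python sketch. This is not the authors' implementation. The function name `document_filter_split`, the `(writer_id, document_id, fragment)` tuple layout, the rule for choosing the held-out document, and the handling of single-document writers (kept in training only) are all assumptions made for illustration; the abstract does not specify them.

```python
from collections import defaultdict


def document_filter_split(fragments):
    """Split fragments so that no document contributes to both sets.

    `fragments` is an iterable of (writer_id, document_id, fragment) tuples.
    Assumptions (not taken from the paper): for each writer, the document
    with the largest id is held out for testing, and writers with a single
    document are kept in the training set only.
    """
    fragments = list(fragments)

    # Collect the set of documents written by each writer.
    docs_by_writer = defaultdict(set)
    for writer_id, document_id, _ in fragments:
        docs_by_writer[writer_id].add(document_id)

    # Pick one held-out document per writer with at least two documents.
    test_docs = set()
    for writer_id, docs in docs_by_writer.items():
        if len(docs) >= 2:
            test_docs.add((writer_id, max(docs)))

    # Route every fragment according to the document it came from.
    train, test = [], []
    for item in fragments:
        writer_id, document_id, _ = item
        if (writer_id, document_id) in test_docs:
            test.append(item)
        else:
            train.append(item)
    return train, test


# Hypothetical toy data: writer w1 has two documents, writer w2 has one.
fragments = [
    ("w1", "d1", "f1"), ("w1", "d1", "f2"),
    ("w1", "d2", "f3"), ("w1", "d2", "f4"),
    ("w2", "d3", "f5"), ("w2", "d3", "f6"),
]
train, test = document_filter_split(fragments)
train_docs = {(w, d) for w, d, _ in train}
held_out_docs = {(w, d) for w, d, _ in test}
```

In this toy example the two sets of documents are disjoint, so a classifier evaluated on `test` cannot rely on document-specific cues such as pen or paper type. A fragment-level random split, by contrast, could put `f1` in training and `f2` in testing even though both come from document `d1`.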

Acknowledgment

We thank the Brazilian research support agencies: Coordination for the Improvement of Higher Education Personnel (CAPES), and National Council for Scientific and Technological Development (CNPq) for their financial support.

Copyright information

© 2022 Springer Nature Switzerland AG

About this paper

Cite this paper

Pignelli, F., Oliveira, L.S., Britto, A.S., Costa, Y.M.G., Bertolini, D. (2022). Document Filter for Writer Identification. In: Rozinaj, G., Vargic, R. (eds) Systems, Signals and Image Processing. IWSSIP 2021. Communications in Computer and Information Science, vol 1527. Springer, Cham. https://doi.org/10.1007/978-3-030-96878-6_16

  • DOI: https://doi.org/10.1007/978-3-030-96878-6_16

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-96877-9

  • Online ISBN: 978-3-030-96878-6

  • eBook Packages: Computer Science, Computer Science (R0)
