DOI: 10.1145/1815330.1815376
research-article

Latent Dirichlet allocation based writer identification in offline handwriting

Published: 09 June 2010

ABSTRACT

In this paper, we describe a novel approach to writer identification in offline handwriting using Latent Dirichlet Allocation (LDA). State-of-the-art methods for writer identification employ the traditional feature-classification paradigm, which does not provide enough information about handwriting attributes such as writing style, even though these attributes are key components of any forensic analysis of handwriting. The problem is compounded by the lack of efficient rules for defining a writing style that captures writer-specific characteristics over a large dataset. We propose to address this issue with a generative model, LDA, which automatically infers writing styles from a collection of handwritten documents without any predefined set of rules. This information is then used to represent each writer as a distribution over multiple writing styles, which in turn is used to classify any unknown writer sample. We apply our approach to two different feature sets: contour angle features, and structural and concavity features. Our experimental results show performance comparable to baseline systems and demonstrate the efficacy of LDA for learning multiple handwriting styles.
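
The idea of representing each writer as a distribution over latent writing styles can be illustrated with a small sketch. It is not the authors' implementation: it assumes that handwriting features have already been quantised into discrete codeword ids (the abstract does not say how features are turned into LDA "words"), it fits LDA with a plain collapsed Gibbs sampler, and it swaps in a nearest-neighbour rule on the style mixtures as a stand-in classifier. The names `fit_lda` and `identify`, the hyperparameters, and the toy data are all hypothetical.

```python
import random


def fit_lda(docs, n_topics, vocab_size, alpha=0.5, beta=0.1, n_iter=200, seed=0):
    """Collapsed Gibbs sampling for LDA; returns per-document topic proportions."""
    rng = random.Random(seed)
    doc_topic = [[0] * n_topics for _ in docs]                # tokens of doc d in topic k
    topic_word = [[0] * vocab_size for _ in range(n_topics)]  # codeword w in topic k
    topic_total = [0] * n_topics                              # all tokens in topic k
    assign = []
    # Give every token a random initial topic ("writing style").
    for d, doc in enumerate(docs):
        z_d = []
        for w in doc:
            k = rng.randrange(n_topics)
            z_d.append(k)
            doc_topic[d][k] += 1
            topic_word[k][w] += 1
            topic_total[k] += 1
        assign.append(z_d)
    for _ in range(n_iter):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                k = assign[d][i]
                # Remove the token's current assignment from the counts.
                doc_topic[d][k] -= 1
                topic_word[k][w] -= 1
                topic_total[k] -= 1
                # p(z = t | rest) is proportional to
                # (n_dt + alpha) * (n_tw + beta) / (n_t + V * beta)
                weights = [
                    (doc_topic[d][t] + alpha)
                    * (topic_word[t][w] + beta)
                    / (topic_total[t] + vocab_size * beta)
                    for t in range(n_topics)
                ]
                k = rng.choices(range(n_topics), weights=weights)[0]
                assign[d][i] = k
                doc_topic[d][k] += 1
                topic_word[k][w] += 1
                topic_total[k] += 1
    # Smoothed, normalised style mixture (theta) for each document.
    return [
        [(c + alpha) / (len(doc) + n_topics * alpha) for c in counts]
        for doc, counts in zip(docs, doc_topic)
    ]


def identify(theta_query, writer_thetas):
    """Return the writer whose style mixture is closest (L1 distance) to the query."""
    def l1(p, q):
        return sum(abs(a - b) for a, b in zip(p, q))
    return min(writer_thetas, key=lambda name: l1(theta_query, writer_thetas[name]))


# Toy data (assumed): each document is a list of codeword ids in 0..5.
docs = [
    [0, 1, 2, 0, 1, 2, 0, 1],  # sample from known writer "A"
    [3, 4, 5, 3, 4, 5, 3, 4],  # sample from known writer "B"
    [0, 2, 1, 1, 0, 2, 2, 0],  # sample from an unknown writer
]
thetas = fit_lda(docs, n_topics=2, vocab_size=6)
writer_thetas = {"A": thetas[0], "B": thetas[1]}
predicted = identify(thetas[2], writer_thetas)
print(predicted, thetas[2])
```

For brevity the unknown sample is fitted together with the known samples; a practical system would more likely infer its style mixture against a model trained only on known writers, and could feed the mixtures to any standard classifier in place of the nearest-neighbour rule used here.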


Published in

DAS '10: Proceedings of the 9th IAPR International Workshop on Document Analysis Systems
June 2010
490 pages
ISBN: 9781605587738
DOI: 10.1145/1815330

Copyright © 2010 ACM


Publisher

Association for Computing Machinery
New York, NY, United States
