Skip to main content

Filled-in Document Identification Using Local Features and a Direct Voting Scheme

  • Conference paper
Pattern Recognition and Image Analysis (IbPRIA 2011)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 6669))

Included in the following conference series:

Abstract

In this work, an approach combining local representations with a direct voting scheme on a k-nearest neighbors classifier to identify filled-in document images is presented. A document class is represented by a high number of local feature vectors selected from its reference image using a given criterion. In the test phase, a number of vectors are equally selected from an image and used to classify it. The experimental results show that the parameterization is not critical, and good performances in terms of error-rate and processing time can be obtained, even though the test documents contain a large proportion of filled-in regions, obviously not present in the reference images.

Work partially supported by the Spanish MICINN grants TIN2009-14205-C04-02 and Consolider Ingenio 2010: MIPRCV (CSD2007-00018) and by IMPIVA and the E.U. by means of the ERDF in the context of the R+D Program for Technological Institutes of IMPIVA network for 2010 (IMIDIC-2010/191).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Arlandis, J., Perez-Cortes, J.C., Ungria, E.: Identification of very similar filled-in forms with a reject option. In: ICDAR, pp. 246–250 (2009)

    Google Scholar 

  2. Arya, S., Mount, D., Netanyahu, N., Silverman, R., Wu, A.: An optimal algorithm for approximate nearest neighbor searching. Journal of the ACM 45, 891–923 (1998)

    Article  MathSciNet  MATH  Google Scholar 

  3. Doermann, D.: The indexing and retrieval of document images: A survey. Computer Vision and Image Understanding 70(3), 287–298 (1998)

    Article  Google Scholar 

  4. Fan, K.-C., Chang, M.-L., Wang, Y.-K.: Form document identification using line structure based features. In: ICDAR, pp. 704–708 (2001)

    Google Scholar 

  5. Heroux, P., Diana, S., Ribert, A., Trupin, E.: Classification method study for automatic form class identification. In: Proc. 14th Int. Conf. on Pattern Recognition, ICPR 1998, pp. 926–928 (1998)

    Google Scholar 

  6. Kittler, J., Hatef, M., Duin, R.P.W., Matas, J.: On combining classifiers. IEEE Trans. Pattern Anal. Mach. Intell. 20(3), 226–239 (1998)

    Article  Google Scholar 

  7. Mandal, S., Chowdhury, S.P., Das, A.K., Chanda, B.: A hierarchical method for automated identification and segmentation of forms. In: International Conference on Document Analysis and Recognition, pp. 705–709 (2005)

    Google Scholar 

  8. Mohr, R., Picard, S., Schmid, C.: Bayesian decision versus voting for image retrieval. In: Sommer, G., Daniilidis, K., Pauli, J. (eds.) CAIP 1997. LNCS, vol. 1296, pp. 376–383. Springer, Heidelberg (1997)

    Chapter  Google Scholar 

  9. Nagasaki, T., Marukawa, K., Kagehiro, T., Sako, H.: A coupon classification method based on adaptive image vector matching. In: 18th International Conference on Pattern Recognition, pp. 280–283 (2006)

    Google Scholar 

  10. Dimmick, D.L., Garris, M.D.: Structured Forms Database 2, NIST Special Database 6 Technical Report and CD-ROM, National Institute of Standards and Technology (1992)

    Google Scholar 

  11. Ogata, H., Watanabe, S., Imaizumi, A., Yasue, T., Furukawa, N., Sako, H., Fujisawa, H.: Form-type identification for banking applications and its implementation issues. In: DRR, pp. 208–218 (2003)

    Google Scholar 

  12. Ohtera, R., Horiuchi, T.: Faxed form identification using histogram of the hough-space. In: International Conference on Pattern Recognition, vol. 2, pp. 566–569 (2004)

    Google Scholar 

  13. Paredes, R., Pérez-Cortes, J.C., Juan, A., Vidal, E.: Local representations and a direct voting scheme for face recognition. In: PRIS, pp. 71–79 (2001)

    Google Scholar 

  14. Sako, H., Seki, M., Furukawa, N., Ikeda, H., Imaizumi, A.: Form reading based on form-type identification and form-data recognition. In: International Conference on Document Analysis and Recognition, vol. 2, p. 926 (2003)

    Google Scholar 

  15. Ting, A., Leung, M.: Business form classification using strings. In: ICPR 1996, pages II: 690–694 (1996)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Arlandis, J., Castello-Fos, V., Perez-Cortes, JC. (2011). Filled-in Document Identification Using Local Features and a Direct Voting Scheme. In: Vitrià, J., Sanches, J.M., Hernández, M. (eds) Pattern Recognition and Image Analysis. IbPRIA 2011. Lecture Notes in Computer Science, vol 6669. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-21257-4_68

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-21257-4_68

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-21256-7

  • Online ISBN: 978-3-642-21257-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics