Abstract
In this work, an approach combining local representations with a direct voting scheme on a k-nearest neighbors classifier to identify filled-in document images is presented. A document class is represented by a high number of local feature vectors selected from its reference image using a given criterion. In the test phase, a number of vectors are equally selected from an image and used to classify it. The experimental results show that the parameterization is not critical, and good performances in terms of error-rate and processing time can be obtained, even though the test documents contain a large proportion of filled-in regions, obviously not present in the reference images.
Work partially supported by the Spanish MICINN grants TIN2009-14205-C04-02 and Consolider Ingenio 2010: MIPRCV (CSD2007-00018) and by IMPIVA and the E.U. by means of the ERDF in the context of the R+D Program for Technological Institutes of IMPIVA network for 2010 (IMIDIC-2010/191).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Arlandis, J., Perez-Cortes, J.C., Ungria, E.: Identification of very similar filled-in forms with a reject option. In: ICDAR, pp. 246–250 (2009)
Arya, S., Mount, D., Netanyahu, N., Silverman, R., Wu, A.: An optimal algorithm for approximate nearest neighbor searching. Journal of the ACM 45, 891–923 (1998)
Doermann, D.: The indexing and retrieval of document images: A survey. Computer Vision and Image Understanding 70(3), 287–298 (1998)
Fan, K.-C., Chang, M.-L., Wang, Y.-K.: Form document identification using line structure based features. In: ICDAR, pp. 704–708 (2001)
Heroux, P., Diana, S., Ribert, A., Trupin, E.: Classification method study for automatic form class identification. In: Proc. 14th Int. Conf. on Pattern Recognition, ICPR 1998, pp. 926–928 (1998)
Kittler, J., Hatef, M., Duin, R.P.W., Matas, J.: On combining classifiers. IEEE Trans. Pattern Anal. Mach. Intell. 20(3), 226–239 (1998)
Mandal, S., Chowdhury, S.P., Das, A.K., Chanda, B.: A hierarchical method for automated identification and segmentation of forms. In: International Conference on Document Analysis and Recognition, pp. 705–709 (2005)
Mohr, R., Picard, S., Schmid, C.: Bayesian decision versus voting for image retrieval. In: Sommer, G., Daniilidis, K., Pauli, J. (eds.) CAIP 1997. LNCS, vol. 1296, pp. 376–383. Springer, Heidelberg (1997)
Nagasaki, T., Marukawa, K., Kagehiro, T., Sako, H.: A coupon classification method based on adaptive image vector matching. In: 18th International Conference on Pattern Recognition, pp. 280–283 (2006)
Dimmick, D.L., Garris, M.D.: Structured Forms Database 2, NIST Special Database 6 Technical Report and CD-ROM, National Institute of Standards and Technology (1992)
Ogata, H., Watanabe, S., Imaizumi, A., Yasue, T., Furukawa, N., Sako, H., Fujisawa, H.: Form-type identification for banking applications and its implementation issues. In: DRR, pp. 208–218 (2003)
Ohtera, R., Horiuchi, T.: Faxed form identification using histogram of the hough-space. In: International Conference on Pattern Recognition, vol. 2, pp. 566–569 (2004)
Paredes, R., Pérez-Cortes, J.C., Juan, A., Vidal, E.: Local representations and a direct voting scheme for face recognition. In: PRIS, pp. 71–79 (2001)
Sako, H., Seki, M., Furukawa, N., Ikeda, H., Imaizumi, A.: Form reading based on form-type identification and form-data recognition. In: International Conference on Document Analysis and Recognition, vol. 2, p. 926 (2003)
Ting, A., Leung, M.: Business form classification using strings. In: ICPR 1996, pages II: 690–694 (1996)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Arlandis, J., Castello-Fos, V., Perez-Cortes, JC. (2011). Filled-in Document Identification Using Local Features and a Direct Voting Scheme. In: Vitrià, J., Sanches, J.M., Hernández, M. (eds) Pattern Recognition and Image Analysis. IbPRIA 2011. Lecture Notes in Computer Science, vol 6669. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-21257-4_68
Download citation
DOI: https://doi.org/10.1007/978-3-642-21257-4_68
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-21256-7
Online ISBN: 978-3-642-21257-4
eBook Packages: Computer ScienceComputer Science (R0)