Skip to main content

Document Analysis Techniques for Automatic Electoral Document Processing: A Survey

  • Conference paper
  • First Online:
Book cover E-Voting and Identity (Vote-ID 2015)

Part of the book series: Lecture Notes in Computer Science ((LNSC,volume 9269))

Included in the following conference series:

Abstract

In this paper, we will discuss the most common challenges in electoral document processing and study the different solutions from the document analysis community that can be applied in each case. We will cover Optical Mark Recognition techniques to detect voter selections in the Australian Ballot, handwritten number recognition for preferential elections and handwriting recognition for write-in areas. We will also propose some particular adjustments that can be made to those general techniques in the specific context of electoral documents.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Citizen’s oversight projects (2009). www.copswiki.org/Cops/BallotStatements

  2. Elections ACT: Scanning of ballot papers (2015). http://www.elections.act.gov.au/elections_and_voting/scanning_of_ballot_papers

  3. Amin, A., Fischer, S.: A document skew detection method using the hough transform. Pattern Anal. Appl. 3(3), 243–253 (2000)

    Article  MATH  Google Scholar 

  4. Bernsen, J.: Dynamic thresholding of grey-level images. In: International Conference on Pattern Recognition (ICPR), pp. 1251–1255 (1986)

    Google Scholar 

  5. Ciresan, D.C., Meier, U., Schmidhuber, J.: Multi-column deep neural networks for image classification. In: IEEE Conference on Computer Vision and Pattern Recognition CVPR 2012 (2012), long preprint arXiv:1202.2745v1 [cs.CV]

  6. Ebrahimzadeh, R., Jampour, M.: Efficient handwritten digit recognition based on histogram of oriented gradients and svm. Int. J. Comput. Appl. 104(9), 10–13 (2014)

    Google Scholar 

  7. Fischer, A., Frinken, V., Bunke, H.: Hidden markov models for off-line cursive handwriting recognition. In: Govindaraju, V., Rao, C.R. (eds.) Handbook of Statistics: Machine Learning: Theory and Applications, vol. 31, p. 421. Elsevier, Amsterdam (2013)

    Chapter  Google Scholar 

  8. Graves, A., Liwicki, M., Fernández, S., Bertolami, R., Bunke, H., Schmidhuber, J.: A novel connectionist system for unconstrained handwriting recognition. IEEE Trans. Pattern Anal. Mach. Intell. (PAMI) 31(5), 855–868 (2009)

    Article  Google Scholar 

  9. Hinds, S.C., Fisher, J.L., D’Amato, D.P.: A document skew detection method using run-length encoding and the hough transform. In: 10th International Conference on Pattern Recognition (ICPR), 1990, vol. 1, pp. 464–468. IEEE (1990)

    Google Scholar 

  10. Ji, T., Kim, E., Srikantan, R., Tsai, A., Cordero, A., Wagner, D.: An analysis of write-in marks on optical scan ballots. In: Proceedings of the 2011 Conference on Electronic Voting Technology/Workshop on Trustworthy Elections, EVT/WOTE 2011. USENIX Association, Berkeley (2011)

    Google Scholar 

  11. Keysers, D., Gollan, C., Ney, H.: Local context in non-linear deformation models for handwritten character recognition. In: 17th International Conference on Pattern Recognition (ICPR), 2004, vol. 4, pp. 511–514. IEEE (2004)

    Google Scholar 

  12. Kim, E., Carlini, N., Chang, A., Yiu, G., Wang, K., Wagner, D.: Improved support for machine-assisted ballot-level audits. In: Presented as part of the 2013 Electronic Voting Technology Workshop/Workshop on Trustworthy Elections. USENIX, Berkeley (2013). https://www.usenix.org/conference/evtwote13/workshop-program/presentation/Kim

  13. Le, D.S., Thoma, G.R., Wechsler, H.: Automated page orientation and skew angle detection for binary document images. Pattern Recogn. 27(10), 1325–1344 (1994)

    Article  Google Scholar 

  14. Lecun, Y., Cortes, C.: The MNIST database of handwritten digits. http://yann.lecun.com/exdb/mnist/

  15. Liu, C.L., Nakashima, K., Sako, H., Fujisawa, H.: Handwritten digit recognition: benchmarking of state-of-the-art techniques. Pattern Recogn. 36(10), 2271–2285 (2003)

    Article  MATH  Google Scholar 

  16. Niblack, W.: An Introduction to Digital Image Processing. Strandberg Publishing Company, Birkerod (1985)

    Google Scholar 

  17. Otsu, N.: A threshold selection method from gray-level histograms. Automatica 11(285–296), 23–27 (1975)

    Google Scholar 

  18. Plötz, T., Fink, G.A.: Markov models for offline handwriting recognition: a survey. Int. J. Doc. Anal. Recogn. 12(4), 269–298 (2009)

    Article  Google Scholar 

  19. Pratikakis, I., Gatos, B., Ntirogiannis, K.: ICDAR 2013 document image binarization contest (DIBCO 2013). In: 12th International Conference on Document Analysis and Recognition (ICDAR), 2013, pp. 1471–1476. IEEE (2013)

    Google Scholar 

  20. Rabiner, L., Juang, B.H.: An introduction to hidden markov models. IEEE ASSP Mag. 3(1), 4–16 (1986)

    Article  Google Scholar 

  21. Sauvola, J., Pietikäinen, M.: Adaptive document image binarization. Pattern Recogn. 33(2), 225–236 (2000)

    Article  Google Scholar 

  22. Serra, J.: Introduction to mathematical morphology. Comput. Vis. Graph. Image Process. 35(3), 283–305 (1986)

    Article  MATH  Google Scholar 

  23. Sezgin, M., et al.: Survey over image thresholding techniques and quantitative performance evaluation. J. Electron. Imaging 13(1), 146–168 (2004)

    Article  MathSciNet  Google Scholar 

  24. Singh, C., Bhatia, N., Kaur, A.: Hough transform based fast skew detection and accurate skew correction methods. Pattern Recogn. 41(12), 3528–3546 (2008)

    Article  MATH  Google Scholar 

  25. Smith, E.H.B., Lopresti, D.P., Nagy, G.: Ballot mark detection. In: ICPR, pp. 1–4. IEEE (2008)

    Google Scholar 

  26. Smith, E.H.B., Lopresti, D.P., Nagy, G., Wu, Z.: Towards improved paper-based election technology. In: International Conference on Document Analysis and Recognition (ICDAR), pp. 1255–1259. IEEE (2011)

    Google Scholar 

  27. Smith, E.H.B., Nagy, G., Lopresti, D.P.: Mark detection from scanned ballots. In: Berkner, K., Likforman-Sulem, L. (eds.) DRR. SPIE Proceedings, vol. 7247, pp. 1–10. SPIE (2009)

    Google Scholar 

  28. Wang, K., Kim, E., Carlini, N., Motyashov, I., Nguyen, D., Wagner, D.: Operator-assisted tabulation of optical scan ballots. In: Presented as part of the 2012 Electronic Voting Technology Workshop/Workshop on Trustworthy Elections. USENIX, Berkeley (2012)

    Google Scholar 

  29. Xiu, P., Lopresti, D.P., Baird, H.S., Nagy, G., Smith, E.H.B.: Style-based ballot mark recognition. In: International Conference on Document Analysis and Recognition (ICDAR), pp. 216–220. IEEE (2009)

    Google Scholar 

Download references

Acknowledgements

We thank the reviewers for their suggestions and comments. This work has been partially supported by the Spanish project TIN2012-37475-C02-02 and the European project ERC-2010-AdG-20100407-269796 and by the Secretaria d’Universitats i Recerca del Departament d’Economia i Coneixement de la Generalitat de Catalunya.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to J. Ignacio Toledo .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Toledo, J.I., Cucurull, J., Puiggalí, J., Fornés, A., Lladós, J. (2015). Document Analysis Techniques for Automatic Electoral Document Processing: A Survey. In: Haenni, R., Koenig, R., Wikström, D. (eds) E-Voting and Identity. Vote-ID 2015. Lecture Notes in Computer Science(), vol 9269. Springer, Cham. https://doi.org/10.1007/978-3-319-22270-7_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-22270-7_8

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-22269-1

  • Online ISBN: 978-3-319-22270-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics