Skip to main content

esCam: A Mobile Application to Capture and Enhance Text Images

  • Conference paper
  • First Online:
Advances in Computational Intelligence (IWANN 2015)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 9095))

Included in the following conference series:

Abstract

Taking high resolution photos with mobile devices anytime anywhere is becoming increasingly common. Therefore, images of all kinds of text documents are recorded. This work presents esCam, an application for Android platform, whose goal is to preprocess the images of those text documents, in particular, perspective correction and image cleaning and enhancing. What truly differentiates our application is that esCam focuses on treatment of text that may appear in the image, using neural networks. These preprocessing steps are needed to make easier the digitalization and also to benefit subsequent steps such as document analysis and text recognition.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. http://www.dsic.upv.es/ mcastro/esCam/index_en.htm

  2. España Boquera, S., Castro-Bleda, M.J., Gorbe-Moya, J., Zamora-Martinez, F.: Improving offline handwritten text recognition with hybrid HMM/ANN models. IEEE Trans. PAMI 33(4), 767–779 (2011)

    Google Scholar 

  3. Gatos, B., Pratikakis, I., Perantonis, S.J.: Adaptive degraded document image binarization. Pattern Recognition 39(3), 317–327 (2006)

    Article  MATH  Google Scholar 

  4. Hidalgo, J.L., España, S., Castro, M.J., Pérez, J.A.: Enhancement and Cleaning of Handwritten Data by Using Neural Networks. In: Marques, J.S., Pérez de la Blanca, N., Pina, P. (eds.) IbPRIA 2005. LNCS, vol. 3522, pp. 376–383. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  5. Lichman, M.: UCI ML repository (2013). http://archive.ics.uci.edu/ml

  6. Mori, S., Suen, C.Y., Yamamoto, K.: Historical review of OCR research and development. Proceedings of the IEEE 80(7), 1029–1058 (1992)

    Article  Google Scholar 

  7. Nagy, G.: Twenty Years of Document Image Analysis in PAMI. IEEE Trans. PAMI 22(1), 38–62 (2000)

    Article  Google Scholar 

  8. Otsu, N.: A threshold selection method from gray-level histograms. Automatica 11(285–296), 23–27 (1975)

    Google Scholar 

  9. Pastor-Pellicer, J., Zamora-Mart\’ınez, F., España-Boquera, S., Castro-Bleda, M.J.: F-Measure as the Error Function to Train Neural Networks. In: Rojas, I., Joya, G., Gabestany, J. (eds.) IWANN 2013, Part I. LNCS, vol. 7902, pp. 376–384. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

  10. Sauvola, J., Pietikäinen, M.: Adaptive document image binarization. Pattern Recognition 33(2), 225–236 (2000)

    Article  Google Scholar 

  11. Zamora-Mart\’ınez, F., España-Boquera, S., Castro-Bleda, M.J.: Behaviour-Based Clustering of Neural Networks Applied to Document Enhancement. In: Sandoval, F., Prieto, A.G., Cabestany, J., Graña, M. (eds.) IWANN 2007. LNCS, vol. 4507, pp. 144–151. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  12. Zamora-Martínez, F., España-Boquera, S., Gorbe-Moya, J., Pastor-Pellicer, J., Palacios, A.: APRIL-ANN toolkit, A Pattern Recognizer In Lua with Artificial Neural Networks (2013). https://github.com/pakozm/april-ann

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to M. J. Castro-Bleda .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Pastor-Pellicer, J., Castro-Bleda, M.J., Adelantado-Torres, J.L. (2015). esCam: A Mobile Application to Capture and Enhance Text Images. In: Rojas, I., Joya, G., Catala, A. (eds) Advances in Computational Intelligence. IWANN 2015. Lecture Notes in Computer Science(), vol 9095. Springer, Cham. https://doi.org/10.1007/978-3-319-19222-2_50

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-19222-2_50

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-19221-5

  • Online ISBN: 978-3-319-19222-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics