Skip to main content

Computer Assisted Transcription for Ancient Text Images

  • Conference paper
Book cover Image Analysis and Recognition (ICIAR 2007)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 4633))

Included in the following conference series:

Abstract

Paleography experts spend many hours transcribing ancient documents and state-of-the-art handwritten text recognition systems are not suitable for performing this task automatically. We propose here a new interactive, on-line framework which, rather than full automation, aims at assisting the experts in the proper recognition-transcription process; that is, facilitate and speed up the transcription of old documents. This framework combines the efficiency of automatic handwriting recognition systems with the accuracy of the experts, leading to a cost-effective perfect transcription of ancient manuscripts.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Cubel, E., Civera, J., Vilar, J.M., Lagarda, A.L., Vidal, E., Casacuberta, F., Picó, D., González, J., Rodríguez, L.: Finite-state models for computer assisted translation. In: Proceedings of the 16th European Conference on Artificial Intelligence (ECAI 2004), Valencia, Spain, pp. 586–590. IOS Press, Amsterdam (2004)

    Google Scholar 

  2. Civera, J., Vilar, J.M., Cubel, E., Lagarda, A.L., Barrachina, S., Casacuberta, F., Vidal, E., Picó, D., González, J.: A syntactic pattern recognition approach to computer assisted translation. In: Fred, A., Caelli, T.M., Duin, R.P.W., Campilho, A., de Ridder, D. (eds.) Structural, Syntactic, and Statistical Pattern Recognition. LNCS, vol. 3138, pp. 207–215. Springer, Heidelberg (2004)

    Google Scholar 

  3. Barrachina, S., Bender, O., Casacuberta, F., Civera, J., Cubel, E., Khadivi, S., Lagarda, A.L., Ney, H., Tomás, J., Vidal, E., Vilar, J.: Statistical approaches to computer-assited translation. In: Computational Linguistic (submitted 2006)

    Google Scholar 

  4. Rodriguez, L., Casacuberta, F., Vidal, E.: Computer Assisted Speech Transcription. In: Proceedings of the third Iberian Conference on Pattern Recognition and Image Analysis, Girona (Spain). LNCS, Springer, Heidelberg (2007)

    Google Scholar 

  5. Alabau, V., Benedí, J., Casacuberta, F., Juan, A., Martínez-Hinarejos, C., Pastor, M., Rodríguez, L., Sánchez, J., Sanchis, A., Vidal, E.: Pattern Recognition Approaches for Speech Recognition Applications. In: Pla, F., Radeva, P., Vitrià, J. (eds.) Pattern Recognition: Progress, Directions and Applications. Centre de Visió per Computador, pp. 21–40 (2006), ISBN 84-933652-6-2

    Google Scholar 

  6. Bazzi, I., Schwartz, R., Makhoul, J.: An Omnifont Open-Vocabulary OCR System for English and Arabic. IEEE Trans. on PAMI 21(6), 495–504 (1999)

    Google Scholar 

  7. Rabiner, L.: A Tutorial of Hidden Markov Models and Selected Application in Speech Recognition. In: Proc. IEEE, vol. 77, pp. 257–286 (1989)

    Google Scholar 

  8. Jelinek, F.: Statistical Methods for Speech Recognition. MIT Press, Cambridge (1998)

    Google Scholar 

  9. DRIRA, F.: Towards Restoring Historic Documents Degraded Over Time. In: DIAL 2006. Proceedings of the Second International Conference on Document Image Analysis for Libraries, Washington, DC, pp. 350–357. IEEE Computer Society Press, Los Alamitos (2006)

    Google Scholar 

  10. Toselli, A.H., Juan, A., Keysers, D., González, J., Salvador, I., Ney, H., Vidal, E., Casacuberta, F.: Integrated Handwriting Recognition and Interpretation using Finite-State Models. Int. Journal of Pattern Recognition and Artificial Intelligence 18(4), 519–539 (2004)

    Article  Google Scholar 

  11. Kavallieratou, E., Stamatatos, E.: Improving the quality of degraded document images. In: DIAL 2006. Proceedings of the Second International Conference on Document Image Analysis for Libraries, Washington, DC, pp. 340–349. IEEE Computer Society Press, Los Alamitos (2006)

    Google Scholar 

  12. Marti, U.-V., Bunke, H.: Using a Statistical Language Model to improve the preformance of an HMM-Based Cursive Handwriting Recognition System. Int. Journal of Pattern Recognition and Artificial Intelligence 15(1), 65–90 (2001)

    Article  Google Scholar 

  13. Pastor, M., Toselli, A., Vidal, E.: Projection profile based algorithm for slant removal. In: Campilho, A., Kamel, M. (eds.) ICIAR 2004. LNCS, vol. 3211, pp. 183–190. Springer, Heidelberg (2004)

    Google Scholar 

  14. Romero, V., Pastor, M., Toselli, A.H., Vidal, E.: Criteria for handwritten off-line text size normalization. In: VIIP 2006. Procc. of The Sixth IASTED international Conference on Visualization, Imaging, and Image Processing, Palma de Mallorca, Spain (2006)

    Google Scholar 

  15. Katz, S.M.: Estimation of Probabilities from Sparse Data for the Language Model Component of a Speech Recognizer. IEEE Trans. on Acoustics, Speech and Signal Processing ASSP-35, 400–401 (1987)

    Article  Google Scholar 

  16. Kneser, R., Ney, H.: Improved backing-off for N-gram language modeling. In: ICASSP. International Conference on Acoustics, Speech and Signal Processing, vol. 1, pp. 181–184 (1995)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Mohamed Kamel Aurélio Campilho

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Romero, V., Toselli, A.H., Rodríguez, L., Vidal, E. (2007). Computer Assisted Transcription for Ancient Text Images. In: Kamel, M., Campilho, A. (eds) Image Analysis and Recognition. ICIAR 2007. Lecture Notes in Computer Science, vol 4633. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74260-9_105

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-74260-9_105

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-74258-6

  • Online ISBN: 978-3-540-74260-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics