Computer Assisted Transcription for Ancient Text Images

Romero, Verónica; Toselli, Alejandro H.; Rodríguez, Luis; Vidal, Enrique

doi:10.1007/978-3-540-74260-9_105

Verónica Romero¹,
Alejandro H. Toselli¹,
Luis Rodríguez² &
…
Enrique Vidal¹

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 4633))

Included in the following conference series:

International Conference Image Analysis and Recognition

2344 Accesses
14 Citations

Abstract

Paleography experts spend many hours transcribing ancient documents and state-of-the-art handwritten text recognition systems are not suitable for performing this task automatically. We propose here a new interactive, on-line framework which, rather than full automation, aims at assisting the experts in the proper recognition-transcription process; that is, facilitate and speed up the transcription of old documents. This framework combines the efficiency of automatic handwriting recognition systems with the accuracy of the experts, leading to a cost-effective perfect transcription of ancient manuscripts.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Cubel, E., Civera, J., Vilar, J.M., Lagarda, A.L., Vidal, E., Casacuberta, F., Picó, D., González, J., Rodríguez, L.: Finite-state models for computer assisted translation. In: Proceedings of the 16th European Conference on Artificial Intelligence (ECAI 2004), Valencia, Spain, pp. 586–590. IOS Press, Amsterdam (2004)
Google Scholar
Civera, J., Vilar, J.M., Cubel, E., Lagarda, A.L., Barrachina, S., Casacuberta, F., Vidal, E., Picó, D., González, J.: A syntactic pattern recognition approach to computer assisted translation. In: Fred, A., Caelli, T.M., Duin, R.P.W., Campilho, A., de Ridder, D. (eds.) Structural, Syntactic, and Statistical Pattern Recognition. LNCS, vol. 3138, pp. 207–215. Springer, Heidelberg (2004)
Google Scholar
Barrachina, S., Bender, O., Casacuberta, F., Civera, J., Cubel, E., Khadivi, S., Lagarda, A.L., Ney, H., Tomás, J., Vidal, E., Vilar, J.: Statistical approaches to computer-assited translation. In: Computational Linguistic (submitted 2006)
Google Scholar
Rodriguez, L., Casacuberta, F., Vidal, E.: Computer Assisted Speech Transcription. In: Proceedings of the third Iberian Conference on Pattern Recognition and Image Analysis, Girona (Spain). LNCS, Springer, Heidelberg (2007)
Google Scholar
Alabau, V., Benedí, J., Casacuberta, F., Juan, A., Martínez-Hinarejos, C., Pastor, M., Rodríguez, L., Sánchez, J., Sanchis, A., Vidal, E.: Pattern Recognition Approaches for Speech Recognition Applications. In: Pla, F., Radeva, P., Vitrià, J. (eds.) Pattern Recognition: Progress, Directions and Applications. Centre de Visió per Computador, pp. 21–40 (2006), ISBN 84-933652-6-2
Google Scholar
Bazzi, I., Schwartz, R., Makhoul, J.: An Omnifont Open-Vocabulary OCR System for English and Arabic. IEEE Trans. on PAMI 21(6), 495–504 (1999)
Google Scholar
Rabiner, L.: A Tutorial of Hidden Markov Models and Selected Application in Speech Recognition. In: Proc. IEEE, vol. 77, pp. 257–286 (1989)
Google Scholar
Jelinek, F.: Statistical Methods for Speech Recognition. MIT Press, Cambridge (1998)
Google Scholar
DRIRA, F.: Towards Restoring Historic Documents Degraded Over Time. In: DIAL 2006. Proceedings of the Second International Conference on Document Image Analysis for Libraries, Washington, DC, pp. 350–357. IEEE Computer Society Press, Los Alamitos (2006)
Google Scholar
Toselli, A.H., Juan, A., Keysers, D., González, J., Salvador, I., Ney, H., Vidal, E., Casacuberta, F.: Integrated Handwriting Recognition and Interpretation using Finite-State Models. Int. Journal of Pattern Recognition and Artificial Intelligence 18(4), 519–539 (2004)
Article Google Scholar
Kavallieratou, E., Stamatatos, E.: Improving the quality of degraded document images. In: DIAL 2006. Proceedings of the Second International Conference on Document Image Analysis for Libraries, Washington, DC, pp. 340–349. IEEE Computer Society Press, Los Alamitos (2006)
Google Scholar
Marti, U.-V., Bunke, H.: Using a Statistical Language Model to improve the preformance of an HMM-Based Cursive Handwriting Recognition System. Int. Journal of Pattern Recognition and Artificial Intelligence 15(1), 65–90 (2001)
Article Google Scholar
Pastor, M., Toselli, A., Vidal, E.: Projection profile based algorithm for slant removal. In: Campilho, A., Kamel, M. (eds.) ICIAR 2004. LNCS, vol. 3211, pp. 183–190. Springer, Heidelberg (2004)
Google Scholar
Romero, V., Pastor, M., Toselli, A.H., Vidal, E.: Criteria for handwritten off-line text size normalization. In: VIIP 2006. Procc. of The Sixth IASTED international Conference on Visualization, Imaging, and Image Processing, Palma de Mallorca, Spain (2006)
Google Scholar
Katz, S.M.: Estimation of Probabilities from Sparse Data for the Language Model Component of a Speech Recognizer. IEEE Trans. on Acoustics, Speech and Signal Processing ASSP-35, 400–401 (1987)
Article Google Scholar
Kneser, R., Ney, H.: Improved backing-off for N-gram language modeling. In: ICASSP. International Conference on Acoustics, Speech and Signal Processing, vol. 1, pp. 181–184 (1995)
Google Scholar

Download references

Author information

Authors and Affiliations

Instituto Tecnológico de Informática, Universidad Politécnica de Valencia, Camí de Vera s/n, 46022 València, Spain)
Verónica Romero, Alejandro H. Toselli & Enrique Vidal
Departamento de Sistemas Informáticos, Universidad de Castilla La Mancha., Spain
Luis Rodríguez

Authors

Verónica Romero
View author publications
You can also search for this author in PubMed Google Scholar
Alejandro H. Toselli
View author publications
You can also search for this author in PubMed Google Scholar
Luis Rodríguez
View author publications
You can also search for this author in PubMed Google Scholar
Enrique Vidal
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Mohamed Kamel Aurélio Campilho

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Romero, V., Toselli, A.H., Rodríguez, L., Vidal, E. (2007). Computer Assisted Transcription for Ancient Text Images. In: Kamel, M., Campilho, A. (eds) Image Analysis and Recognition. ICIAR 2007. Lecture Notes in Computer Science, vol 4633. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74260-9_105

Download citation

DOI: https://doi.org/10.1007/978-3-540-74260-9_105
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74258-6
Online ISBN: 978-3-540-74260-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics