Abstract
Paleography experts spend many hours transcribing ancient documents, and state-of-the-art handwritten text recognition systems are not suitable for performing this task automatically. We propose a new interactive, on-line framework that, rather than aiming at full automation, assists the experts in the recognition-transcription process itself; that is, it facilitates and speeds up the transcription of old documents. The framework combines the efficiency of automatic handwriting recognition systems with the accuracy of the experts, leading to a cost-effective, perfect transcription of ancient manuscripts.
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Romero, V., Toselli, A.H., Rodríguez, L., Vidal, E. (2007). Computer Assisted Transcription for Ancient Text Images. In: Kamel, M., Campilho, A. (eds) Image Analysis and Recognition. ICIAR 2007. Lecture Notes in Computer Science, vol 4633. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74260-9_105
DOI: https://doi.org/10.1007/978-3-540-74260-9_105
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74258-6
Online ISBN: 978-3-540-74260-9
eBook Packages: Computer Science, Computer Science (R0)