Abstract
In this paper a new neuro-fuzzy system is proposed for both tasks of document analysis and Optical Character Recognition. FasART (Fuzzy adaptive system ART based) inherits the stability, flexibility and modularity properties of ART supervised models, but with a formal description as a Fuzzy Logic System, and increased functionality. On the other hand Recursive FasART permits us to exploit context information, crucial aspect in document understanding. Satisfactory experimental results are presented for the global application of building a digital library of scientific papers, giving special emphasis on the creation of links between items in table of contents and paper first pages.
Preview
Unable to display preview. Download preview PDF.
References
Bayer, T. A. Understanding structured text documents by a model based document analysis system. Proc. of the 2nd International Conference on Document Analysis and Recognition, Sukuba Science City, Japan, October 20–22 1993.
Bayer, T. A., Walischewski H. Experiments on extracting structural information from paper documents using syntactic pattern analysis. Proc. of the 3rd International Conference on Document Analysis and Recognition, Montreal, Canada, August 14–16, 1995, 476–479.
Cano, J. M., Dimitriadis, Y.A. Lopez, J. A fuzzy neural arquitecture for supervised learning and classification of temporal sequences. Proc. of International Congress on Artificial Neural Networks, Amsterdam, The Netherlands, 1993.
Cano, J. M., Dimitriadis, Y. A., Lopez, J. FasArt: A New Neuro-Fuzzy Architecture for Incremental Learning in System Identification. Proc. of Congress of International Federation on Automatic Control '96, Vol. F, 133–138, San Francisco, USA, June 30–July 5 1996.
Carpenter, G., Grossberg, S., Markuzon, N., Reynolds, J. Fuzzy ARTMAP: A Neural Network Architecture for Incremental Supervised Learning of Analog Multidimensional Maps. IEEE Transactions on Neural Networks, 3 698–713 1992.
Carpenter, G. A., Grossberg, S. A massively Parallel architecture for self-organizing neural pattern recognition machine. Computer Vision, Graphics, and Image Processing, 37 54–115 1987.
Dengel, A., Bleisinger, R., Hoch, R. From paper to office document standard representation. Computer, 25 63–67 1992.
Gladney, H. M., Ahmed, Z., Ashany, R., Belkin, N. J., Fox, E. A., Zemankova, M. Digital Library: Gross structure and Requirements. IBM Research Report RJ 9840 (May 1994).
Fujihara, H., Babiker, E., Simmons, D. B. Fuzzy Approach to Document Recognition. Proc. of the 2nd IEEE International Conference on Fuzzy Systems, 980–985, San Francisco, USA, March–April 1993.
Govindan, V. K., Shivasprased, A. P. Character recognition: A review. Pattern Recognition, 23 671–683 1990.
O' Gorman, L., Kasturi, R. Document Image Analysis. Chapter 4, IEEE Computer Society Press. 161–181, 1994
Le, D. X., Thomas, G. R., Wechsler, H. Document image analysis using integrated image and neural processing. Proc. of the 3rd International Conference on Document Analysis and Recognition, 327–330, Montreal, Canada, August 14–16 1995.
Myka, A. Putting Paper Documents in the World-Wide-Web. Proc. of the 2nd International WWW conference, October 17–20, 1994, Chicago, 199–208.
Myka, A., Guntzer, U. Full-text searches in OCR databses. In Adam, N.; Bhargava, B.; Halem, N.; Yesha, Y. (editors): Advances in Digital Libraries. Springer Verlag, 1996.
Nagy, G., Seth, S. C., Stoddard, S. D. Document analysis with an expert system. Proc. of Recognition in Practice II, Amsterdam, June 19–21 1985.
Sainz, G. I., Cano, J. M., Dimitriadis, Y. A., Lopez, J. Document Understanding Based on a Neuro-Fuzzy Approach. Proc. of International Conference on Engineering Applications of Neural Networks, Part II, London, UK, June 17–19 1996.
Sainz, G. I., Cano, J. M., Dimitriadis, Y. A., Lopez, J. A New Neuro-Fuzzy System for Logical Labeling of Documents. Proc. of The International Conference on Pattern Recognition, volume Track D: Parallel and Connectionist Systems, pp 431–436, Vienna, Austria, August 25–29 1996.
Sainz, G. I., Cano, J. M., Dimitriadis, Y. A., Lopez, J. Structured document labeling and rule extraction using a new recurrent fuzzy-neural system. Submited to ICDAR'97.
Satoh, S., Takasu, A., Katsura, E. An Automated Generation of Electronic Library based on Document Image Understanding. Proc. of International Conference on Document Analysis and Recognition, Vol. I, 63–166, Montreal, August 1995.
Srihari, S. N., Lam, S. W., Govindaraju, V., Srihari, R. K., Hull, J. J. Document Image Understanding. CEDAR-TR-92-1. (May 1992). Available at http://www.cedar.buffalo.edu/Publications/TechReps/Survey/survey.html
Wu, S., Manber, U. Fast text searching allowing errors. Communications of ACM 35 83–91 1992.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1997 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Guadarrama, R.S., Izquierdo, J.M.C., Dimitriadis, Y.A., Palmero, G.I.S., Coronado, J.L. (1997). Building digital libraries from paper documents, using ART based neuro-fuzzy systems. In: Mira, J., Moreno-Díaz, R., Cabestany, J. (eds) Biological and Artificial Computation: From Neuroscience to Technology. IWANN 1997. Lecture Notes in Computer Science, vol 1240. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0032487
Download citation
DOI: https://doi.org/10.1007/BFb0032487
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-63047-0
Online ISBN: 978-3-540-69074-0
eBook Packages: Springer Book Archive