Abstract
The DocMining platform aims to provide a general framework for document interpretation. It integrates document processing units from different sources, which communicate through the document being interpreted. A task is represented by a scenario that describes the units to be run, and each unit is associated with a contract that describes its parameters, data and results, as well as how to run it. A controller interprets the scenario and triggers each required document processing unit in turn. Documents, scenarios and contracts are all represented in XML to make data manipulation and communication easier.
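To make the scenario/contract/controller division concrete, here is a minimal sketch in Python. It is not the DocMining implementation or its XML schema: every element name, attribute name (`step`, `contract`, `entry`, `reads`, `writes`) and both toy processing units are assumptions invented for illustration. It only shows how a controller might read a scenario, consult each unit's contract to learn how to call it, and let units communicate solely through the shared document.

```python
import xml.etree.ElementTree as ET

# Hypothetical scenario: the ordered units to run, with parameter values.
SCENARIO_XML = """<scenario name="demo">
  <step unit="binarize"><param name="threshold" value="128"/></step>
  <step unit="count_dark"/>
</scenario>"""

# Hypothetical contracts: for each unit, its parameters, the document
# element it reads, the element it writes, and the entry point to call.
CONTRACTS_XML = """<contracts>
  <contract unit="binarize" entry="binarize" reads="pixels" writes="binary">
    <param name="threshold" type="int"/>
  </contract>
  <contract unit="count_dark" entry="count_dark" reads="binary" writes="dark_count"/>
</contracts>"""

# Hypothetical document: a handful of grey-level values.
DOCUMENT_XML = "<document><pixels>12 200 90 255 130</pixels></document>"


def binarize(values, threshold):
    """Toy unit: map each grey level to 1 (>= threshold) or 0."""
    return " ".join("1" if int(v) >= threshold else "0" for v in values.split())


def count_dark(values):
    """Toy unit: count the 0 entries of a binarized sequence."""
    return str(values.split().count("0"))


# Entry points named in contracts are resolved through this registry.
REGISTRY = {"binarize": binarize, "count_dark": count_dark}
CASTS = {"int": int, "str": str}


def run_scenario(scenario_xml, contracts_xml, document_xml):
    """Controller: run each scenario step in order on the shared document."""
    contracts = {c.get("unit"): c for c in ET.fromstring(contracts_xml)}
    document = ET.fromstring(document_xml)
    for step in ET.fromstring(scenario_xml).findall("step"):
        contract = contracts[step.get("unit")]
        # The contract declares parameter types; the scenario supplies values.
        declared = {p.get("name"): CASTS[p.get("type")]
                    for p in contract.findall("param")}
        params = {p.get("name"): declared[p.get("name")](p.get("value"))
                  for p in step.findall("param")}
        source = document.find(contract.get("reads"))
        if source is None:
            raise ValueError("missing input element: " + contract.get("reads"))
        output = REGISTRY[contract.get("entry")](source.text, **params)
        # Results go back into the document, where later units can find them.
        ET.SubElement(document, contract.get("writes")).text = output
    return document


result = run_scenario(SCENARIO_XML, CONTRACTS_XML, DOCUMENT_XML)
print(ET.tostring(result, encoding="unicode"))
```

In this sketch the second unit never receives the first unit's output directly; it finds the `binary` element that the controller stored in the document, which mirrors the abstract's description of units communicating through the document being interpreted.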
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Clavier, E., Masini, G., Delalandre, M., Rigamonti, M., Tombre, K., Gardes, J. (2004). DocMining: A Cooperative Platform for Heterogeneous Document Interpretation According to User-Defined Scenarios. In: Lladós, J., Kwon, YB. (eds) Graphics Recognition. Recent Advances and Perspectives. GREC 2003. Lecture Notes in Computer Science, vol 3088. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-25977-0_2
DOI: https://doi.org/10.1007/978-3-540-25977-0_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22478-5
Online ISBN: 978-3-540-25977-0
eBook Packages: Springer Book Archive