DocMining: A Cooperative Platform for Heterogeneous Document Interpretation According to User-Defined Scenarios

  • Conference paper
Graphics Recognition. Recent Advances and Perspectives (GREC 2003)

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 3088)

Abstract

The DocMining platform aims to provide a general framework for document interpretation. It integrates document processing units from different sources, which communicate through the document being interpreted. A task is represented by a scenario that describes the units to be run, and each unit is associated with a contract that describes its parameters, data and results, as well as how to run it. A controller interprets the scenario and triggers each required document processing unit in turn. Documents, scenarios and contracts are all represented in XML to ease data manipulation and communication.
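
The architecture outlined in the abstract can be illustrated with a minimal sketch. It is not the DocMining implementation: the XML element and attribute names, the two toy units (`binarize`, `count_black`) and the `run_scenario` function are all assumptions made for this illustration. It only shows the general idea of a controller that reads a scenario, looks up each unit's contract to resolve parameters and the result name, and lets the units communicate by reading from and writing to one shared XML document.

```python
import xml.etree.ElementTree as ET

# Hypothetical scenario: the ordered list of units to run. Element and
# attribute names are invented for this sketch, not taken from DocMining.
SCENARIO_XML = """
<scenario name="demo">
  <step unit="binarize"><param name="threshold" value="128"/></step>
  <step unit="count_black"/>
</scenario>
"""

# Hypothetical contracts: one per unit, declaring its parameters
# (with type and default) and the element that receives its result.
CONTRACTS_XML = """
<contracts>
  <contract unit="binarize" result="binary">
    <param name="threshold" type="int" default="100"/>
  </contract>
  <contract unit="count_black" result="black_count"/>
</contracts>
"""

# Toy document: a single row of grey levels stored as text.
DOCUMENT_XML = "<document><pixels>10 200 90 255 130</pixels></document>"


def binarize(doc, threshold):
    """Read <pixels> and return a string of 1 (dark) / 0 (light) flags."""
    levels = [int(v) for v in doc.findtext("pixels").split()]
    return " ".join("1" if v < threshold else "0" for v in levels)


def count_black(doc):
    """Count the dark pixels in the <binary> element written earlier."""
    return str(doc.findtext("binary").split().count("1"))


# Registry mapping unit names to callables, and contract types to casts.
UNITS = {"binarize": binarize, "count_black": count_black}
CASTS = {"int": int, "str": str}


def run_scenario(scenario, contracts, doc):
    """Run each scenario step in order against the shared document."""
    by_unit = {c.get("unit"): c for c in contracts.findall("contract")}
    for step in scenario.findall("step"):
        name = step.get("unit")
        contract = by_unit[name]
        # Values given in the scenario override the contract defaults.
        given = {p.get("name"): p.get("value") for p in step.findall("param")}
        kwargs = {}
        for p in contract.findall("param"):
            raw = given.get(p.get("name"), p.get("default"))
            kwargs[p.get("name")] = CASTS[p.get("type")](raw)
        result = UNITS[name](doc, **kwargs)
        # The unit's result is stored in the document under the name
        # declared by its contract, where later units can read it.
        ET.SubElement(doc, contract.get("result")).text = result
    return doc


doc = run_scenario(
    ET.fromstring(SCENARIO_XML),
    ET.fromstring(CONTRACTS_XML),
    ET.fromstring(DOCUMENT_XML),
)
print(ET.tostring(doc, encoding="unicode"))
```

In this sketch the second unit never receives data directly from the first; it finds the `binary` element in the document. Keeping parameter types and defaults in the contract rather than in the controller means a new unit can be registered without changing `run_scenario`.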

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Clavier, E., Masini, G., Delalandre, M., Rigamonti, M., Tombre, K., Gardes, J. (2004). DocMining: A Cooperative Platform for Heterogeneous Document Interpretation According to User-Defined Scenarios. In: Lladós, J., Kwon, YB. (eds) Graphics Recognition. Recent Advances and Perspectives. GREC 2003. Lecture Notes in Computer Science, vol 3088. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-25977-0_2

  • DOI: https://doi.org/10.1007/978-3-540-25977-0_2

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-22478-5

  • Online ISBN: 978-3-540-25977-0

  • eBook Packages: Springer Book Archive
