Skip to main content

General diagram-recognition methodologies

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1072))

Abstract

The field of diagram recognition faces many challenges, including the great diversity in diagrammatic notations, and the presence of noise and ambiguity during the recognition process. To help address these problems, research is needed into methods for acquiring, representing, and exploiting notational conventions. We review several frameworks for diagram recognition: blackboard systems, schema-based systems, syntactic methods, and graph rewriting. Next we discuss the need for a computationally-relevant characterization of diagrammatic notations, the need to exploit soft constraints during diagram recognition, and the possibility that diagram generators may provide a useful source of information about notational conventions.

This is a preview of subscription content, log in via an institution.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. R. Anderson, “Two Dimensional Mathematical Notation,” in Syntactic Pattern Recognition, Applications, K. S. Fu editor, Springer 1977, pp. 147–177.

    Google Scholar 

  2. H. Baird and D. Ittner, “Data Structures for Page Readers” Proc. IAPR Workshop on Document Analysis Systems, Kaiserslautern, Germany, Oct. 1994, pp. 323–334.

    Google Scholar 

  3. A. Belkin, “Macintosh Notation Software: Present and Future,” Computer Music Journal, Vol. 18, No. 1, pp. 53–69, Spring 1994.

    Google Scholar 

  4. J. Bertin, Semiology of Graphics: Diagrams, Networks, and Maps, University of Wisconsin Press, 1983.

    Google Scholar 

  5. D. Blostein and H. Baird, “A Critical Survey of Music Image Analysis,” in Structured Document Image Analysis, Eds. H. Baird, H. Bunke, and K. Yamamoto, Springer Verlag, 1992, pp. 405–434.

    Google Scholar 

  6. D. Blostein, H. Fahmy, and A. Grbavec, “Practical Use of Graph Rewriting,” Technical Report No. 95-373, Computing and Information Science, Queen's University, January, 1995.

    Google Scholar 

  7. D. Blostein, A. Grbavec, “Recognition of Mathematical Notation,” in Handbook of Character Recognition and Document Image Analysis, Eds. H. Bunke and P. Wang, World Scientific, to appear.

    Google Scholar 

  8. H. Bunke, “Attributed Programmed Graph Grammars and Their Application to Schematic Diagram Interpretation,” IEEE Trans. Pattern Analysis and Machine Intelligence 4(6), pp. 574–582, Nov. 1982.

    Google Scholar 

  9. P. Chou, “Recognition of Equations Using a Two-Dimensional Stochastic Context-Free Grammar,” Proc. SPIE Visual Communications and Image Processing IV, Philadelphia PA, pp. 852–863, Nov. 1989.

    Google Scholar 

  10. S. Collin, K. Tombre, and P. Vaxiviere, “Don't Tell Mom I'm Doing Document Analysis; She Believes I'm in the Computer Vision Field,” Proc. Second Intl. Conf. on Document Analysis and Recognition, Tsukuba, Japan, Oct. 1993, pp. 619–622.

    Google Scholar 

  11. Y. Dimitriadis, J. Coronado, and C. de la Maza, “A New Interactive Mathematical Editor, Using On-line Handwritten Symbol Recognition, and Error Detection-Correction with an Attribute Grammar,” in Proc. First Intl. Conf. on Document Analysis and Recognition, Saint Malo, France, September 1991, pp. 242–250.

    Google Scholar 

  12. H. Fahmy and D. Blostein, “A Graph Grammar Programming Style for Recognition of Music Notation,” Machine Vision and Applications, Vol. 6, No. 2, pp. 83–99, 1993.

    Google Scholar 

  13. H. Fahmy, “Reasoning in the Presence of Uncertainty via Graph Rewriting,” PhD Thesis, Computing and Information Science, Queen's University, March 1995. (TR 95-382)

    Google Scholar 

  14. K. S. Fu, Syntactic Pattern Recognition and Applications, Prentice Hall 1982.

    Google Scholar 

  15. A. Grbavec and D. Blostein, “Mathematics Recognition Using Graph Rewriting,” Third International Conference on Document Analysis and Recognition, Montreal, Canada, August 1995.

    Google Scholar 

  16. W. Havens and A. Mackworth, “Representing Knowledge of the Visual World,” IEEE Computer, October 1983, pp. 90–96.

    Google Scholar 

  17. V. Jagannathan, R. Dodhiawala, L. Baum, Editors, Blackboard Architectures and Applications, Academic Press, 1989.

    Google Scholar 

  18. S. Joseph and T. Pridmore, “Knowledge-Directed Interpretation of Mechanical Engineering Drawings,” IEEE PAMI, Vol. 14, No. 9, Sept. 1992, pp. 928–940.

    Google Scholar 

  19. H. Kato and S. Inokuchi, “The Recognition System of Printed Piano Music using Musical Knowledge and Constraints,” Proc. IAPR Workshop on Syntactic and Structural Pattern Recognition., Murray Hill NJ, June 1990, pp. 231–248.

    Google Scholar 

  20. H. Kato and S. Inokuchi, “The Recognition Method for Roughly Hand-Drawn Logical Diagrams Based on Utilization of Multi-Layered Knowledge,” Proc. 10th Intl. Conf. on Pattern Recognition, Atlantic City NJ, June 1990, pp. 443–473.

    Google Scholar 

  21. D. Knuth, “Mathematical Typography,” Bulletin of the American Mathematical Society, Vol. 1, No. 2, March 1979.

    Google Scholar 

  22. G. Kopec and P. Chou, “Document Image Decoding Using Markov Source Models,” IEEE Trans. Pattern Analysis and Machine Intelligence, Vol. 16, No. 6, June 1994, pp. 602–617.

    Google Scholar 

  23. X. Lin, S. Shimotsuji, M. Minoh, T. Saki, “Efficient Diagram Understanding with Chraracteristic Pattern Detection,” Computer Vision, Graphics, and Image Processing, Vol. 30, 1985, pp. 84–106.

    Google Scholar 

  24. W. Lin and K.S. Fu, “A Syntactic Approach to Three-Dimensional Object Recognition,” IEEE Trans. Systems Man and Cybernetics, Vol. 16, No. 3, May 1986, pp. 405–422.

    Google Scholar 

  25. G. Lohse, K. Biolsi, N. Walker, H. Ruetter, “A Classification of Visual Representations,” Communications of the ACM, Vol. 37, No. 4, December 1994, pp. 36–49.

    Google Scholar 

  26. I. Mulder, A. Mackworth, W. Havens, “Knowledge Structuring and Constraint Satisfaction: The Mapsee Approach,” IEEE Pattern Analysis and Machine Intelligence, Vol. 10, No. 6, November 1988, pp. 866–879.

    Google Scholar 

  27. G. Novak and W. Bulko, “Diagrams and Text as Computer Input,” J. Visual Languages and Computing, Vol. 4, 1993, pp. 161–175.

    Google Scholar 

  28. M. Okamoto and B. Miao, “Recognition of Mathematical Expressions by Using the Layout Structure of Symbols,” in Proc. First Intl. Conference on Document Analysis and Recognition, Saint Malo, France, September 1991, pp. 242–250.

    Google Scholar 

  29. B. Pasternak, “Processing Imprecise and Structural Distorted Line Drawings by and Adaptable Drawing Interpretation Kernel,” Proc. IAPR Workshop on Document Analysis Systems, Kaiserslautern, Germany, Oct. 1994, pp. 349–363.

    Google Scholar 

  30. G. Read, Music Notation: A Manual of Modern Practice (Second Edition), Taplinger Publishing, New York, NY, 1979.

    Google Scholar 

  31. D. Roush, “Music Formatting Guidelines,” Technical Report OSU-3/88-TR10, Department of Computer and Information Science, The Ohio State University, 1988.

    Google Scholar 

  32. R. Sennhauser, “Integration of Contextual Knowledge Sources Into a Blackboard-based Text Recognition System,” IAPR Workshop on Document Analysis Systems, Kaiserslautern, Germany, Oct. 1994, pp. 211–228.

    Google Scholar 

  33. J. Sloboda, “The Uses of Space in Music Notation,” Visual Language, Vol. XV, No. 1, pp. 86–112, 1981.

    Google Scholar 

  34. S. Srihari, “From Pixels to Paragraphs: the Use of Contextual Models in Text Recognition,” Proc. Second Intl. Conf. Document Analysis and Recognition, Tsukuba, Japan, Oct. 1993, pp. 416–423.

    Google Scholar 

  35. S. Truvé, “Image Interpretation Using Multi-Relational Grammars,” Proc. Third International Conference on Computer Vision, pp. 146–155, December 1990.

    Google Scholar 

  36. E. Tufte, The Visual Display of Quantitative Information, Graphics Press, 1983.

    Google Scholar 

  37. P. Vaxivière and K. Tombre, “Knowledge Organization and Interpretation Process in Engineering Drawing Interpretation,” Proc. IAPR Workshop on Document Analysis Systems, Kaiserslautern, Germany, Oct. 1994, pp. 313–321.

    Google Scholar 

  38. C. Wang and S. Srihari, “A Framework for Object Recognition in a Visually Complex Environment and its Application to Locating Address Blocks on Mail Pieces,” International Journal of Computer Vision, Vol. 2, 1989, pp. 125–151.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Rangachar Kasturi Karl Tombre

Rights and permissions

Reprints and permissions

Copyright information

© 1996 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Blostein, D. (1996). General diagram-recognition methodologies. In: Kasturi, R., Tombre, K. (eds) Graphics Recognition Methods and Applications. GREC 1995. Lecture Notes in Computer Science, vol 1072. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-61226-2_10

Download citation

  • DOI: https://doi.org/10.1007/3-540-61226-2_10

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-61226-1

  • Online ISBN: 978-3-540-68387-2

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics