Abstract
The field of diagram recognition faces many challenges, including the great diversity in diagrammatic notations, and the presence of noise and ambiguity during the recognition process. To help address these problems, research is needed into methods for acquiring, representing, and exploiting notational conventions. We review several frameworks for diagram recognition: blackboard systems, schema-based systems, syntactic methods, and graph rewriting. Next we discuss the need for a computationally-relevant characterization of diagrammatic notations, the need to exploit soft constraints during diagram recognition, and the possibility that diagram generators may provide a useful source of information about notational conventions.
This is a preview of subscription content, log in via an institution.
Preview
Unable to display preview. Download preview PDF.
References
R. Anderson, “Two Dimensional Mathematical Notation,” in Syntactic Pattern Recognition, Applications, K. S. Fu editor, Springer 1977, pp. 147–177.
H. Baird and D. Ittner, “Data Structures for Page Readers” Proc. IAPR Workshop on Document Analysis Systems, Kaiserslautern, Germany, Oct. 1994, pp. 323–334.
A. Belkin, “Macintosh Notation Software: Present and Future,” Computer Music Journal, Vol. 18, No. 1, pp. 53–69, Spring 1994.
J. Bertin, Semiology of Graphics: Diagrams, Networks, and Maps, University of Wisconsin Press, 1983.
D. Blostein and H. Baird, “A Critical Survey of Music Image Analysis,” in Structured Document Image Analysis, Eds. H. Baird, H. Bunke, and K. Yamamoto, Springer Verlag, 1992, pp. 405–434.
D. Blostein, H. Fahmy, and A. Grbavec, “Practical Use of Graph Rewriting,” Technical Report No. 95-373, Computing and Information Science, Queen's University, January, 1995.
D. Blostein, A. Grbavec, “Recognition of Mathematical Notation,” in Handbook of Character Recognition and Document Image Analysis, Eds. H. Bunke and P. Wang, World Scientific, to appear.
H. Bunke, “Attributed Programmed Graph Grammars and Their Application to Schematic Diagram Interpretation,” IEEE Trans. Pattern Analysis and Machine Intelligence 4(6), pp. 574–582, Nov. 1982.
P. Chou, “Recognition of Equations Using a Two-Dimensional Stochastic Context-Free Grammar,” Proc. SPIE Visual Communications and Image Processing IV, Philadelphia PA, pp. 852–863, Nov. 1989.
S. Collin, K. Tombre, and P. Vaxiviere, “Don't Tell Mom I'm Doing Document Analysis; She Believes I'm in the Computer Vision Field,” Proc. Second Intl. Conf. on Document Analysis and Recognition, Tsukuba, Japan, Oct. 1993, pp. 619–622.
Y. Dimitriadis, J. Coronado, and C. de la Maza, “A New Interactive Mathematical Editor, Using On-line Handwritten Symbol Recognition, and Error Detection-Correction with an Attribute Grammar,” in Proc. First Intl. Conf. on Document Analysis and Recognition, Saint Malo, France, September 1991, pp. 242–250.
H. Fahmy and D. Blostein, “A Graph Grammar Programming Style for Recognition of Music Notation,” Machine Vision and Applications, Vol. 6, No. 2, pp. 83–99, 1993.
H. Fahmy, “Reasoning in the Presence of Uncertainty via Graph Rewriting,” PhD Thesis, Computing and Information Science, Queen's University, March 1995. (TR 95-382)
K. S. Fu, Syntactic Pattern Recognition and Applications, Prentice Hall 1982.
A. Grbavec and D. Blostein, “Mathematics Recognition Using Graph Rewriting,” Third International Conference on Document Analysis and Recognition, Montreal, Canada, August 1995.
W. Havens and A. Mackworth, “Representing Knowledge of the Visual World,” IEEE Computer, October 1983, pp. 90–96.
V. Jagannathan, R. Dodhiawala, L. Baum, Editors, Blackboard Architectures and Applications, Academic Press, 1989.
S. Joseph and T. Pridmore, “Knowledge-Directed Interpretation of Mechanical Engineering Drawings,” IEEE PAMI, Vol. 14, No. 9, Sept. 1992, pp. 928–940.
H. Kato and S. Inokuchi, “The Recognition System of Printed Piano Music using Musical Knowledge and Constraints,” Proc. IAPR Workshop on Syntactic and Structural Pattern Recognition., Murray Hill NJ, June 1990, pp. 231–248.
H. Kato and S. Inokuchi, “The Recognition Method for Roughly Hand-Drawn Logical Diagrams Based on Utilization of Multi-Layered Knowledge,” Proc. 10th Intl. Conf. on Pattern Recognition, Atlantic City NJ, June 1990, pp. 443–473.
D. Knuth, “Mathematical Typography,” Bulletin of the American Mathematical Society, Vol. 1, No. 2, March 1979.
G. Kopec and P. Chou, “Document Image Decoding Using Markov Source Models,” IEEE Trans. Pattern Analysis and Machine Intelligence, Vol. 16, No. 6, June 1994, pp. 602–617.
X. Lin, S. Shimotsuji, M. Minoh, T. Saki, “Efficient Diagram Understanding with Chraracteristic Pattern Detection,” Computer Vision, Graphics, and Image Processing, Vol. 30, 1985, pp. 84–106.
W. Lin and K.S. Fu, “A Syntactic Approach to Three-Dimensional Object Recognition,” IEEE Trans. Systems Man and Cybernetics, Vol. 16, No. 3, May 1986, pp. 405–422.
G. Lohse, K. Biolsi, N. Walker, H. Ruetter, “A Classification of Visual Representations,” Communications of the ACM, Vol. 37, No. 4, December 1994, pp. 36–49.
I. Mulder, A. Mackworth, W. Havens, “Knowledge Structuring and Constraint Satisfaction: The Mapsee Approach,” IEEE Pattern Analysis and Machine Intelligence, Vol. 10, No. 6, November 1988, pp. 866–879.
G. Novak and W. Bulko, “Diagrams and Text as Computer Input,” J. Visual Languages and Computing, Vol. 4, 1993, pp. 161–175.
M. Okamoto and B. Miao, “Recognition of Mathematical Expressions by Using the Layout Structure of Symbols,” in Proc. First Intl. Conference on Document Analysis and Recognition, Saint Malo, France, September 1991, pp. 242–250.
B. Pasternak, “Processing Imprecise and Structural Distorted Line Drawings by and Adaptable Drawing Interpretation Kernel,” Proc. IAPR Workshop on Document Analysis Systems, Kaiserslautern, Germany, Oct. 1994, pp. 349–363.
G. Read, Music Notation: A Manual of Modern Practice (Second Edition), Taplinger Publishing, New York, NY, 1979.
D. Roush, “Music Formatting Guidelines,” Technical Report OSU-3/88-TR10, Department of Computer and Information Science, The Ohio State University, 1988.
R. Sennhauser, “Integration of Contextual Knowledge Sources Into a Blackboard-based Text Recognition System,” IAPR Workshop on Document Analysis Systems, Kaiserslautern, Germany, Oct. 1994, pp. 211–228.
J. Sloboda, “The Uses of Space in Music Notation,” Visual Language, Vol. XV, No. 1, pp. 86–112, 1981.
S. Srihari, “From Pixels to Paragraphs: the Use of Contextual Models in Text Recognition,” Proc. Second Intl. Conf. Document Analysis and Recognition, Tsukuba, Japan, Oct. 1993, pp. 416–423.
S. Truvé, “Image Interpretation Using Multi-Relational Grammars,” Proc. Third International Conference on Computer Vision, pp. 146–155, December 1990.
E. Tufte, The Visual Display of Quantitative Information, Graphics Press, 1983.
P. Vaxivière and K. Tombre, “Knowledge Organization and Interpretation Process in Engineering Drawing Interpretation,” Proc. IAPR Workshop on Document Analysis Systems, Kaiserslautern, Germany, Oct. 1994, pp. 313–321.
C. Wang and S. Srihari, “A Framework for Object Recognition in a Visually Complex Environment and its Application to Locating Address Blocks on Mail Pieces,” International Journal of Computer Vision, Vol. 2, 1989, pp. 125–151.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1996 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Blostein, D. (1996). General diagram-recognition methodologies. In: Kasturi, R., Tombre, K. (eds) Graphics Recognition Methods and Applications. GREC 1995. Lecture Notes in Computer Science, vol 1072. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-61226-2_10
Download citation
DOI: https://doi.org/10.1007/3-540-61226-2_10
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-61226-1
Online ISBN: 978-3-540-68387-2
eBook Packages: Springer Book Archive