Abstract
In this paper, knowledge-based image interpretation is formulated and solved as an optimization problem that takes into account the observed image data, the available task-specific knowledge, and the requirements of an application. Knowledge is represented by a semantic network consisting of concepts (nodes) and links (edges). Concepts are further defined by attributes, relations, and a judgment function. The interface between the symbolic knowledge base and the results of image (or signal) processing and initial segmentation is specified via primitive concepts.
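To make the representation concrete, the following is a minimal sketch of a semantic network with concepts, links, and primitive concepts. It is not the authors' data model: the class names, the link kind "part", and the toy judgment functions are all hypothetical, chosen only to illustrate the terms used in the abstract.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Concept:
    """A node of the network (hypothetical layout, not taken from the paper)."""
    name: str
    # Judgment function: maps attribute values to a score (assumed to lie in [0, 1]).
    judgment: Callable[[Dict[str, float]], float]
    attributes: List[str] = field(default_factory=list)
    relations: List[str] = field(default_factory=list)
    # Primitive concepts are the interface to segmentation results.
    primitive: bool = False


@dataclass
class Link:
    """A directed edge between two concepts; `kind` is a free-form label."""
    kind: str
    source: str
    target: str


@dataclass
class SemanticNetwork:
    concepts: Dict[str, Concept] = field(default_factory=dict)
    links: List[Link] = field(default_factory=list)

    def add_concept(self, concept: Concept) -> None:
        self.concepts[concept.name] = concept

    def add_link(self, kind: str, source: str, target: str) -> None:
        # Reject edges that refer to concepts not present in the network.
        if source not in self.concepts or target not in self.concepts:
            raise KeyError("both ends of a link must be known concepts")
        self.links.append(Link(kind, source, target))

    def primitive_concepts(self) -> List[str]:
        return [c.name for c in self.concepts.values() if c.primitive]


def build_example() -> SemanticNetwork:
    """Toy street model: a 'street' concept built from primitive line segments."""
    net = SemanticNetwork()
    net.add_concept(Concept(
        "line_segment",
        judgment=lambda v: v.get("contrast", 0.0),
        attributes=["contrast"],
        primitive=True,
    ))
    net.add_concept(Concept(
        "street",
        judgment=lambda v: min(1.0, v.get("width", 0.0) / 4.0),
        attributes=["width"],
        relations=["borders_parallel"],
    ))
    net.add_link("part", "street", "line_segment")
    return net
```

Calling `build_example().primitive_concepts()` lists the concepts that would be matched directly against segmentation results, which in this toy model is only `line_segment`.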
We present a recently developed approach to optimal interpretation that is based on the automatic conversion of the concept-oriented semantic network to an attribute-centered representation and on the use of iterative optimization procedures such as simulated annealing or genetic algorithms. We show that this is a feasible approach which provides ‘any-time’ capability and allows parallel processing. It provides a well-defined combination of signal- and symbol-oriented processing by optimizing a heuristic judgment function.
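The 'any-time' property of iterative optimization can be illustrated with a short simulated-annealing sketch. It is written under strong assumptions and is not the algorithm of the paper: an interpretation is modelled as one chosen value per attribute, the judgment function is an arbitrary callable to be maximized, and the geometric cooling schedule and all parameter names are hypothetical. The generator yields the best interpretation found so far after every step, so a caller may stop at any time and still hold a valid result.

```python
import math
import random
from typing import Callable, Iterator, List, Sequence, Tuple


def anneal_anytime(
    candidates: Sequence[Sequence[float]],
    judgment: Callable[[List[float]], float],
    steps: int = 2000,
    t_start: float = 1.0,
    t_end: float = 0.01,
    seed: int = 0,
) -> Iterator[Tuple[List[float], float]]:
    """Maximize `judgment` over one value per attribute; yield best-so-far."""
    rng = random.Random(seed)

    def values(indices: List[int]) -> List[float]:
        return [c[k] for c, k in zip(candidates, indices)]

    # Start from a random assignment of one candidate value per attribute.
    state = [rng.randrange(len(c)) for c in candidates]
    score = judgment(values(state))
    best, best_score = list(state), score

    for step in range(steps):
        # Geometric cooling from t_start down to t_end (an assumed schedule).
        t = t_start * (t_end / t_start) ** (step / max(1, steps - 1))
        # Propose a new value for one randomly chosen attribute.
        proposal = list(state)
        i = rng.randrange(len(state))
        proposal[i] = rng.randrange(len(candidates[i]))
        new_score = judgment(values(proposal))
        delta = new_score - score
        # Metropolis rule: always accept improvements, sometimes accept losses.
        if delta >= 0 or rng.random() < math.exp(delta / t):
            state, score = proposal, new_score
            if score > best_score:
                best, best_score = list(state), score
        # Any-time behaviour: a usable result is available after every step.
        yield values(best), best_score


def toy_judgment(vals: List[float]) -> float:
    """Hypothetical judgment: negative squared distance to a fixed target."""
    target = [3.0, 7.0, 1.0]
    return -sum((v - t) ** 2 for v, t in zip(vals, target))


TOY_CANDIDATES = [[float(k) for k in range(10)] for _ in range(3)]
```

A caller can iterate over `anneal_anytime(TOY_CANDIDATES, toy_judgment)` and break as soon as a time budget is exhausted; the last yielded pair is the current best interpretation and its score. Independent runs with different seeds could be executed in parallel, which is one simple way to use several processors.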
The general ideas have been applied to various problems of image and speech understanding. As an example, we describe the recognition of streets from TV image sequences to demonstrate the efficiency of iterative optimization.
Copyright information
© 1996 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Niemann, H., Fischer, V., Paulus, D., Fischer, J. (1996). Knowledge based image understanding by iterative optimization. In: Görz, G., Hölldobler, S. (eds) KI-96: Advances in Artificial Intelligence. KI 1996. Lecture Notes in Computer Science, vol 1137. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-61708-6_68
DOI: https://doi.org/10.1007/3-540-61708-6_68
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-61708-2
Online ISBN: 978-3-540-70669-4
eBook Packages: Springer Book Archive