Abstract
Object recognition has developed to the most common approach for detecting arbitrary objects based on their appearance, where viewpoint dependency, occlusions, algorithmic constraints and noise are often hindrances for successful detection. Statistical pattern analysis methods, which are able to extract features from appearing images and enable the classification of the image content have reached a certain maturity and achieve excellent recognition on rather complex problems.
However, these systems do not seem directly scalable to human performance in a cognitive sense and appearance does not contribute to understanding the structure of objects. Syntactical pattern recognition methods are able to deal with structured objects, which may be constructed from primitives that were generated from extracted image features. Here, an eminent problem is how to aggregate image primitives in order to (re-) construct objects from such primitives.
In this paper, we propose a new approach to the aggregation of object prototypes by using geometric primitives derived from features out of image sequences and acquired from changing viewpoints. We apply syntactical rules for forming representations of the implicit object topology of object prototypes by a set of fuzzy graphs. Finally, we find a super-position of a prototype graph set, which can be used for updating and learning new object recipes in hippocampal like episodic memory that paves the way to cognitive understanding of natural scenes. The proposed implementation is exemplified with an object similar to the Necker cube.
This work was supported by project S9101 ”Cognitive Vision” of the Austrian Science Foundation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Allen, J.: Introduction to Natural Language Understanding. BENJAMIN/CUMMINGS Publishing (1989)
Campbell, R.J., Flynn, P.J.: A survey of free-form object representation and recognition techniques. Comp. Vis. I. Underst. 81 (2001)
Conte, D., Foggia, P., Sansone, C., Vento, M.: Thirty years of graph matching in pattern recognition. Int. J. of Patt. Rec. and AI (2004)
Chomsky, N.: Three models for the description of language. IRE Transactions on Information Theory 2, 113–124 (1956)
Christou, C.G., Tjan, B.S., Bülthoff, H.H.: Viewpoint information provided by a familiar environment facilitates object identification. TR Max-Planck institute f. biol. cybernetics 68 (1999)
Duda, R., Hart, P.: Use of the hough transformation to detect lines and curves in pictures. Comm. ACM 15 (1972)
Dynkin, E.B.: Theory of Markov Processes. Prentice-Hall, Englewood Cliffs (1961)
Eckes, C., Triesch, J., Malsburg, C.: Analysis of cluttered scenes using an elastic matching approach for stereo images. Neural Computation 18, 1441–1471 (2006)
Faugeras, O.: Three-Dimensional Computer Vision. MIT Press, Cambridge (2001)
Fisher, R.B.: From Surfaces to Objects. Wiley, Chichester (1989)
Goebel, P.M., Vincze, M.: Implicit modeling of object topology with guidance from temporal view attention. In: ICVW 2007. Proceedings of the International Cognitive Vision Workshop, Bielefeld University (D) (2007)
Goebel, P.M., Vincze, M.: Vision for cognitive systems: A new compound concept connecting natural scenes with cognitive models. In: INDIN 2007. LNCS, Springer, Heidelberg (to appear)
Hayworth, K.J., Biederman, I.: Neural evidence for intermediate representations in object recognition. In: Vision Research (in press)
Hyundo, K., Murphy-Chutorian, E., Triesch, J.: Semi-autonomous learning of objects. In: Proceedings of the CVPR 2006 (2006)
Johnson-Laird, P.N.: Mental Models, Towards a Cognitive Science of Language, Inference and Consciousness. Harvard Press (1983)
Kermorvant, C., Dupont, P.: Stochastic grammatical inference with multinomial tests. In: Adriaans, P., Fernau, H., van Zaanen, M. (eds.) ICGI 2002. LNCS (LNAI), vol. 2484, Springer, Heidelberg (2002)
Koffka, K.: Principles of Gestalt Psychology. Hartcourt, NY (1935)
Klima, R.E., Sigmon, N.P., Sitzinger, E.L. (eds.): Applications of abstract algebra with Maple and Matlab. CRC Press, Boca Raton, USA (2007)
Liebers, A.: Planarizing graphs – a survey and annotated bibliography. Journal of Graph Algorithms and Applications 5, 1–74 (2001)
Marr, D.: Vision: A Computational Approach. Freeman & Co., San Francisco (1982)
Nishizeki, T.: Planar Graphs: Theory and Algorithms. Elsevier, Amsterdam (1988)
Peschl, M.F.: Modes of knowing and modes of coming to know. Constructivist Foundations 1(3), 111–123 (2006)
Pinz, A.: Object categorization. Foundations and Trends in Computer Graphics and Vision 1(4), 257–353 (2005)
Poole, D., Mackworth, A., Goebel, R.: Computational Intelligence. Oxford Press, Oxford (1998)
Riesenhuber, M., Poggio, T.: Neural mechanisms of object recognition. Current Opinion in Neurobiology 12, 162–168 (2002)
Schellewald, C., Schnörr, C.: Probabilistic subgraph matching approach based on convex relaxation. In: Rangarajan, A., Vemuri, B., Yuille, A.L. (eds.) EMMCVPR 2005. LNCS, vol. 3757, pp. 171–186. Springer, Heidelberg (2005)
Wang, R.F., Spelke, E.S.: Human spatial representation: insights from animals. Trends in Cognitive Sciences 6(9) (2002)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Goebel, P.M., Vincze, M. (2007). A Cognitive Modeling Approach for the Semantic Aggregation of Object Prototypes from Geometric Primitives: Toward Understanding Implicit Object Topology. In: Blanc-Talon, J., Philips, W., Popescu, D., Scheunders, P. (eds) Advanced Concepts for Intelligent Vision Systems. ACIVS 2007. Lecture Notes in Computer Science, vol 4678. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74607-2_8
Download citation
DOI: https://doi.org/10.1007/978-3-540-74607-2_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74606-5
Online ISBN: 978-3-540-74607-2
eBook Packages: Computer ScienceComputer Science (R0)