Abstract
In this paper, we describe a biological motivated image representation in terms of local multi–modal primitives. These primitives are functional abstractions of hypercolumns in V1 [13]. The efficient and generic coding of visual information in terms of local symbolic descriptiones allows for a wide range of applications. For example, they have been used to investigate the multi–modal character of Gestalt laws in natural scenes [14], to code a multi–modal stereo matching and to investigate the role of different visual modalities for stereo [11], and to use a combination of stereo and grouping as well as Rigid Body Motion to acquire reliable 3D information as demonstrated in this publication.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Aloimonos, Y., Shulman, D.: Integration of Visual Modules — An extension of the Marr Paradigm. Academic Press, London (1989)
Barron, J.L., Fleet, D.J., Beauchemin, S.S.: Performance of optical flow techniques. International Journal of Computer Vision 12(1), 43–77 (1994)
Felsberg, M.: Optical flow estimation from monogenic phase. In: Jähne, B., Barth, E., Mester, R., Scharr, H. (eds.) Complex Motion, Proceedings 1st Int. Workshop, Günzburg, 12.-14.10 (2004)
Felsberg, M., Krüger, N.: A probablistic definition of intrinsic dimensionality for images. In: Michaelis, B., Krell, G. (eds.) DAGM 2003. LNCS, vol. 2781, pp. 140–147. Springer, Heidelberg (2003)
Felsberg, M., Sommer, G.: The monogenic signal. IEEE Transactions on Signal Processing 49(12), 3136–3144 (2001)
Granlund, G.H., Knutsson, H.: Signal Processing for Computer Vision. Kluwer Academic Publishers, Dordrecht (1995)
Hubel, D.H., Wiesel, T.N.: Anatomical demonstration of columns in the monkey striate cortex. Nature 221, 747–750 (1969)
Kovesi, P.: Image features from phase congruency. Videre: Journal of Computer Vision Research 1(3), 1–26 (1999)
Krüger, N., Ackermann, M., Sommer, G.: Accumulation of object representations utilizing interaction of robot action and perception. Knowledge Based Systems 15, 111–118 (2002)
Krüger, N., Felsberg, M.: A continuous formulation of intrinsic dimension. In: Proceedings of the British Machine Vision Conference, pp. 261–270 (2003)
Krüger, N., Felsberg, M.: An explicit and compact coding of geometric and structural information applied to stereo matching. Pattern Recognition Letters 25(8), 849–863 (2004)
Krüger, N., Felsberg, M., Wörgötter, F.: Processing multi-modal primitives from image sequences. In: Fourth International ICSC Symposium on Engineering of Intelligent Systems (2004)
Krüger, N., Lappe, M., Wörgötter, F.: Biologically motivated multi-modal processing of visual primitives. The Interdisciplinary Journal of Artificial Intelligence and the Simulation of Behaviour 1(5), 417–428 (2004)
Krüger, N., Wörgötter, F.: Multi modal estimation of collinearity and parallelism in natural image sequences. Network: Computation in Neural Systems 13, 553–576 (2002)
Krüger, N., Wörgötter, F.: Statistical and deterministic regularities: Utilisation of motion and grouping in biological and artificial visual systems. Advances in Imaging and Electron Physics 131 (2004)
Nagel, H.-H., Enkelmann, W.: An investigation of smoothness constraints for the estimation of displacement vector fields from image sequences. IEEE Transactions on Pattern Analysis and Machine Intelligence 8, 565–593 (1986)
Pugeault, N., Krüger, N.: Multi–modal matching applied to stereo. In: Proceedings of the BMVC 2003, pp. 271–280 (2003)
Pugeault, N., Wörgötter, F., Krüger, N.: Stereo matching with a contextual confidence based on collinear groups. In: Ilg, U., et al. (eds.) Dynamic Perception. Infix Verlag, St. Augustin (2004)
Shevelev, I.A., Lazareva, N.A., Tikhomirov, A.S., Sharev, G.A.: Sensitivity to cross–like figures in the cat striate neurons. Neuroscience 61, 965–973 (1995)
Watt, R.J., Phillips, W.A.: The function of dynamic grouping in vision. Trends in Cognitive Sciences 4(12), 447–454 (2000)
Wörgötter, F., Krüger, N., Pugeault, N., Calow, D., Lappe, M., Pauwels, K., Van Hulle, M., Tan, S., Johnston, A.: Early cognitive vision: Using gestalt-laws for task-dependent, active image-processing. Natural Computing 3(3), 293–321 (2004)
Wurtz, R.H., Kandel, E.R.: Perception of motion, depth and form. In: Kandell, E.R., Schwartz, J.H., Messel, T.M. (eds.) Principles of Neural Science, 4th edn., pp. 548–571 (2000)
Zetzsche, C., Barth, E.: Fundamental limits of linear filters in the visual processing of two dimensional signals. Vision Research 30 (1990)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Krüger, N., Wörgötter, F. (2005). Multi-modal Primitives as Functional Models of Hyper-columns and Their Use for Contextual Integration. In: De Gregorio, M., Di Maio, V., Frucci, M., Musio, C. (eds) Brain, Vision, and Artificial Intelligence. BVAI 2005. Lecture Notes in Computer Science, vol 3704. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11565123_16
Download citation
DOI: https://doi.org/10.1007/11565123_16
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29282-1
Online ISBN: 978-3-540-32029-6
eBook Packages: Computer ScienceComputer Science (R0)