Abstract
By examining the problem of image correspondence (binocular stereo and optical flow) and its relationship with other modules such as segmentation, shape and depth estimation, occlusion detection, and local signal processing, we argue that early visual modules are entangled in chicken-and-egg relationships, and unraveling these necessitates a compositional approach. In this paper, we present compositional algorithms which can match images containing slanted surfaces and images having different contrast, while simultaneously solving other problems as part of the same process. Ultimately, our goal is to motivate the application of the compositional approach to unify many other early visual modules. Experimental results have been presented on a large variety of stereo and motion images, including images with contrast mismatch and images containing untextured slanted surfaces.
Similar content being viewed by others
References
Aloimonos, Y. and Shulman, D. 1989. Integration of Visual Modules: An Extension of the Marr Paradigm. Academic Press.
Alvarez, L., Deriche, R., Papadapoulo, T. and Sanchez, J. 2002. Symmetrical dense optical flow estimation with occlusion detection. European Conference on Computer Vision, I:721ff.
Barnard, S. T. 1989. Stochastic stereo matching over scale. International Journal of Computer Vision, 3(1):17–32.
Barron, J., Fleet, D., Beauchemin, S. and Burkitt, T. 1994. Performance of optical flow techniques. IEEE Conference on Computer Vision and Pattern Recognition, (92):236–242.
Beauchemin, S.S. and Barron, J.L. 1995. The computation of optical flow. ACM Computing Surveys, 27(3):433–467.
Beauchemin, S. and Barron, J. 2000. On the fourier properties of discontinuous visual motion. Journal of Mathematical Imaging and Vision, 13(3):155–172.
Birchfield, S. and Tomasi, C. 1998. A pixel dissimilarity measure that is insensitive to image sampling. IEEE Trans. on Pattern Analysis and Machine Intelligence, 20(4):401–406.
Black, M. and Fleet, D. 2000. Probabilistic detection and tracking of motion discontinuities. International Journal of Computer Vision, 38(3):231–245.
Bobick, A.F. and Intille, S.S. 1999. Large occlusion stereo. International Journal of Computer Vision, 33(3):181–200.
Boykov, Y., Veksler, O. and Zabih, R. 2001. Fast approximate energy minimization via graph cuts. IEEE Trans. on Pattern Analysis and Machine Intelligence, 23(11):1222–1239.
Boykov, Y., Veksler, O. and Zabih, R. 1998. A variable window approach to early vision. IEEE Trans. on Pattern Analysis and Machine Intelligence, 20(12):1283–1294.
Egnal, G. and Wildes, R. 2002. Detecting binocular half-occlusions: empirical comparisons of five approaches. IEEE Trans. on Pattern Analysis and Machine Intelligence, 24(8):1127–1133.
Fleet, D. 1994. Disparity from local weighted phase-correlation. IEEE International Conference on SMC, 48–56.
Fleet, D., Jepson, A. and Jenkin, M. 1991. Phase-based disparity measurement. CVGIP: Image Understanding, (53):198–210.
Fusiello, A., Roberto, V. and Trucco, E. 1997. Efficient stereo with multiple windowing. IEEE Conference on Computer Vision and Pattern Recognition, 858–863.
Galvin, B., McCane, B., Novins, K., Mason, D. and Mills, S. 1998. Recovering motion fields: An evaluation of eight optical flow algorithms. Proceedings of the British Machine Vision Converence.
Geiger, D., Ladendorf, B. and Yuille, A. 1992. Occlusions and binocular stereo. European Conference on Computer Vision, 425–433.
Geman, S. and Geman, D. 1984. Stochastic relaxation, gibbs distributions, and the bayesian restoration of images. IEEE Trans. on Pattern Analysis and Machine Intelligence, 6(6): 721–741.
Jenkin, M. and Jepson, A. 1988. Computational processes in Human Vision. (ed.) Z. Pylyshn, Ablex Press, NJ, ch. The measurement of binocular disparity.
Julesz, B. 1971. Foundations of Cyclopean Perception. University of Chicago Press, Chicago.
Kanade, T. and Okutomi, M. 1994. A stereo matching algorithm with an adaptive window: theory and experiment. IEEE Trans. on Pattern Analysis and Machine Intelligence, 16(9):920–932.
Kim, J., Kolmogorov, V. and Zabih, R. 2003. Visual correspondence using energy minimization and mutual information. International Conference on Computer Vision, (2):1033–1040.
Kolmogorov, V. and Zabih, R. 2001. Computing visual correspondence with occlusions using graph cuts. International Conference on Computer Vision, 508–515.
Liu, H., Hong, T., Herman, M., Camus, T. and Chellappa, R. 1998. Accuracy vs efficiency trade-offs in optical flow algorithms. Computer Vision and Image Understanding, 72(3):271–286.
Mitiche, A. and Bouthemy, P. 1996. Computation and analysis of image motion: a synopsis of current problems and methods. International Journal of Computer Vision, 19(1):29–55.
Nestares, O., Navarro, R., Portilla, J. and Tabernero, A. 1998. Efficient spatial-domain implementation of a multiscale image representation based on gabor functions. J. Electronic Imaging, (7):166–173.
Ogale, A. and Aloimonos, Y. 2004. Stereo correspondence with slanted surfaces: critical implications of horizontal slant. IEEE Conference on Computer Vision and Pattern Recognition, (1):568–573.
Ogale, A. and Aloimonos, Y. 2005. Shape and the stereo correspondence problem. International Journal of Computer Vision, 65(3):147–162.
Ogale, A., Fermuller, C. and Aloimonos, Y. 2005. Motion segmentation using occlusions. IEEE Transactions on Pattern Analysis and Machine Intelligence, 27(6):988–992.
Ohta, Y. and Kanade, T. 1985. Stereo by intra- and inter-scanline search using dynamic programming. IEEE Trans. on Pattern Analysis and Machine Intelligence, 7(2):139–154.
Okutomi, M. and Kanade, T. 1993. A multiple baseline stereo. IEEE Trans. on Pattern Analysis and Machine Intelligence, 15(4):353–363.
Qian, N. 1994. Computing stereo disparity and motion with known binocular cell properties. Neural Computation, (6):390–404.
Roy, S. and Cox, I. 1998. A maximum-flow formulation of the n-camera stereo correspondence problem. International Conference on Computer Vision, 492–499.
Sanger, T. 1988. Stereo disparity computation using gabor filters. Biological Cybernetics, (59):405–418.
Scharstein, D. and Szeliski, R. 1998. Stereo matching with nonlinear diffusion. International Journal of Computer Vision, 28(2):155–174.
Scharstein, D. and Szeliski, R. 2002. A taxonomy and evaluation of dense two-frame stereo correspondence algorithms. International Journal of Computer Vision, 47(1):7–42.
Szeliski, R. 1990. Bayesian modeling of uncertainty in low-level vision. International Journal of Computer Vision, 5(3):271–302.
Tao, H., Sawhney, H. and Kumar, R. 2001. A global matching framework for stereo computation. International Conference on Computer Vision, (1):532–539.
Weng, J. 1994. Image matching using windowed fourier phase. International Journal of Computer Vision, (11):211–236.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Ogale, A.S., Aloimonos, Y. A Roadmap to the Integration of Early Visual Modules. Int J Comput Vision 72, 9–25 (2007). https://doi.org/10.1007/s11263-006-8890-9
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11263-006-8890-9