A Computational Model of Human Vision Based on Visual Routines

Ballard, Dana H.; Rao, Rajesh

doi:10.1007/978-3-642-79980-8_75

Part of the book series: Informatik aktuell (INFORMAT)

Abstract

We argue that human vision has natural timescales, and that models of human vision at these different timescales are qualitatively different. In particular, at the timescale of a few seconds, human vision can be modeled in terms of two primitive functional routines. A “what” routine determines object identity from a segmented input, and a “where” routine determines the retinal location of a desired object. More complicated functions can be composed from these two. For example, a complicated visuo-motor task such as copying can be described in terms of these two routines. The primary subroutine needed is one that computes the relationship of the parts of an object with respect to an object-centered frame.
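
The composition of the two routines can be illustrated with a minimal sketch. It assumes that each retinal location is summarized by a small feature vector and that matching is nearest-neighbor under squared Euclidean distance; both are assumptions made for illustration, not the method described in the chapter. The names `what`, `where`, `memory`, and `retina` are hypothetical.

```python
from typing import Dict, List, Tuple

Vector = Tuple[float, ...]


def distance(a: Vector, b: Vector) -> float:
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def what(features: Vector, memory: Dict[str, Vector]) -> str:
    """'What' routine (assumed form): name the stored object nearest to the input."""
    return min(memory, key=lambda name: distance(features, memory[name]))


def where(target: Vector, retina: List[List[Vector]]) -> Tuple[int, int]:
    """'Where' routine (assumed form): (row, col) whose features best match the target."""
    candidates = [(r, c) for r, row in enumerate(retina) for c in range(len(row))]
    return min(candidates, key=lambda rc: distance(target, retina[rc[0]][rc[1]]))


# Toy data: two remembered objects and a 2x2 "retina" of feature vectors.
memory = {"red_block": (1.0, 0.0), "blue_block": (0.0, 1.0)}
retina = [[(0.5, 0.5), (0.1, 0.9)],
          [(0.9, 0.2), (0.5, 0.5)]]

# Compose the routines: locate the blue block, then identify what is at that spot.
location = where(memory["blue_block"], retina)
label = what(retina[location[0]][location[1]], memory)
```

In this toy setup a copying-style task would alternate the two calls: `where` to find the next part, `what` to confirm its identity before acting on it.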

Author information

Authors and Affiliations

Computer Science Department, University of Rochester, Rochester, NY 14627, USA
Dana H. Ballard & Rajesh Rao

Editor information

Editors and Affiliations

Technische Fakultät, Universität Bielefeld, Postfach 10 01 31, D-33501 Bielefeld, Germany
Gerhard Sagerer, Stefan Posch & Franz Kummert

About this paper

Cite this paper

Ballard, D.H., Rao, R. (1995). A Computational Model of Human Vision Based on Visual Routines. In: Sagerer, G., Posch, S., Kummert, F. (eds) Mustererkennung 1995. Informatik aktuell. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-79980-8_75

DOI: https://doi.org/10.1007/978-3-642-79980-8_75
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-60293-4
Online ISBN: 978-3-642-79980-8
eBook Packages: Springer Book Archive
