Abstract
Object recognition in visual scenes by computer has proven to be more difficult than anyone would have thought three decades ago, at the beginning of the research program to achieve this goal. A central issue for further progress is the design and effective implementation of an object representation which captures all of the requirements for description and perceptual organization. In this paper, the major issues surrounding the development of such a representation are established and placed in the setting of relatively recent discoveries in the philosophy of recognition and object classification. From this viewpoint, definitions for representation, recognition, identification and classification are established and related to standard approaches to object recognition in visual scenes.
The use of biological models and introspection as a source of design ideas for representation is discussed. It is argued that the most profitable source of ideas will emerge from an engineering approach, based on principles from geometric reasoning, photogrammetry. and signal processing. The role of context in object recognition is outlined with emphasis on its use throughout all of stages of recognition. The paper concludes with a description of a object recognition system, called MORSE, which embodies many of the principles derived from these philosophical considerations.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Roberts, L. G., Machine Perception of 3D Solids in Optical and Electro-Optical Information Processing, Tippet, J. T., et al eds., MIT Press, 1965.
Ayre, A.J., Language Truth and Logic, Dover Books, 1935, with a new introduction, 1946.
Hume, D., An Enquiry Concerning Human Understanding, 1748, in The English Philosophers From Bacon to Mill, Burtt, E. ed., The Modern Library, 1939.
Jowett, B., The Dialogs of Plato, vol. 3, Scribner, Armstrong and Co., 1874.
Davis, R., Shrobe, H. and Szolovits, P., The AI Magazine, Spring 1993, p. 17.
Dennett, D., Brainstorms, MIT Press, 1981
Weiskrantz, L., Thought Without Language, Oxford:Clarenden, 1988.
Raichle, M., Visualizing the Mind, Scientific American, April, 1994.
Freedman, D., Brainmakers, Simon and Schuster, 1994.
Edelman, G., Bright Air, Brilliant Fire, Basic Books, 1992. p. 194.
Kapur, D. and Mundy, J.L., editors Geometric Reasoning, MIT Press, 1989.
McCarthy, J., Epistemological Problems of Artificial Intelligence, in Readings in Knowledge Representation, Brachman, R. and Levesque, H., eds., Morgan Kaufmann, 1985.
Dennett, D., Consciousness Explained, Little, Brown and Company, 1991.
Shepard, R. N., Metzler, Mental Rotation of Three-Dimensional Objects, Science, 171, pp. 701–703.
Lowe, D., Perceptual Organization and Visual Recognition, Kluwer Academic Publishers, 1985.
Sayre, K. M., Recognition: A Study in the Philosophy of Artificial Intelligence, University of Notre Dame Press, 1965.
Grimson, W.E.L., Object Recognition by Computer: The Role of Geometric Constraints, MIT Press, 1990.
Aristotle, Aristotle: Categories on Interpretation and Prior Analytics, Loeb Classical Library, Cooke, H., translation, 1938.
Hayes, P., The Second Naive Physics Manifesto, in Readings in Knowledge Representation, Brachman, R. and Levesque, H., eds., Morgan Kaufmann, 1985.
Kline, M., Mathematics and the Search for Knowledge, Oxford University Press, 1985.
Wittgenstein, L., Philosophical Investigations, Ancombe, G., translator, The Macmillan Co., 1953.
Wantanabe, S., Pattern Recognition, Human and Mechanical, John Wiley and Sons, 1985.
Bowyer, K. and Stark, L. Form and function; a theory of purposive, qualitative 3-D object recognition, Artificial Intelligence and Computer Vision, Proc. 7'th Israeli Conference, pp 137–146, 1990.
Mundy, J.L. and Heller, A., The Evolution and Testing of a Model-Based Object Recognition System in Computer Vision: Advances and Applications, Kasturi, R. and Jain, R. editors, IEEE Computer Society Press, 1991.
Huttenlocher, D. and Ullman, S., Object Recognition Using Alignment, Proc. First International Conference on Computer Vision, 1987.
Strat, T. and Fischler, M, CONDOR, IEEE Transactions on Machine Intelligence and Pattern Analysis, Oct. 1991.
Mundy, J. L. and Vrobel, P., The Role of IU Technology in RADIUS Phase II, Proc. ARPA Image Understanding Workshop, Morgan Kaufman, 1994.
Mundy, J.L. and Zisserman, A., Repeated Structures: Image Correspondence Constraints and 3D Structure in Proc. 2nd Workshop on Applications of Geometric Invariants in Computer Vision, Lecture Notes on Computer Science, 825, Springer Verlag, 1994.
Zerroug, M. and Nevatia, R., Using Invariance and Quasi-invariance for the Segmentation and Recovery of Curved Objects in Proc. 2nd Workshop on Applications of Geometric Invariants in Computer Vision, Lecture Notes on Computer Science, 825, Springer Verlag, 1994.
Liu J., Mundy J.L., Forsyth D.A., Zisserman A. and Rothwell C.A., Efficient Recognition of Rotationally Symmetric Surfaces and Straight Homogeneous Generalized Cylinders, Proc. Conference on Computer Vision and Pattern Recognition, 1993.
Arnheim, R. Art and Visual Perception, University of California Press, 1971.
Mundy, J.L. and Zisserman, A. editors, Geometrical Invariance in Computer Vision, MIT Press, 1992.
Sullivan, S., Sandford, L. and Ponce, J., On Useing Geometric Distance Fits to Estimate 3D Object Shape, Pose and Deformation from Range, CT and Video Images, Proc. Computer Vision and Pattern Recognition Conference, 1993.
Mundy, J.L., Forsyth, D., Zisserman, A. and Rothwell, C., MORSE: Multiple Object Recognition by Scene Entailment, Draft Report, GE Research Report, Feb. 1994.
Intille, S. and Bobick, A., Tracking Using a Local Closed-World Assumption: Tracking in the Football Domain, MIT Media Lab Report 296, August, 1994.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1995 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Mundy, J.L. (1995). Object recognition: The search for representation. In: Hebert, M., Ponce, J., Boult, T., Gross, A. (eds) Object Representation in Computer Vision. ORCV 1994. Lecture Notes in Computer Science, vol 994. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-60477-4_2
Download citation
DOI: https://doi.org/10.1007/3-540-60477-4_2
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-60477-8
Online ISBN: 978-3-540-47526-2
eBook Packages: Springer Book Archive