Skip to main content
Log in

Dual-coding representations for robot vision programming in Tekkotsu

  • Published:
Autonomous Robots Aims and scope Submit manuscript

Abstract

We describe complementary iconic and symbolic representations for parsing the visual world. The iconic pixmap representation is operated on by an extensible set of “visual routines” (Ullman, 1984; Forbus et al., 2001). A symbolic representation, in terms of lines, ellipses, blobs, etc., is extracted from the iconic encoding, manipulated algebraically, and re-rendered iconically. The two representations are therefore duals, and iconic operations can be freely intermixed with symbolic ones. The dual-coding approach offers robot programmers a versatile collection of primitives from which to construct application-specific vision software. We describe some sample applications implemented on the Sony AIBO.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

References

  • Agre, P. and Chapman, D. 1987. PENGI: An implementation of a theory of activity. In Proceedings of the Sixth National Conference on Artificial Intelligence (AAAI-87), pp. 268–272.

  • Bresenham, J.E. 1965. Algorithm for computer control of a digital plotter. IBM Systems Journal, 4(1):25–30.

    Article  Google Scholar 

  • Bruce, J., Balch, T., and Veloso, M. 2000. Fast and inexpensive color image segmentation for interactive robots. In Proceedings of the 2000 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 00), vol. 3, pp. 2061– 2066.

  • Forbus, K.D., Mahoney, J.V., and Dill, K. 2001. How qualitative spatial reasoning can improve strategy game AIs. In 15th International Workshop on Qualitative Reasoning.

  • Glassner, A.S. 1990. Graphics Gems. Academic Press.

  • Halelamien, N.S. 2004. Visual routines for spatial cognition on a mobile robot. Senior honors thesis, Computer Science Department, Carnegie Mellon University, Pittsburgh, PA. Available at <http://www.Tekkotsu.org/media/thesis_neilh.pdf>.

  • Landre, J. 2004 Programming with Intel IPP (Integrated Performance Primitives) and Intel OpenCV (Open Computer Vision) Under GNU Linux. Accessed May 31 at: http://sourceforge.net/projects/opencvlibr-ary.

  • Paivio, A. 1986. Mental Representations: A Dual-Coding Approach. New York: Oxford University Press

    Google Scholar 

  • Prokop, R.J. and Reeves, A.P. 1992. A survey of moment-based techniques for unoccluded object representation and recognition. In CVGIP: Graphical Models and Image Processing, vol. 54, pp. 438–460.

    Article  Google Scholar 

  • Rao, R.P.N. and Ballard, D.H. 1995. An active vision architecture based on iconic representations. In Artificial Intelligence, 78(1–2):461–505.

  • Roelfsema, P.R., Lamme, V.A.F., and Spekreijse, H. 2000. The implementation of visual routines. Vision Research, 40: 1385–1411.

    Article  Google Scholar 

  • Sony Corp. 2004. OPEN-R SDK: Model Information for ERS-7.

  • Tira-Thompson, E.J. 2004. Tekkotsu: A Rapid Development Framework for Robotics. Masters thesis, Robotics Institute, Carnegie Mellon University. Available at <http://www.cs.cmu.edu/~tekkotsu/media/thesis_ejt.pdf>.

  • Ullman, S. 1984. Visual routines. Cognition, 18:97–159.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to David S. Touretzky.

Additional information

David S. Touretzky is a Research Professor in the Computer Science Department and the Center for the Neural Basis of Cognition at Carnegie Mellon University. He earned his B.A. in Computer Science from Rutgers University in 1978, and his M.S. (1979) and Ph.D. (1984) in Computer Science from Carnegie Mellon. Dr. Touretzky’s research interests are in computational neuroscience, particularly representations of space in the rodent hippocampus and related structures, and high level primitives for robot programming. He is presently developing an undergraduate curriculum in cognitive robotics based on the Tekkotsu software framework described in this article.

Neil S. Halelamien earned a B.S. in Computer Science and a B.S. in Cognitive Science at Carnegie Mellon University in 2004, and is currently pursuing his Ph.D. in the Computation & Neural Systems program at the California Institute of Technology. His research interests are in studying vision from both a computational and biological perspective. He is currently using transcranial magnetic stimulation to study visual representations and information processing in visual cortex.

Ethan J. Tira-Thompson is a graduate student in the Robotics Institute at Carnegie Mellon University. He earned a B.S. in Computer Science and a B.S. in Human-Computer Interaction in 2002, and an M.S. in Robotics in 2004, at Carnegie Mellon. He is interested in a wide variety of computer science topics, including machine learning, computer vision, software architecture, and interface design. Ethan’s research has revolved around the creation of the Tekkotsu framework to enable the rapid development of robotics software and its use in education. He intends to specialize in mobile manipulation and motion planning for the completion of his degree.

Jordan J. Wales is completing a Master of Studies in Theology at the University of Notre Dame. He earned a B.S. in Engineering (Swarthmore College, 2001), an M.Sc. in Cognitive Science (Edinburgh, UK, 2002), and a Postgraduate Diploma in Theology (Oxford, UK, 2003). After a year as a graduate research assistant in Computer Science at Carnegie Mellon, he entered the master’s program in Theology at Notre Dame and is now applying to doctoral programs. His research focus in early and medieval Christianity is accompanied by an interest in medieval and modern philosophies of mind and their connections with modern cognitive science.

Kei Usui is a masters student in the Robotics Institute at Carnegie Mellon University. He earned his B.S. in Physics from Carnegie Mellon University in 2005. His research interests are reinforcement learning, legged locomotion, and cognitive science. He is presently working on algorithms for humanoid robots to maintain balance against unexpected external forces.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Touretzky, D.S., Halelamien, N.S., Tira-Thompson, E.J. et al. Dual-coding representations for robot vision programming in Tekkotsu. Auton Robot 22, 425–435 (2007). https://doi.org/10.1007/s10514-007-9024-0

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10514-007-9024-0

Keywords

Navigation