Skip to main content
Log in

What you say is what you see — Interactive generation, manipulation and modification of 3-D shapes based on verbal descriptions

  • Published:
Artificial Intelligence Review Aims and scope Submit manuscript

Abstract

The advent of virtual reality (VR) introduced a paradigm for human-to-human communication in which 3-D shapes can be manipulated in real time in a new kind of computer supported cooperative workspace (CSCW) (Takemura and Kishino 1992). However, mere manipulation — either with 3-D input devices (e.g., the DataGlove) or with spoken language (Mochizuki and Kishino 1991) — does not do justice to this new paradigm, which could prove to be revolutionary for human-to-human and human-to-machine — communication. This paper discusses the possibility of providing the means for VR-based CSCW participants not only to interactively manipulate, but also to generate and modify 3-D shapes using verbal descriptions, along with simple hand gestures. To this end, the paper also proposes a framework for interactive indexing of knowledge-level descriptions (Newell 1982, Tijerino and Mizoguchi 1993) of human intentions to a symbol-level representation based on deformable superquadrics (Pentland 1986; Horikoshi and Kasahara 1990, Terzopoulos 1991). This framework, at least, breaks ground in integration of natural language with interactive computer graphics.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Anderson, J. R. (1978). Arguments Concerning Representations for Mental Images.Psychological Review 85: 249–277.

    Google Scholar 

  • Biederman, I. (1987). Recognition-by-Components: A Theory of Human Image Understanding.Psychological Review 94(2): 115–147.

    Google Scholar 

  • Boose, J. H. (1986).Expertise Transfer for Expert Systems. Elsevier: Amsterdam.

    Google Scholar 

  • Boose, J. H. & Bradshaw, J. M. (1987). Expertise Transfer and Complex Problems: Using AQUINAS as a Knowledge Acquisition Workbench for Knowledge-Based Systems.International Journal of Man-Machine Studies 26: 3–28.

    Google Scholar 

  • Bradshaw, J. M., Ford, K. M., Adams-Webber, J. R. & Boose, J. H. (1993). Beyond the Repertory Grid: New Approaches to Constructivist Knowledge Acquisition Tool Development.International Journal of Intelligent Systems 8(2): 287–333.

    Google Scholar 

  • Butterworth, J., Davison, A., Hench, S. & March Olano, T. (1992). 3DM: A Three Dimensional Modeler Using a Head-Mounted Display. ACM 0-89791-471-6/92/0003/0135.

  • Chandrasekaran, B. & Narayanan, N. H. (1990). Towards a Theory of Commonsense Visual Reasoning. In Nori, K. V. & Veni Madhavan, C. E. (eds.)Lecture Notes in Computer Science 472, 388–409. Springer-Verlag: Berlin.

    Google Scholar 

  • Chandrasekaran, B., Narayanan, N. H. & Iwasaki, Y. (1993). Reasoning with Diagrammatic Representations — A Report on the Spring Symposium —.AI Magazine, 49–56.

  • Dejong, G. F. (1986). Explanation-Based Learning. In Michalski, R. S., Carbonell, J. G. & Mitchell, T. M. (eds.)Machine Learning: An Artificial Intelligence Approach. Volume II. Morgan Kaufmann: Los Altos, CA.

    Google Scholar 

  • Diederich, J., Ruhmann & May M. (1987). KRITON: A Knowledge Acquisition Tool for Expert Systems.International Journal of Man-Machine Studies 26(1): 29–40.

    Google Scholar 

  • Ford, K. M., Cañas, A., Jones, J., Stahl, H., Novak, J. & Adams-Webber, J. (1990). ICONKAT: An Integrated Constructivist Knowledge Acquisition Tool.Knowledge Acquisition 3(2): 215–236.

    Google Scholar 

  • Gard-Jarnadan, C. & Salvendy, G. (1987). A Conceptual Framework for Knowledge Elicitation.International Journal of Man-Machine Studies 26(4): 521–531.

    Google Scholar 

  • Gardiner, M. (1965). The Superellipse: A Curve Between the Ellipse and the Rectangle.Scientific America 213: 222–234.

    Google Scholar 

  • Gruber, T. (1992). A Translation Approach to Portable Ontology Specifications. Stanford University KSL Technical Report KSL 92–72.

  • Horikoshi, T. & Kasahara, H. (1990). 3-D Shape Indexing Language. In Proceedings ofThe 1990 International Conference on Computers and Communications, 493–499.

  • Johansson, G. (1950).Configurations in Event Perception. Almqvist and Wiksell: Stockholm.

    Google Scholar 

  • Kelly, G. A. (1955).The Psychology of Personal Constructs. Norton: New York.

    Google Scholar 

  • Kishino, F. Communication with realistic sensations (1990).3-D Image, 4, 2 (in Japanese).

  • Klinker, G., Marques, D., McDermott, J., Marsereau, T. & Stintson, L. (1992). The Active Glossary: Taking Integration Seriously. In Proceedings ofThe Seventh Knowledge Acquisition for Knowledge-Based Systems Workshop, 14–1 to 14–19. Banff, Canada.

  • Lass, U., Lüer, G., Ulrich, M. & Werner, S. (1993). Access to Analog Representations in Memory for Visually Perceived Forms: The Facilitating Effect of Declarative Knowledge. In Strube, G. & Wender, K. F. (eds)The Cognitive Psychology of Knowledge, 75–96. Elsevier Science Publishers B. V.: The Netherlands.

    Google Scholar 

  • Lenat, D. B. & Guha, R. V. (1990). Cyc: Toward Programs with Common Sense.Communications of the ACM 33(8): 30–49.

    Google Scholar 

  • Mizoguchi, R., Tijerino, Y. A. & Ikeda, M. (1992). Two-Level Mediating Representation for a Task Analysis Interview System. In Proceedings ofAAAI-92 Workshop for Knowledge Representation Aspects of Knowledge Acquisition, 107–114. San Jose, Ca.

  • Mochizuki, K. & Kishino, F. (1991). A 3-D Scene Access Interface Considering an Individual Variations of Spatial Indication Concepts. In Proceedings ofThe Seventh Symp. on Human Interface, 51–54. Kyoto, Japan.

  • Neches, R., Fikes, R., Finin, T., Gruber, T., Patil, T., Snator, T. & Swartout, W. R. (1991). Enabling Technology for Knowledge Sharing.AI Magazine 12(3): 36–56.

    Google Scholar 

  • Newell, A. (1982). The Knowledge Level.Artificial Intelligence 18(1): 87–127.

    Google Scholar 

  • Nishihara, H. K. (1981). Intensity, Visible-Surface, and Volumetric Representations.Artificial Intelligence 28: 293–331.

    Google Scholar 

  • Pentland, A. P. (1986). Perceptual Organization and the Representation of Form.Artificial Intelligence 28: 292–331.

    Google Scholar 

  • Quinlan, R. (1986). Induction of Decision Trees.Machine Learning 1(1): 81–106.

    Google Scholar 

  • Rosch, E. (1973). On the Internal Structure of Perceptual and Semantic Categories. In Moore, T. E. (ed.)Cognitive Development and the Acquisition of Language. Academic Press: New York.

    Google Scholar 

  • Shaw, M. L. G. & Gaines, B. R. (1987). KITTEN: Knowledge Initiation and Transfer Tools for Experts and Novices.International Journal of Man-Machine Studies 27(3): 251–280.

    Google Scholar 

  • Steels, J. (1992). End-User Configuration of Applications. In Proceedings ofThe Second Japanese Knowledge Acquisition for Knowledge-Based Systems Workshop, 47–64, Kobe, Japan.

  • Stevens, S. (1974).Patterns in Nature. Atlantic-Little, Brown Books: Boston, MA.

    Google Scholar 

  • Takemura, H. & Kishino, F. (1992). Cooperative Work Environment Using Virtual Workspace. In Proceedings ofACM Conf. on CSCW'92, 226–232. Toronto, Canada.

  • Terzopoulos, D. (1991). Dynamic 3D Models with Local and Global Deformations: Deformable Superquadrics.IEEE Transactions on Pattern Analysis and Machine Intelligence 13(7): 703–714.

    Google Scholar 

  • Thompson, D-A. (1942).On Growth and Form. University Press: Cambridge, U.K., 2nd ed.

    Google Scholar 

  • Tijerino, Y. A., Abe, S. Miyasato, T. & Kishino F. (1993). In Proceedings ofThe 47th National Conference of the Information Processing Society of Japan, 385–386. Tottori, Japan. Vol. 2.

  • Tijerino, Y. A. & Mizoguchi, R. (1993). MULTIS II: Enabling End-Users to Design Problem-Solving Engines via Two-Level Task Ontologies. In Aussenac, N., Boy, G., Gaines, B., Linster, M., Ganascia, J. G. & Kodratoff, Y. (eds.)Lecture Notes in Artificial Intelligence 723 — Knowledge Acquisition for Knowledge-Based Systems -, 340–359. Springer-Verlag.

  • Umamichi, T. & Tijerino, Y. A. (1993). A Report on the Acquireability of Descriptive Concepts for Cars Based on Personal Construct Psychology. ATR Technical Report TR-C-0092 (in Japanese).

  • Wertheimer, M. (1923). Laws of Organization in Perceptual Forms. In Ellis, W. D. (ed.)A Source Book of Gestalt Psychology. Harcourt Brace: New York.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Tijerino, Y.A., Abe, S., Miyasato, T. et al. What you say is what you see — Interactive generation, manipulation and modification of 3-D shapes based on verbal descriptions. Artif Intell Rev 8, 215–234 (1994). https://doi.org/10.1007/BF00849075

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF00849075

Key words

Navigation