What you say is what you see — Interactive generation, manipulation and modification of 3-D shapes based on verbal descriptions

Tijerino, Yuri A.; Abe, Shinji; Miyasato, Tsutomu; Kishino, Fumio

doi:10.1007/BF00849075

What you say is what you see — Interactive generation, manipulation and modification of 3-D shapes based on verbal descriptions

Published: March 1994

Volume 8, pages 215–234, (1994)
Cite this article

Artificial Intelligence Review Aims and scope Submit manuscript

Yuri A. Tijerino¹,
Shinji Abe¹,
Tsutomu Miyasato¹ &
…
Fumio Kishino¹

62 Accesses
Explore all metrics

Abstract

The advent of virtual reality (VR) introduced a paradigm for human-to-human communication in which 3-D shapes can be manipulated in real time in a new kind of computer supported cooperative workspace (CSCW) (Takemura and Kishino 1992). However, mere manipulation — either with 3-D input devices (e.g., the DataGlove^™) or with spoken language (Mochizuki and Kishino 1991) — does not do justice to this new paradigm, which could prove to be revolutionary for human-to-human and human-to-machine — communication. This paper discusses the possibility of providing the means for VR-based CSCW participants not only to interactively manipulate, but also to generate and modify 3-D shapes using verbal descriptions, along with simple hand gestures. To this end, the paper also proposes a framework for interactive indexing of knowledge-level descriptions (Newell 1982, Tijerino and Mizoguchi 1993) of human intentions to a symbol-level representation based on deformable superquadrics (Pentland 1986; Horikoshi and Kasahara 1990, Terzopoulos 1991). This framework, at least, breaks ground in integration of natural language with interactive computer graphics.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Mixed interaction: evaluating user interactions for object manipulations in virtual space

Article 22 May 2024

VRTactileDraw: A Virtual Reality Tactile Pattern Designer for Complex Spatial Arrangements of Actuators

A comparative study on user gestural inputs for navigation in NUI-based 3D virtual environments

Article 04 November 2020

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

References

Anderson, J. R. (1978). Arguments Concerning Representations for Mental Images.Psychological Review 85: 249–277.
Google Scholar
Biederman, I. (1987). Recognition-by-Components: A Theory of Human Image Understanding.Psychological Review 94(2): 115–147.
Google Scholar
Boose, J. H. (1986).Expertise Transfer for Expert Systems. Elsevier: Amsterdam.
Google Scholar
Boose, J. H. & Bradshaw, J. M. (1987). Expertise Transfer and Complex Problems: Using AQUINAS as a Knowledge Acquisition Workbench for Knowledge-Based Systems.International Journal of Man-Machine Studies 26: 3–28.
Google Scholar
Bradshaw, J. M., Ford, K. M., Adams-Webber, J. R. & Boose, J. H. (1993). Beyond the Repertory Grid: New Approaches to Constructivist Knowledge Acquisition Tool Development.International Journal of Intelligent Systems 8(2): 287–333.
Google Scholar
Butterworth, J., Davison, A., Hench, S. & March Olano, T. (1992). 3DM: A Three Dimensional Modeler Using a Head-Mounted Display. ACM 0-89791-471-6/92/0003/0135.
Chandrasekaran, B. & Narayanan, N. H. (1990). Towards a Theory of Commonsense Visual Reasoning. In Nori, K. V. & Veni Madhavan, C. E. (eds.)Lecture Notes in Computer Science 472, 388–409. Springer-Verlag: Berlin.
Google Scholar
Chandrasekaran, B., Narayanan, N. H. & Iwasaki, Y. (1993). Reasoning with Diagrammatic Representations — A Report on the Spring Symposium —.AI Magazine, 49–56.
Dejong, G. F. (1986). Explanation-Based Learning. In Michalski, R. S., Carbonell, J. G. & Mitchell, T. M. (eds.)Machine Learning: An Artificial Intelligence Approach. Volume II. Morgan Kaufmann: Los Altos, CA.
Google Scholar
Diederich, J., Ruhmann & May M. (1987). KRITON: A Knowledge Acquisition Tool for Expert Systems.International Journal of Man-Machine Studies 26(1): 29–40.
Google Scholar
Ford, K. M., Cañas, A., Jones, J., Stahl, H., Novak, J. & Adams-Webber, J. (1990). ICONKAT: An Integrated Constructivist Knowledge Acquisition Tool.Knowledge Acquisition 3(2): 215–236.
Google Scholar
Gard-Jarnadan, C. & Salvendy, G. (1987). A Conceptual Framework for Knowledge Elicitation.International Journal of Man-Machine Studies 26(4): 521–531.
Google Scholar
Gardiner, M. (1965). The Superellipse: A Curve Between the Ellipse and the Rectangle.Scientific America 213: 222–234.
Google Scholar
Gruber, T. (1992). A Translation Approach to Portable Ontology Specifications. Stanford University KSL Technical Report KSL 92–72.
Horikoshi, T. & Kasahara, H. (1990). 3-D Shape Indexing Language. In Proceedings ofThe 1990 International Conference on Computers and Communications, 493–499.
Johansson, G. (1950).Configurations in Event Perception. Almqvist and Wiksell: Stockholm.
Google Scholar
Kelly, G. A. (1955).The Psychology of Personal Constructs. Norton: New York.
Google Scholar
Kishino, F. Communication with realistic sensations (1990).3-D Image, 4, 2 (in Japanese).
Klinker, G., Marques, D., McDermott, J., Marsereau, T. & Stintson, L. (1992). The Active Glossary: Taking Integration Seriously. In Proceedings ofThe Seventh Knowledge Acquisition for Knowledge-Based Systems Workshop, 14–1 to 14–19. Banff, Canada.
Lass, U., Lüer, G., Ulrich, M. & Werner, S. (1993). Access to Analog Representations in Memory for Visually Perceived Forms: The Facilitating Effect of Declarative Knowledge. In Strube, G. & Wender, K. F. (eds)The Cognitive Psychology of Knowledge, 75–96. Elsevier Science Publishers B. V.: The Netherlands.
Google Scholar
Lenat, D. B. & Guha, R. V. (1990). Cyc: Toward Programs with Common Sense.Communications of the ACM 33(8): 30–49.
Google Scholar
Mizoguchi, R., Tijerino, Y. A. & Ikeda, M. (1992). Two-Level Mediating Representation for a Task Analysis Interview System. In Proceedings ofAAAI-92 Workshop for Knowledge Representation Aspects of Knowledge Acquisition, 107–114. San Jose, Ca.
Mochizuki, K. & Kishino, F. (1991). A 3-D Scene Access Interface Considering an Individual Variations of Spatial Indication Concepts. In Proceedings ofThe Seventh Symp. on Human Interface, 51–54. Kyoto, Japan.
Neches, R., Fikes, R., Finin, T., Gruber, T., Patil, T., Snator, T. & Swartout, W. R. (1991). Enabling Technology for Knowledge Sharing.AI Magazine 12(3): 36–56.
Google Scholar
Newell, A. (1982). The Knowledge Level.Artificial Intelligence 18(1): 87–127.
Google Scholar
Nishihara, H. K. (1981). Intensity, Visible-Surface, and Volumetric Representations.Artificial Intelligence 28: 293–331.
Google Scholar
Pentland, A. P. (1986). Perceptual Organization and the Representation of Form.Artificial Intelligence 28: 292–331.
Google Scholar
Quinlan, R. (1986). Induction of Decision Trees.Machine Learning 1(1): 81–106.
Google Scholar
Rosch, E. (1973). On the Internal Structure of Perceptual and Semantic Categories. In Moore, T. E. (ed.)Cognitive Development and the Acquisition of Language. Academic Press: New York.
Google Scholar
Shaw, M. L. G. & Gaines, B. R. (1987). KITTEN: Knowledge Initiation and Transfer Tools for Experts and Novices.International Journal of Man-Machine Studies 27(3): 251–280.
Google Scholar
Steels, J. (1992). End-User Configuration of Applications. In Proceedings ofThe Second Japanese Knowledge Acquisition for Knowledge-Based Systems Workshop, 47–64, Kobe, Japan.
Stevens, S. (1974).Patterns in Nature. Atlantic-Little, Brown Books: Boston, MA.
Google Scholar
Takemura, H. & Kishino, F. (1992). Cooperative Work Environment Using Virtual Workspace. In Proceedings ofACM Conf. on CSCW'92, 226–232. Toronto, Canada.
Terzopoulos, D. (1991). Dynamic 3D Models with Local and Global Deformations: Deformable Superquadrics.IEEE Transactions on Pattern Analysis and Machine Intelligence 13(7): 703–714.
Google Scholar
Thompson, D-A. (1942).On Growth and Form. University Press: Cambridge, U.K., 2nd ed.
Google Scholar
Tijerino, Y. A., Abe, S. Miyasato, T. & Kishino F. (1993). In Proceedings ofThe 47th National Conference of the Information Processing Society of Japan, 385–386. Tottori, Japan. Vol. 2.
Tijerino, Y. A. & Mizoguchi, R. (1993). MULTIS II: Enabling End-Users to Design Problem-Solving Engines via Two-Level Task Ontologies. In Aussenac, N., Boy, G., Gaines, B., Linster, M., Ganascia, J. G. & Kodratoff, Y. (eds.)Lecture Notes in Artificial Intelligence 723 — Knowledge Acquisition for Knowledge-Based Systems -, 340–359. Springer-Verlag.
Umamichi, T. & Tijerino, Y. A. (1993). A Report on the Acquireability of Descriptive Concepts for Cars Based on Personal Construct Psychology. ATR Technical Report TR-C-0092 (in Japanese).
Wertheimer, M. (1923). Laws of Organization in Perceptual Forms. In Ellis, W. D. (ed.)A Source Book of Gestalt Psychology. Harcourt Brace: New York.
Google Scholar

Download references

Author information

Authors and Affiliations

ATR Communication Systems Research Laboratories, 2-2 Hikaridai, Seikacho, Sorakugun, 619-02, Kyoto, Japan
Yuri A. Tijerino, Shinji Abe, Tsutomu Miyasato & Fumio Kishino

Authors

Yuri A. Tijerino
View author publications
You can also search for this author in PubMed Google Scholar
Shinji Abe
View author publications
You can also search for this author in PubMed Google Scholar
Tsutomu Miyasato
View author publications
You can also search for this author in PubMed Google Scholar
Fumio Kishino
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

About this article

Cite this article

Tijerino, Y.A., Abe, S., Miyasato, T. et al. What you say is what you see — Interactive generation, manipulation and modification of 3-D shapes based on verbal descriptions. Artif Intell Rev 8, 215–234 (1994). https://doi.org/10.1007/BF00849075

Download citation

Issue Date: March 1994
DOI: https://doi.org/10.1007/BF00849075

Key words

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

What you say is what you see — Interactive generation, manipulation and modification of 3-D shapes based on verbal descriptions

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Mixed interaction: evaluating user interactions for object manipulations in virtual space

VRTactileDraw: A Virtual Reality Tactile Pattern Designer for Complex Spatial Arrangements of Actuators

A comparative study on user gestural inputs for navigation in NUI-based 3D virtual environments

Explore related subjects

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Key words

Subscribe and save

Buy Now