Abstract
State of the art artificial agents rely heavily on human intervention for performing vision-language integration; apart from being cost and effort effective, this intervention deprives artificial agents from the ability to react intelligently and to show intentionality when engaged in situated multimodal communication. In this paper, we suggest an alternative way of building vision-language integration prototypes with limited human intervention. The suggestions have emerged from the development of such a prototype for the verbalisation of visual scenes in a property-surveillance task.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Pastra, K., Wilks, Y.: Vision-language integration in AI: a reality check. In: Proceedings of the 16th European Conference in Artificial Intelligence, pp. 937–941 (2004)
Pastra, K.: Viewing vision-language integration as a double-grounding case. In: Proceedings of the AAAI Fall Symposium on “Achieving Human-Level Intelligence through Integrated Systems and Research”, pp. 62–69 (2004)
Searle, J.: Minds, brains, and programs. Behavioral and Brain Sciences 3, 417–457 (1980)
Harnad, S.: The symbol grounding problem. Physica D 42, 335–346 (1990)
Pastra, K.: Vision-Language Integration: a Double-Grounding Case. PhD thesis, University of Sheffield (2005)
Kanade, T., Rander, P., Narayanan, R.: Virtualised reality: constructing virtual worlds from real scenes. IEEE Multimedia 4, 34–46 (1997)
Minsky, M.: The Society of Mind. Simon and Schuster Inc. (1986)
Landau, B., Jackendoff, R.: “What” and “Where” in spatial language and cognition. Behavioural and Brain Sciences 16, 217–265 (1993)
Kaplan, F.: Talking AIBO: First experimentation of verbal interactions with an autonomous four-legged robot. In: Proceedings of the TWENTE Workshop on Language Technology, pp. 57–63 (2000)
Roy, D.: Learning visually grounded words and syntax for a scene description task. Computer speech and language 16, 353–385 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Pastra, K. (2006). An Alternative Suggestion for Vision-Language Integration in Intelligent Agents. In: Antoniou, G., Potamias, G., Spyropoulos, C., Plexousakis, D. (eds) Advances in Artificial Intelligence. SETN 2006. Lecture Notes in Computer Science(), vol 3955. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11752912_75
Download citation
DOI: https://doi.org/10.1007/11752912_75
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-34117-8
Online ISBN: 978-3-540-34118-5
eBook Packages: Computer ScienceComputer Science (R0)