Abstract
Although considerable effort has been already devoted to studying various aspects of human-machine interaction, we are still a long way from developing socially believable conversational agents. This paper identifies some of the main causes of the current state in the field: (i) socially believable behaviour of a technical system is misinterpreted as a functional requirement, rather than a qualitative, (ii) the currently prevalent statistical approaches cannot address research problems of managing human-machine interaction that require some sort of contextual analysis, and (iii) the structure of human-machine interaction is unjustifiably reduced to a task structure. In addition, we propose a way to address these pitfalls. We consider the capability of a technical system to simulate fundamental features of human consciousness as one of the key desiderata to perform socially believable behaviour. In line with this, the paper discusses the possibilities for the computational realization of (iv) unified interpretation, (v) learning through interaction, and (vi) context-dependent perception in the context of human-machine interaction.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
And possibly other contexts (e.g., the user’s electroencelographic activity, etc.).
- 2.
Apart from the fact that the study of architecture to support intelligent behaviour may be one of the most ill-defined enterprises in the field of artificial intelligence [26].
- 3.
In our previous work, we refer to these two groups as to focus stimuli and negative reinforcement stimuli. However, the terms stimuli and inhibitors reflect more appropriately the dichotomy between these groups.
- 4.
- 5.
The practice of long-term collection of the user’s data opens many important ethical questions that are out of the scope of this paper. However, these questions still remain to be properly addressed.
- 6.
Only for the purpose of illustration, the lower end of the range of significant retrieval costs for a given balanced focus tree can be estimated as the ratio of the height of the focus tree to the number of all nodes in the focus tree.
References
Alexandersson J, Reithinger N (1997) Learning dialogue structures from a corpus. In: Proceedings of EuroSpeech-97, Rhodes, pp 2231–2235
Bacchiani M, Rybach D (2014) Context dependent state tying for speech recognition using deep neural network acoustic models. In: IEEE international conference on acoustics, speech and signal processing (ICASSP 2014), Florence, pp 230–234
Bernsen NO, Dybkjær L (2010) Multimodal usability. Springer, London
Bilange E (2000) In: Néel F, Bouwhuis DG, Taylor MM (eds) The structure of multimodal dialoque II. An approach to oral dialogue modelling. John Benjamins Publishing Company, Amsterdam
Bledowski C, Kaiser J, Rahm B (2010) Basic operations in working memory: contributions from functional imaging studies. Behav Brain Res 214(2):172–179
Boehm BW (1988) A spiral model of software development and enhancement. Computer 21(5):61–72
Brouwer H, Fitz H, Hoeks J (2012) Getting real about semantic illusions: rethinking the functional role of the P600 in language comprehension. Brain Res 1446:127–143
Casadio M, Giannoni P, Morasso PG, Sanguineti V (2009) A proof of concept study for the integration of robot therapy with physiotherapy in the treatment of stroke patients. Clin Rehabil 23(3):217–228
Chomsky N (2000) New Horizons in the study of language and mind. Cambridge University Press, Cambridge
Chomsky N (2002) Syntactic Structures. Walter De Gruyter
Christensen HI, Sloman A, Kruijff G-J, Wyatt JL (2010) Cognitive systems introduction. In: Christensen HI, Kruijff G-J, Wyatt JL (eds) Cognitive systems. Springer, Berlin, pp 3–50
Clodic A, Alami R, Montreuil V, Li S, Wrede B, Swadzba A (2007) A study of interaction between dialog and decision for human-robot collaborative task achievement. In: Proceedings of the 16th IEEE international symposium on robot and human interactive communication, RO-MAN 2007, Jeju Island, Korea, pp 913–918
Cortés U, Annicchiarico R, Urdiales C, Barrué C, Martínez A, Villar A, Caltagirone C (2008) Supported Human Autonomy for Recovery and Enhancement of Cognitive and Motor Abilities Using Agent Technologies. Agent Technology and e-Health. Whitestein Series in Software Agent Technologies and Autonomic Computing, Birkhäuser Basel, pp 117–140
Dahl G, Yu D, Deng L, Acero A (2012) Context-dependent pre-trained deep neural networks for large vocabulary speech recognition. IEEE Trans Audio Speech Lang Process 20(1):30–42
Douglas-Cowie E, Cowie R, Sneddon I, Cox C, Lowry O, McRorie M, Martin JC, Devillers L, Batliner A (2007) The humaine database: addressing the needs of the affective. In: Paiva A, Prada R, Picard R (eds) 2nd international conference on affective computing and intelligent interaction (ACII’, 2007), LNCS, 4738. Lisbon, Portugal, pp 488–500
Esposito A, Vinciarelli A, Haykin S, Hussain A, Faundez-Zanuy M (2011) Cognitive behavioural systems. Cognit Comput 3(3):417–418
Esposito A, Fortunati L, Lugano G (2014) Modeling emotion, behavior and context in socially believable robots and ICT interfaces. Cognit Comput 6(4):623–627
Garlan D, Schmerl B (2007) The RADAR architecture for personal cognitive assistance. Int J Softw Eng Knowl Eng 17(2):171–190
Gnjatović M (2014) Therapist-centered design of a robot’s dialogue behavior. Cognit Comput 6(4):775–788
Gnjatović M (2014) Changing concepts of machine dialogue management. In: Proceedings of the 5th IEEE international conference on cognitive infocommunications, cogInfoCom 2014, Vietri sul Mare, Italy, pp 367–372
Gnjatović M, Delić V (2012) A cognitively-inspired method for meaning representation in dialogue systems. In: Proceedings of the 3th IEEE international conference on cognitive infocommunications, cogInfoCom 2012, Košice, Slovakia, pp 383–388
Gnjatović M, Delić V (2013) Electrophysiologically-inspired evaluation of dialogue act complexity. In: Proceedings of the 4th IEEE international conference on cognitive infocommunications, cogInfoCom 2013, Budapest, Hungary, pp 167–172
Gnjatović M, Delić V (2014) Cognitively-inspired representational approach to meaning in machine dialogue. Knowl-Based Syst 71:25–33
Gnjatović M, Janev M, Delić V (2012) Focus tree: modeling attentional information in task-oriented human-machine interaction. Appl Intell 37(3):305–320
Grosz B, Sidner C (1986) Attention, intentions, and the structure of discourse. Comput Linguist 12(3):175–204
Hawes N, Wyatt JL, Sridharan M, Jacobsson H, Dearden R, Sloman A, Kruijff G-J (2010) Architecture and representations. In: Christensen HI, Kruijff G-J, Wyatt JL (eds) Cognit Syst. Springer, Berlin, pp 51–93
Hermansky H (2006) A Brief description of DIRAC project. In: Proceedings of the euCognition inaugural meeting: the European network for the advancement of artificial cognitive systems, nice, France
Karreman DE, van Dijk EMAG, Evers V (2012) Using the visitor experiences for mapping the possibilities of implementing a robotic guide in outdoor sites. In: Proceedings of the 21st IEEE international symposium on robot and human interactive communication, RO-MAN 2012. France, Paris, pp 1059–1065
Kautz H, Fox D, Etzioni O, Borriello G, Arnstein L (2002) An Overview of the Assisted Cognition Project. In: Proc. of the AAAI (2002) workshop on Automation as Caregiver: The Role of Intelligent Technology in Elder Care. Edmonton, Alberta, Canada, pp 60–65
Kutas M, Federmeier KD (2000) Electrophysiology reveals semantic memory use in language comprehension. Trends Cognit Sci 4(12):463–470
Leishman TR, Cook DA (2002) Requirements Risks Can Drown Software Projects. CrossTalk: The Journal of Defense Software Engineering (April 2002), pp 4–8
MacLeod CM (2007) The concept of inhibition in cognition. In: Gorfein DS, MacLeod CM (eds) Inhibition in cognition. American Psychological Association, Washington, pp 3–23
Maheswaran R, Tambe M, Varakantham P, Myers K (2004) Adjustable autonomy challenges in personal assistant agents: a position paper. In: Nickles M, Rovatsos M, Weiss G (eds) Agents and computational autonomy. Lecture notes in computer science, vol 2969. Springer, Berlin, pp 187–194
Reithinger N, Gebhard P, Löckelt M, Ndiaye A, Pfleger N, Klesen M (2006) VirtualHuman—dialogic and emotional interaction with virtual characters. In: Proceedings of the eighth international conference on multimodal interfaces, (ICMI, 2006), Banff, Canada
Rickheit G, Wachsmuth I (1996) Collaborative research centre situated artificial communicators at the University of Bielefeld, Germany. In: Mc Kevitt P (ed) Integration of natural language and vision processing, Springer, Netherlands, pp 165–170
Roulet E (1992) On the structure of conversation as negotiation. In: Parret H, Verschueren J (eds) (On) Searle on conversation. John Benjamins Publishing Company, Amsterdam, pp 91–99
Saffran JR (2003) Statistical language learning: mechanisms and constraints. Curr Direct Psychol Sci 12(4):110–114
Schegloff EA (1968) Sequencing in conversational openings. Am Anthropol 70:1075–1095
Searle J (1992) Conversation. In: Parret H, Verschueren J (eds) (On) Searle on conversation. John Benjamins Publishing Company, Amsterdam, pp 7–29
Searle J (2002) Consciousness. Consciousness and language. Cambridge University Press, Cambridge
Searle J (2002) The problem of consciousness. Consciousness and language. Cambridge University Press, Cambridge
Siekmann J, Crocker MW (2010) Resource-Adaptive Cognitive Processes. In: Crocker MW, Siekmann J (eds) Resource-adaptive cognitive processes. Springer-Verlag, Berlin Heidelberg, pp 1–10
Sinclair J (2005) Corpus and text—basic principles. In: Wynne M (ed) Developing linguistic corpora: a guide to good practice. Oxbow Books, Oxford, pp 1–16
Tognini-Bonelli E (2001) Corpus linguistics at work. John Benjamins, Amsterdam
Tomasello M (2003) Constructing a Language: A Usage-Based Theory of Language Acquisition. Harvard University Press, Cambridge
Turkle S (2011) Alone together: why we expect more from technology and less from each other. Basic Books, New York
Vergados D, Alevizos A, Mariolis A, Caragiozidis M (2008) Intelligent Services for Assisting Independent Living of Elderly People at Home. In: Proceedings of the international ACM conference on pervasive technologies related to assistive environments (PETRA’08), Athens, Greece, 79:1–4
Wahlster W (2003) SmartKom: Symmetric multimodality in an adaptive and reusable dialogue shell. In: Krahl R, Günther D (eds) Proceedings of the human computer interaction status conference 2003, Berlin, Germany, pp 47–62
Waibel A, Steusloff H, Stiefelhagen R, Watson K (2009) Computers in the human interaction loop. In: Waibel A, Stiefelhagen R (eds) Computers in the human interaction loop. Springer, London, pp 3–6
Wendemuth A, Biundo S (2012) A companion technology for cognitive technical systems. In: Esposito A, Esposito AM, Vinciarelli A, Hoffmann R, Müller VC (eds) Cognitive behavioural systems, LNCS 7403. Springer, Berlin, pp 89–103
Wendemuth A, Braun J, Michaelis B, Ohl F, Rösner D, Scheich H, Warnemünde R (2008) Neurobiologically inspired, multimodal intention recognition for technical communication systems (NIMITEK). In: Proceedings of the 4th IEEE tutorial and research workshop on perception and interactive technologies for speech-based systems, PIT 2008, Kloster Irsee, Germany, pp 141–144
Wilks Y (2007) Is there progress on talking sensibly to machines? Science 318(5852):927–928
Wilks Y (2010) Introducing artificial companions. In: Wilks Y (ed) Close engagements with artificial companions. Key social, psychological, ethical and design issues. John Benjamins: Amsterdam, pp 11–20
Winkler I, Denham SL, Nelken I (2009) Modeling the auditory scene: predictive regularity representations and perceptual objects. Trends Cognit Sci 13(12):532–540
Ziemke T (2008) On the role of emotion in biological and robotic autonomy. BioSystems 91:401–408
Acknowledgments
The presented study was sponsored by the Ministry of Education, Science and Technological Development of the Republic of Serbia under the Research grants III44008 and TR32035. The responsibility for the content of this paper lies with the authors.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Gnjatović, M., Borovac, B. (2016). Toward Conscious-Like Conversational Agents. In: Esposito, A., Jain, L. (eds) Toward Robotic Socially Believable Behaving Systems - Volume II . Intelligent Systems Reference Library, vol 106. Springer, Cham. https://doi.org/10.1007/978-3-319-31053-4_4
Download citation
DOI: https://doi.org/10.1007/978-3-319-31053-4_4
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-31052-7
Online ISBN: 978-3-319-31053-4
eBook Packages: EngineeringEngineering (R0)