Abstract
For a successful speech-controlled human-computer interaction (HCI) the pure textual information as well as individual skills, preferences, and affective states of the user have to be known. However, verbal human interaction consists of several information layers. Apart from pure textual information, further details regarding the speaker’s feelings, believes, and social relations are transmitted. The additional information is encoded through acoustics. Especially, the intonation reveals details about the speakers communicative relation and their attitude towards the ongoing dialogue.
Since the intonation is influenced by semantic and grammatical information, it is advisable to investigate the intonation of so-called discourse particles (DPs) as “hm” or “uhm”. They cannot be inflected but can be emphasised. DPs have the same intonation curves (pitch-contours) as whole sentences and thus may indicate the same functional meanings. For German language J. E. Schmidt empirically discovered seven types of form-function-concurrences on the isolated DP “hm”.
To determine the function within the dialogue of the DPs, methods are needed that preserve pitch-contours and are feasible to assign defined form-prototypes. Furthermore, it must be investigated which pitch-contours occur in naturalistic HCI and whether these contours are congruent with the findings by linguists.
In this paper we present first results on the extraction and correlation of the DP “hm”. We investigate the different form-function-relations in the naturalistic LAST MINUTE corpus and determine expectable formfunction relations in naturalistic HCI in general.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Allwood, J., Nivre, J., Ahlsn, E.: On the semantics and pragmatics of linguistic feedback. Journal of Semantics 9(1), 1–26 (1992)
Benus, S., Gravana, A., Hirschberg, J.: The Prosody of Backchannels in American Englisch. In: Proceedings of the 16th International Congress of Phonetic Sciences, Saarbrcken, Germany, pp. 1065–1068 (2007)
Fischer, K., Wrede, B., Brindpke, C., Johanntokrax, M.: Quantitative und funktionale Analysen von Diskurspartikeln im Computer Talk. International Journal for Language Data Processing 20(1-2), 85–100 (1996)
Gerhard, D.: Pitch Extraction and Fundamental Frequency: History and Current Techniques. Tech. Rep. TR-CS 2003-06, Regina, Saskatchewan, Canada (2003)
Kehrein, R., Rabanus, S.: Ein Modell zur funktionalen Beschreibung von Diskurspartikeln. In: Neue Wege der Intonationsforschung. Germanistische Linguistik, vol. 157-158, pp. 33–50. Georg Olms Verlag, Hildesheim (2001)
Krippendorff, K.: Content Analysis: An Introduction to Its Methodology, 3rd edn. SAGE Publications, Thousand Oaks (2012)
Ladd, R.D.: Intonational Phonology. Studies in Linguistics, vol. 79. Cambridge University Press (1996)
Landis, J.R., Koch, G.G.: The measurement of observer agreement for categorical data. Biometrics 33, 159–174 (1977)
Lange, J., Frommer, J.: Subjektives Erleben und intentionale Einstellung in Interviews zur Nutzer-Companion-Interaktion (Subjective experience and intentional setting within intervies of User-Companion-Interaction). In: Informatik 2011: Informatik schafft Communities, Beitrge der 41. Jahrestagung der GI. Lecture Notes in Informatics, vol. 192, p. 240 (2011)
Rabiner, L., Cheng, M.J., Rosenberg, A.E., McGonegal, C.A.: A comparative performance study of several pitch detection algorithms. IEEE Trans. on ASSP 24, 399–417 (1976)
Rösner, D., Friesen, R., Otto, M., Lange, J., Haase, M., Frommer, J.: Intentionality in interacting with companion systems – an empirical approach. In: Jacko, J.A. (ed.) Human-Computer Interaction, Part III, HCII 2011. LNCS, vol. 6763, pp. 593–602. Springer, Heidelberg (2011)
Schmidt, J.E.: Bausteine der Intonation. In: Neue Wege der Intonationsforschung, Germanistische Linguistik, vol. 157-158, pp. 9–32. Georg Olms Verlag, Hildesheim (2001)
Schuller, B., Batliner, A., Steidl, S., Seppi, D.: Recognising realistic emotions and affect in speech: State of the art and lessons learnt from the first challenge. Speech Commun. 53(9-10), 1062–1087 (2011)
Siegert, I., Böck, R., Wendemuth, A.: The influence of context knowledge for multimodal annotation on natural material. In: Proc. of the First Workshop on Multimodal Analyses enabling Artificial Agents in Human-Machine Interaction (MA3), Santa Cruz, USA (2012)
Ward, N.: Pragmatic functions of prosodic features in non-lexical utterances. In: Proceedings of Speech Prosody 2004, Nara, Japan, pp. 325–328 (2004)
Wendemuth, A., Biundo, S.: A Companion Technology for Cognitive Technical Systems. In: Esposito, A., Esposito, A.M., Vinciarelli, A., Hoffmann, R., Müller, V.C. (eds.) COST 2102. LNCS, vol. 7403, pp. 89–103. Springer, Heidelberg (2012)
Zeng, Z., Pantic, M., Roisman, G.I., Huang, T.S.: A Survey of Affect Recognition Methods: Audio, Visual, and Spontaneous Expressions. IEEE Trans. on Pattern Analysis and Machine Intelligence 31, 39–58 (2009)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Siegert, I., Prylipko, D., Hartmann, K., Böck, R., Wendemuth, A. (2014). Investigating the Form-Function-Relation of the Discourse Particle “hm” in a Naturalistic Human-Computer Interaction. In: Bassis, S., Esposito, A., Morabito, F. (eds) Recent Advances of Neural Network Models and Applications. Smart Innovation, Systems and Technologies, vol 26. Springer, Cham. https://doi.org/10.1007/978-3-319-04129-2_39
Download citation
DOI: https://doi.org/10.1007/978-3-319-04129-2_39
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-04128-5
Online ISBN: 978-3-319-04129-2
eBook Packages: EngineeringEngineering (R0)