research-article

Sentiment analysis of speech prosody for dialogue adaptation in a diet suggestion program

Authors:

Scott Crouch,

Rajiv KhoslaAuthors Info & Claims

ACM SIGHIT Record, Volume 2, Issue 1

Page 8

https://doi.org/10.1145/2180796.2180800

Published: 01 March 2012 Publication History

Get Access

Abstract

In recent years, programs have been developed which allow robots to engage in simple dialogues with hospital and aged-care patients in order to provide information on and give healthrelated advice. To enable the robot to be persuasive and be accepted by the patient, it must not only understand their responses, but also understand their emotional state [1]. This information can then be used to modify the robot's responses. In an example dialogue, the robot asks whether the patient believes they overeat, to which the patient might respond "I don't overeat". If the patient has responded in a negative emotional tone, this may indicate a refusal to acknowledge the problem rather than the absence of it. In addition, the robot needs to learn to avoid responses which may provoke the patient. At that stage the goal of the robot is to convince the patient to acknowledge the problem before developing ways to solve it.

One method of collecting data about patients' emotional state is to analyze prosodic features of their speech. Prosodic features are the patterns of frequency, energy (volume), and rate of speech. Prosodic features have been known for a long time to reflect the speaker's emotional state, as was first documented by Charles Darwin in The Descent of Man [2], which also showed that, even in other animals whose vocalizations contain no linguistic properties, feelings can be expressed.

The main motivation for the development of this software is to improve upon a diet-suggestion dialogue system currently being developed and tested in aged-care homes.1 The elderly subject engages in dialogue with a health care robot, which provides suggestions to that person's diet, whilst also raising their motivation levels, and improve their perception of the robotic agent.

References

[1]

V. Carofiglio, F. Rosis, and N. Novielli. Dynamic User modeling in health promotion dialogs. Affective computing and intelligent interaction, pages 723--730, 2005

Digital Library

Google Scholar

[2]

C. Darwin. The Descent Of Man And Selection In Relation To Sex-Vol. 1. John Murray, London, 1871.

Google Scholar

Cited By

View all

Mohanty MMohanty M(2022)Verbal sentiment analysis and detection using recurrent neural networkAdvanced Data Mining Tools and Methods for Social Computing10.1016/B978-0-32-385708-6.00012-6(85-106)Online publication date: 2022
https://doi.org/10.1016/B978-0-32-385708-6.00012-6
Soleymani MGarcia DJou BSchuller BChang SPantic M(2017)A survey of multimodal sentiment analysisImage and Vision Computing10.1016/j.imavis.2017.08.00365(3-14)Online publication date: Sep-2017
https://doi.org/10.1016/j.imavis.2017.08.003
Dahl D(2012)Natural Language Processing: Past, Present and FutureMobile Speech and Advanced Natural Language Solutions10.1007/978-1-4614-6018-3_4(49-73)Online publication date: 12-Dec-2012
https://doi.org/10.1007/978-1-4614-6018-3_4

Recommendations

Prosody modification for speech recognition in emotionally mismatched conditions

A degradation in the performance of automatic speech recognition systems (ASR) is observed in mismatched training and testing conditions. One of the reasons for this degradation is due to the presence of emotions in the speech. The main objective of ...
Neutral Speech to Anger Speech Conversion Using Prosody Modification
MIKE 2013: Proceedings of the First International Conference on Mining Intelligence and Knowledge Exploration - Volume 8284

In this paper, the dynamics of prosodic features are exploited for speech emotion conversion. In particular, emotion conversion of neutral speech to anger speech is accomplished. The database used for analysis of prosody is the Indian Institute of ...
Prosody conversion from neutral speech to emotional speech

Emotion is an important element in expressive speech synthesis. Unlike traditional discrete emotion simulations, this paper attempts to synthesize emotional speech by using "strong", "medium", and "weak" classifications. This paper tests different ...

Comments

Information & Contributors

Information

Published In

ACM SIGHIT Record Volume 2, Issue 1

March 2012

27 pages

EISSN:2158-8813

DOI:10.1145/2180796

Issue’s Table of Contents

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 March 2012

Published in SIGHIT Volume 2, Issue 1

Check for updates

Qualifiers

Research-article

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

3
Total Citations
View Citations
178
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 05 Mar 2025

Other Metrics

View Author Metrics

Citations

Cited By

View all

Mohanty MMohanty M(2022)Verbal sentiment analysis and detection using recurrent neural networkAdvanced Data Mining Tools and Methods for Social Computing10.1016/B978-0-32-385708-6.00012-6(85-106)Online publication date: 2022
https://doi.org/10.1016/B978-0-32-385708-6.00012-6
Soleymani MGarcia DJou BSchuller BChang SPantic M(2017)A survey of multimodal sentiment analysisImage and Vision Computing10.1016/j.imavis.2017.08.00365(3-14)Online publication date: Sep-2017
https://doi.org/10.1016/j.imavis.2017.08.003
Dahl D(2012)Natural Language Processing: Past, Present and FutureMobile Speech and Advanced Natural Language Solutions10.1007/978-1-4614-6018-3_4(49-73)Online publication date: 12-Dec-2012
https://doi.org/10.1007/978-1-4614-6018-3_4

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Abstract

References

Cited By

Recommendations

Prosody modification for speech recognition in emotionally mismatched conditions

Neutral Speech to Anger Speech Conversion Using Prosody Modification

Prosody conversion from neutral speech to emotional speech

Comments

Information

Published In

Publisher

Publication History

Check for updates

Qualifiers

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

Login options

Full Access

View options

PDF

eReader

Share

Share this Publication link

Share on social media

Affiliations