Volume adaptation and visualization by modeling the volume level in noisy environments for telepresence system

Authors: Akira Hayamizu, Keisuke Nakamura, Kazuhiro Nakadai

HAI '14: Proceedings of the second international conference on Human-agent interaction

Pages 67 - 74

https://doi.org/10.1145/2658861.2658875

Published: 29 October 2014

Abstract

The Lombard effect is the involuntary tendency of speakers to increase their vocal effort when speaking in loud noise in order to make their voice more audible. The effect causes a problem in telecommunication: a speaker talks louder than necessary for a conversation partner at a remote location. This paper proposes a volume model for automatically adjusting the volume of an operator's voice during remote communication via a telepresence robot, and develops LombaBot, an optimal volume control system that runs on a telepresence robot and uses the model. The volume model uses the noise level measured around the robot and the distance between a conversation partner and the robot to adjust the volume of the operator's voice. It provides two types of volume adjustment, called comfortable volume and secret talk volume. LombaBot enables people at the remote site to listen comfortably to the robot operator's voice. Moreover, the operator can speak at a low volume when he or she wants to talk in secret with nearby people. We confirmed that LombaBot properly adjusted the volume of an operator's voice in a noisy remote location.
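The abstract gives the inputs of the volume model (noise level around the robot, distance to the conversation partner) and its two modes, but not its equations. The following is only a minimal sketch of how such a mapping could look, not the authors' model. It assumes a fixed speech-over-noise margin per mode and free-field spreading loss of about 6 dB per doubling of distance. The function name `target_output_level` and every constant are hypothetical.

```python
import math

# All constants are hypothetical illustration values, not taken from the paper.
SNR_MARGIN_DB = {
    "comfortable": 15.0,  # assumed speech-over-noise margin for easy listening
    "secret": 3.0,        # assumed smaller margin so that only a nearby listener hears
}
REFERENCE_DISTANCE_M = 1.0  # distance at which the loudspeaker level is defined
MIN_LEVEL_DB = 40.0         # assumed lower limit of the loudspeaker
MAX_LEVEL_DB = 90.0         # assumed upper limit of the loudspeaker


def target_output_level(noise_db: float, distance_m: float,
                        mode: str = "comfortable") -> float:
    """Return an assumed loudspeaker level in dB at the reference distance."""
    if distance_m <= 0:
        raise ValueError("distance_m must be positive")
    if mode not in SNR_MARGIN_DB:
        raise ValueError(f"unknown mode: {mode!r}")
    # Assumed free-field spreading loss: 20*log10 of the distance ratio.
    spreading_loss_db = 20.0 * math.log10(distance_m / REFERENCE_DISTANCE_M)
    level = noise_db + SNR_MARGIN_DB[mode] + spreading_loss_db
    # Clamp to the range the loudspeaker is assumed to support.
    return min(MAX_LEVEL_DB, max(MIN_LEVEL_DB, level))


if __name__ == "__main__":
    # Compare both modes for 60 dB of ambient noise and a listener 2 m away.
    for mode in ("comfortable", "secret"):
        print(mode, round(target_output_level(60.0, 2.0, mode), 1))
```

In this sketch the two modes differ only in the margin over the measured noise, so a real system would need margins and a propagation model fitted to measurements rather than the placeholder values used here.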

Supplementary Material

suppl.mov (hai014.wmv): supplemental video, 9.99 MB

Index Terms

Human-centered computing

Published In

HAI '14: Proceedings of the second international conference on Human-agent interaction

October 2014

412 pages

ISBN:9781450330350

DOI:10.1145/2658861

General Chairs: Hideaki Kuzuoka (University of Tsukuba), Tetsuo Ono (Hokkaido University)
Program Chairs: Michita Imai (Keio University), James E. Young (University of Manitoba)

Copyright © 2014 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

In-Cooperation

SIGCHI: ACM Special Interest Group on Computer-Human Interaction
University of Tsukuba

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Research-article

Conference

HAI '14: The Second International Conference on Human-Agent Interaction

October 29 - 31, 2014

Tsukuba, Japan

Acceptance Rates

HAI '14 Paper Acceptance Rate 27 of 62 submissions, 44%;

Overall Acceptance Rate 121 of 404 submissions, 30%

Article Metrics

Total Citations: 11
Total Downloads: 197
Downloads (Last 12 months): 18
Downloads (Last 6 weeks): 6

Reflects downloads up to 17 Feb 2025

Citations

Cited By

Matsushima K, Kubota T, Murakami H, Sato S, Ogawa K (2024). Voice Volume Gauge to Encourage Vocal Adaptation of an Operator of a Teleoperated Social Robot. Proceedings of the 12th International Conference on Human-Agent Interaction, pp. 91-99. Online publication date: 24-Nov-2024.
https://dl.acm.org/doi/10.1145/3687272.3688313
Tuttosi P, Hughson E, Matsufuji A, Zhang C, Lim A (2023). Read the Room: Adapting a Robot's Voice to Ambient and Social Contexts. 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 3998-4005. Online publication date: 1-Oct-2023.
https://doi.org/10.1109/IROS55552.2023.10341925
Rueben M, Syed M, London E, Camarena M, Shin E, Zhang Y, Wang T, Groechel T, Lee R, Matarić M (2021). Long-Term, in-the-Wild Study of Feedback about Speech Intelligibility for K-12 Students Attending Class via a Telepresence Robot. Proceedings of the 2021 International Conference on Multimodal Interaction, pp. 567-576. Online publication date: 18-Oct-2021.
https://dl.acm.org/doi/10.1145/3462244.3479893
Higuchi S, Oku H (2021). Wide angular range dynamic projection mapping method applied to drone-based avatar robot. Advanced Robotics 35(11), pp. 675-684. Online publication date: 19-May-2021.
https://doi.org/10.1080/01691864.2021.1928550
Osawa M, Okuoka K, Takimoto Y, Imai M (2020). Is Automation Appropriate? Semi-autonomous Telepresence Architecture Focusing on Voluntary and Involuntary Movements. International Journal of Social Robotics 12(5), pp. 1119-1134. Online publication date: 1-Feb-2020.
https://doi.org/10.1007/s12369-020-00620-5
Myodo E, Xu J, Tasaka K, Yanagihara H, Sakazawa S (2018). [Invited Paper] Issues and Solutions to Informal Communication in Working from Home Using a Telepresence Robot. ITE Transactions on Media Technology and Applications 6(1), pp. 30-45. Online publication date: 2018.
https://doi.org/10.3169/mta.6.30
Neustaedter C, Singhal S, Pan R, Heshmat Y, Forghani A, Tang J (2018). From Being There to Watching. ACM Transactions on Computer-Human Interaction 25(6), pp. 1-39. Online publication date: 13-Dec-2018.
https://dl.acm.org/doi/10.1145/3243213
Neustaedter C, Venolia G, Procyk J, Hawkins D, Gergle D, Morris M, Bjørn P, Konstan J (2016). To Beam or Not to Beam. Proceedings of the 19th ACM Conference on Computer-Supported Cooperative Work & Social Computing, pp. 418-431. Online publication date: 27-Feb-2016.
https://dl.acm.org/doi/10.1145/2818048.2819922
Misawa K, Rekimoto J, Mase K, Langheinrich M, Gatica-Perez D, Van Laerhoven K, Terada T (2015). Wearing another's personality. Proceedings of the 2015 ACM International Symposium on Wearable Computers, pp. 125-132. Online publication date: 7-Sep-2015.
https://dl.acm.org/doi/10.1145/2802083.2808392
Misawa K, Rekimoto J, Begole B, Kim J, Inkpen K, Woo W (2015). ChameleonMask. Proceedings of the 33rd Annual ACM Conference Extended Abstracts on Human Factors in Computing Systems, pp. 401-411. Online publication date: 18-Apr-2015.
https://dl.acm.org/doi/10.1145/2702613.2732506
