skip to main content
10.1145/2658861.2658875acmotherconferencesArticle/Chapter ViewAbstractPublication PageshaiConference Proceedingsconference-collections
research-article

Volume adaptation and visualization by modeling the volume level in noisy environments for telepresence system

Published: 29 October 2014 Publication History

Abstract

The Lombard effect is the involuntary tendency of speakers to increase their vocal effort when speaking in a loud noise to enhance the audibility of their voice. There is a problem in telecommunication due to the Lombard effect. A speaker talks at a louder volume than necessary for the conversation partner at a remote location. This paper proposes a volume model that is required in order to automatically adjust the volume of an operator's voice at a remote communication via a telepresence robot, and develops an optimal volume control system LombaBot equipped on a telepresence robot with the model. The volume model measures the level of noise around the robot and the distance between a conversation partner and the robot to adjust the volume of the operator's voice. It has two types of volume adjustments. Those are called comfortable volume and secret talk volume. LombaBot enables people at a remote site to listen comfortably to the voice of a robot operator. Moreover, the operator is able to talk in low voices when s/he wants to talk in secret with nearby people. We confirmed that LombaBot adjusted the volume of an operator's voice properly in the noisy remote location.

Supplementary Material

suppl.mov (hai014.wmv)
Supplemental video

References

[1]
Texai: http://www.willowgarage.com/pages/texai/principles
[2]
QB: https://www.anybots.com/
[3]
Min Kyung Lee et al. ""Now, I Have a Body": Uses and Social Norms for Mobile Remote Presence in the Workplace", in Proc. of CHI2011, pp.33--42, 2011.
[4]
F. Tanaka et al., "Child-operated telepresence robot: A field trial connecting classrooms between Australia and Japan," in Proc. of IEEE/RAS IROS2013, pp. 5896 - 5901, 2013.
[5]
Lombard, E. "Le signe de le elevation de la voix," Ann. Malad. l'Orielle. Larynx. Nez. Pharynx 37, 101--119, 1911.
[6]
Barbara Hilsenbeck et al., "Listening for people: Exploiting the spectral stracture of speech to robustly perceive the presence of people" in Proc. of IEEE/RAS IROS 2011, pp. 2903--2909, 2011.
[7]
A, Deleforge et al., "The Cocktail Party Robot: Sound Source Separation and Localisation with an Active Binaural Head," in Proc. of HRI2012,pp.431--438,2012.
[8]
R.Takeda et al., "ICA-Based efficient dereverberation and echo cancellation method for barge-in-able robot audition" in Proc. of IEEE ICASSP2009, pp. 3367--3680, 2009.
[9]
A.Berkhout et al., "Acoustic control by wave field synthesis," J. acoust. Soc. Amer., vol.93, pp. 2764--2778, 1993.
[10]
D.Malhan et al., "3-D sound spatialization using ambisonic techniques," J. Conput. Music, vol. 19, no. 4, pp. 58--70, 1995
[11]
V. Pullki, "Virtual sound source positioning using vector base amplitude panning" J. Audio Eng. Soc., vol. 45, pp. 456--466, 1997.
[12]
Myung-Suk Song et al.,"An Interactive 3-D Audio System With Loudspeakes". IEEE TRANSACTION ON MULTIMEDIA, vol.13, no.5,pp. 844--855, 2011.
[13]
Goldenberg et.al, "R.,The Lombard Effect's Influence on Automatic Speaker Verification Systems and Methods for its Compensation, Information Technology: Research and Education,pp.233--237, 2006.
[14]
Ogawa,T et.al.,"Adequacy Analysis of Simulation-based Assessment of Speech Recognition System", " in Proc. of IEEE ICASSP'98, pp. 1153--1156, 1998.
[15]
John H.L.Hansen et.al., "Analysis and Compensation of Lombard Speech Across Noise Type and Levels With Application to In-Set/Out-of-Set Speaker Recognition",IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LAUNGUAGE PROCESSING, Vol.17, No.2, pp.366--378, 2009.
[16]
W. Van Summers, D. B. Pisoni, R. Bernacki, R. Pedlow, and M. Stokes, "Effects of noise on speech production:Acoustical and perceptualanalyses," J. Acous. Soc. Amer., pp. 917--928, Sep. 1988.
[17]
Andreas Paepcke et al., "Yelling In the Hall: Using Sidetone to Address a Problem with Mobile Remoto Presence Systems," in Proc. of ACM UIST2011, pp 107--116, 2011.
[18]
A.Kimura et al, "Visual Feedback: Its effect on teleconferencing," in proceeding of HCI international, pp.591--600, 2007
[19]
P.L.Chu, "Voice-Activated AGC for Teleconferencing," in Proc. of IEEE ICASSP'96, pp. 929--932, 1996.
[20]
G.R.Steber, "Digital Signal Processing In Automatic Gain Control Systems," Industrial Electonics Society(IECON), pp. 381--384,1988.
[21]
J. J. Lopez et al, "Measurement of cross-talk cancelation and equalization zones in 3-D sound reproduction under real listening conditions," in Proc. of Audio Engineering Society 16th Int. Conf.,1999.
[22]
E. T. Hall, The Hidden Dimension. Doubleday, NY, 1966.
[23]
M. L. Waltes, "Human Approach Distance to a Mechanical-Looking Robot with Different Robot Voice Styles,", in Proc. of IEEE RO-MAN2008, pp 707--712, 2008. http://www.sunrisemusic.co.jp/database/database00.html
[24]
Kazuhiro Nakadai, Toru Takahashi, Hiroshi G.Okuno, Hirofumi Nakajima, Yuji Hasegawa, Huroshi Tsujino: Design and Implementation of Robot Audition System "HARK," Advanced Robotics, vol.24 pp.739--761, 2010. HARK Main Page: http://winnie.kuis.kyoto-u.ac.jp/HARK/ .
[25]
Morgan Quigley, Brian Gerkey, Ken Conley, Josh, Faust, Tully Foote, Jeremy Leibs, Eric Berger, RobWheeler, Andrew Ng: ROS: an open-source Robot Operating System in IEEE-RAS International Conference on Robotics and Automation (ICRA) Work shop on Open Source Software in Robotics, 2009.
[26]
ROS: http://www.ros.org

Cited By

View all
  • (2024)Voice Volume Gauge to Encourage Vocal Adaptation of an Operator of a Teleoperated Social RobotProceedings of the 12th International Conference on Human-Agent Interaction10.1145/3687272.3688313(91-99)Online publication date: 24-Nov-2024
  • (2023)Read the Room: Adapting a Robot's Voice to Ambient and Social Contexts2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)10.1109/IROS55552.2023.10341925(3998-4005)Online publication date: 1-Oct-2023
  • (2021)Long-Term, in-the-Wild Study of Feedback about Speech Intelligibility for K-12 Students Attending Class via a Telepresence RobotProceedings of the 2021 International Conference on Multimodal Interaction10.1145/3462244.3479893(567-576)Online publication date: 18-Oct-2021
  • Show More Cited By

Index Terms

  1. Volume adaptation and visualization by modeling the volume level in noisy environments for telepresence system

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    HAI '14: Proceedings of the second international conference on Human-agent interaction
    October 2014
    412 pages
    ISBN:9781450330350
    DOI:10.1145/2658861
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    In-Cooperation

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 29 October 2014

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. adjusting volume
    2. tele-communication
    3. tele-presence robot
    4. volume model

    Qualifiers

    • Research-article

    Conference

    HAI '14

    Acceptance Rates

    HAI '14 Paper Acceptance Rate 27 of 62 submissions, 44%;
    Overall Acceptance Rate 121 of 404 submissions, 30%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)18
    • Downloads (Last 6 weeks)6
    Reflects downloads up to 17 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Voice Volume Gauge to Encourage Vocal Adaptation of an Operator of a Teleoperated Social RobotProceedings of the 12th International Conference on Human-Agent Interaction10.1145/3687272.3688313(91-99)Online publication date: 24-Nov-2024
    • (2023)Read the Room: Adapting a Robot's Voice to Ambient and Social Contexts2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)10.1109/IROS55552.2023.10341925(3998-4005)Online publication date: 1-Oct-2023
    • (2021)Long-Term, in-the-Wild Study of Feedback about Speech Intelligibility for K-12 Students Attending Class via a Telepresence RobotProceedings of the 2021 International Conference on Multimodal Interaction10.1145/3462244.3479893(567-576)Online publication date: 18-Oct-2021
    • (2021)Wide angular range dynamic projection mapping method applied to drone-based avatar robotAdvanced Robotics10.1080/01691864.2021.192855035:11(675-684)Online publication date: 19-May-2021
    • (2020)Is Automation Appropriate? Semi-autonomous Telepresence Architecture Focusing on Voluntary and Involuntary MovementsInternational Journal of Social Robotics10.1007/s12369-020-00620-512:5(1119-1134)Online publication date: 1-Feb-2020
    • (2018)[Invited Paper] Issues and Solutions to Informal Communication in Working from Home Using a Telepresence RobotITE Transactions on Media Technology and Applications10.3169/mta.6.306:1(30-45)Online publication date: 2018
    • (2018)From Being There to WatchingACM Transactions on Computer-Human Interaction10.1145/324321325:6(1-39)Online publication date: 13-Dec-2018
    • (2016)To Beam or Not to BeamProceedings of the 19th ACM Conference on Computer-Supported Cooperative Work & Social Computing10.1145/2818048.2819922(418-431)Online publication date: 27-Feb-2016
    • (2015)Wearing another's personalityProceedings of the 2015 ACM International Symposium on Wearable Computers10.1145/2802083.2808392(125-132)Online publication date: 7-Sep-2015
    • (2015)ChameleonMaskProceedings of the 33rd Annual ACM Conference Extended Abstracts on Human Factors in Computing Systems10.1145/2702613.2732506(401-411)Online publication date: 18-Apr-2015
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media