DOI: 10.1145/1499224.1499285
research-article

Cultural voice markers in speech-to-speech machine translation systems

Published: 20 February 2009

ABSTRACT

Current implementations of real-time speech-to-speech (S2S) translation systems for intercultural collaboration have focused mainly on the accuracy of speech recognition and of the translated content. Typically, the translated utterance is presented to users through text-to-speech (TTS), without conveying cultural nuances in the tone of voice. This study investigates whether there are cross-cultural markers of variation in voice dynamics and, if so, whether they affect user satisfaction. Based on subjective user evaluations (Chinese and English), we conclude that there are salient cross-cultural voice markers relevant to the interaction of culture and system design, with a noticeable impact on user satisfaction in TTS and S2S systems.

      • Published in

        IWIC '09: Proceedings of the 2009 international workshop on Intercultural collaboration
        February 2009
        342 pages
        ISBN:9781605585024
        DOI:10.1145/1499224

        Copyright © 2009 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Acceptance Rates

        Overall Acceptance Rate: 47 of 77 submissions, 61%
