DOI: 10.1145/971478.971481
Article

Experimental evaluation of vision and speech based multimodal interfaces

Published: 15 November 2001

ABSTRACT

Progress in computer vision and speech recognition technologies has recently enabled multimodal interfaces that use speech and gestures. These technologies offer promising alternatives to existing interfaces because they emulate the natural way in which humans communicate. However, no systematic work has been reported that formally evaluates the new speech/gesture interfaces. This paper is concerned with formal experimental evaluation of new human-computer interactions enabled by speech and hand gestures. The paper describes an experiment conducted with 23 subjects that evaluates selection strategies for interaction with large screen displays. The multimodal interface designed for this experiment does not require the user to be in physical contact with any device. Video cameras and long-range microphones are used as input for the system. Three selection strategies are evaluated, and results for different target sizes and positions are reported in terms of accuracy, selection times, and user preference. Design implications for vision/speech based interfaces are inferred from these results. This study also raises new questions and topics for future research.

Published in

PUI '01: Proceedings of the 2001 workshop on Perceptive user interfaces
November 2001
241 pages
ISBN: 9781450374736
DOI: 10.1145/971478

Copyright © 2001 ACM

Publisher: Association for Computing Machinery, New York, NY, United States
