Speech-based cursor control: a study of grid-based solutions

Published: 01 September 2003

ABSTRACT

Speech recognition can be a powerful tool for human-computer interaction. Many researchers are investigating speech recognition systems for dictation-based activities, and this work has produced dramatic improvements in recent years. However, the same experimentation has confirmed that recognition errors and the delays inherent in speech recognition lead to unacceptably long task completion times and unacceptable error rates for cursor control tasks. This study explores the potential of a speech-controlled, grid-based cursor control mechanism. An experiment evaluated two alternative grid-based solutions, both using 3×3 grids. The first provided a single cursor in the middle of the grid. The second allowed users to select a target using any of nine cursors. The results confirm that the nine-cursor solution allowed users to select targets of varying size, distance, and direction significantly faster than the one-cursor solution. Overall, the results are encouraging when compared with earlier evaluations of other speech-based cursor control solutions.
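The abstract does not spell out how the grid is operated, so the following is only a minimal sketch of one plausible reading: the user speaks a digit to zoom into one cell of a 3×3 grid, and a target can be selected once a cursor lies on it. The recursive zooming, the row-major 1 to 9 cell numbering, and every name here (`subcell`, `cursors`, `refinements_needed`) are assumptions made for illustration, not details taken from the study.

```python
from typing import List, Optional, Tuple

Rect = Tuple[float, float, float, float]  # (x, y, width, height)
Point = Tuple[float, float]


def subcell(region: Rect, digit: int) -> Rect:
    """Return cell `digit` (1..9, row-major) of a 3x3 grid laid over `region`."""
    if not 1 <= digit <= 9:
        raise ValueError("digit must be in 1..9")
    x, y, w, h = region
    row, col = divmod(digit - 1, 3)
    return (x + col * w / 3, y + row * h / 3, w / 3, h / 3)


def center(region: Rect) -> Point:
    x, y, w, h = region
    return (x + w / 2, y + h / 2)


def cursors(region: Rect, nine_cursors: bool) -> List[Point]:
    """One cursor in the middle of the grid, or one in the middle of each cell."""
    if nine_cursors:
        return [center(subcell(region, d)) for d in range(1, 10)]
    return [center(region)]


def inside(point: Point, target: Rect) -> bool:
    px, py = point
    x, y, w, h = target
    return x <= px <= x + w and y <= py <= y + h


def refinements_needed(target: Rect, screen: Rect, nine_cursors: bool,
                       max_depth: int = 12) -> Optional[int]:
    """Count zoom steps until some cursor lies on the target (assumed model).

    This counts only grid refinements; the command that picks one of the
    nine cursors, and any recognition errors or delays, are not modeled.
    """
    region = screen
    tx, ty = center(target)
    for steps in range(max_depth + 1):
        if any(inside(c, target) for c in cursors(region, nine_cursors)):
            return steps
        # Zoom into the cell that contains the target's center.
        x, y, w, h = region
        col = min(2, max(0, int((tx - x) / (w / 3))))
        row = min(2, max(0, int((ty - y) / (h / 3))))
        region = subcell(region, row * 3 + col + 1)
    return None


if __name__ == "__main__":
    screen = (0.0, 0.0, 900.0, 900.0)
    target = (100.0, 100.0, 100.0, 100.0)
    print(refinements_needed(target, screen, nine_cursors=True))
    print(refinements_needed(target, screen, nine_cursors=False))
```

Under this assumed model, the nine-cursor layout already has a cursor at the center of every cell, so it can reach a target with fewer zoom steps than the single central cursor. The sketch says nothing about the measured selection times reported in the study.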


Published in

Assets '04: Proceedings of the 6th international ACM SIGACCESS conference on Computers and accessibility
October 2004
202 pages
ISBN: 158113911X
DOI: 10.1145/1028630

ACM SIGACCESS Accessibility and Computing
Sept. 2003 - Jan. 2004
192 pages
ISSN: 1558-2337
EISSN: 1558-1187
DOI: 10.1145/1029014

Copyright © 2003 ACM

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery, New York, NY, United States


Acceptance Rates

Assets '04 Paper Acceptance Rate: 25 of 47 submissions, 53%
Overall Acceptance Rate: 436 of 1,556 submissions, 28%
