skip to main content
10.1145/1294211.1294238acmconferencesArticle/Chapter ViewAbstractPublication PagesuistConference Proceedingsconference-collections
Article

Gestures without libraries, toolkits or training: a $1 recognizer for user interface prototypes

Published:07 October 2007Publication History

ABSTRACT

Although mobile, tablet, large display, and tabletop computers increasingly present opportunities for using pen, finger, and wand gestures in user interfaces, implementing gesture recognition largely has been the privilege of pattern matching experts, not user interface prototypers. Although some user interface libraries and toolkits offer gesture recognizers, such infrastructure is often unavailable in design-oriented environments like Flash, scripting environments like JavaScript, or brand new off-desktop prototyping environments. To enable novice programmers to incorporate gestures into their UI prototypes, we present a "$1 recognizer" that is easy, cheap, and usable almost anywhere in about 100 lines of code. In a study comparing our $1 recognizer, Dynamic Time Warping, and the Rubine classifier on user-supplied gestures, we found that $1 obtains over 97% accuracy with only 1 loaded template and 99% accuracy with 3+ loaded templates. These results were nearly identical to DTW and superior to Rubine. In addition, we found that medium-speed gestures, in which users balanced speed and accuracy, were recognized better than slow or fast gestures for all three recognizers. We also discuss the effect that the number of templates or training examples has on recognition, the score falloff along recognizers' N-best lists, and results for individual gestures. We include detailed pseudocode of the $1 recognizer to aid development, inspection, extension, and testing.

References

  1. Anderson, D., Bailey, C. and Skubic, M. (2004) Hidden Markov Model symbol recognition for sketch-based interfaces. AAAI Fall Symposium. Menlo Park, CA: AAAI Press, 15--21.Google ScholarGoogle Scholar
  2. Cao, X. and Balakrishnan, R. (2003) VisionWand: Interaction techniques for large displays using a passive wand tracked in 3D. Proc. UIST '03. New York: ACM Press, 173--182. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Cao, X. and Balakrishnan, R. (2005) Evaluation of an on-line adaptive gesture interface with command prediction. Proc. Graphics Interface '05. Waterloo, Ontario: CHCCS, 187--194. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Cho, M.G. (2006) A new gesture recognition algorithm and segmentation method of Korean scripts for gesture-allowed ink editor. Information Sciences 176 (9), 1290--1303. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Guimbretière, F., Stone, M. and Winograd, T. (2001) Fluid interaction with high-resolution wall-size displays. Proc. UIST '01. New York: ACM Press, 21--30. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Henry, T.R., Hudson, S.E. and Newell, G.L. (1990) Integrating gesture and snapping into a user interface toolkit. Proc. UIST '90. New York: ACM Press, 112--122. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Hinckley, K., Ramos, G., Guimbretiere, F., Baudisch, P. and Smith, M. (2004) Stitching: Pen gestures that span multiple displays. Proc. AVI '04. New York: ACM Press, 23--31. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Hong, J.I. and Landay, J.A. (2000) SATIN: A toolkit for informal ink-based applications. Proc. UIST '00. New York: ACM Press, 63--72. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Kara, L.B. and Stahovich, T.F. (2004) An image-based trainable symbol recognizer for sketch-based interfaces. AAAI Fall Symposium. Menlo Park, CA: AAAI Press, 99--105.Google ScholarGoogle Scholar
  10. Karlson, A.K., Bederson, B.B. and SanGiovanni, J. (2005) AppLens and LaunchTile: Two designs for one-handed thumb use on small devices. Proc. CHI '05. New York: ACM Press, 201--210. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Kristensson, P. and Zhai, S. (2004) SHARK2: A large vocabulary shorthand writing system for pen-based computers. Proc. UIST '04. New York: ACM Press, 43--52. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Landay, J. and Myers, B.A. (1993) Extending an existing user interface toolkit to support gesture recognition. Adjunct Proc. CHI '93. New York: ACM Press, 91--92. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Lin, J., Newman, M.W., Hong, J.I. and Landay, J.A. (2000) DENIM: Finding a tighter fit between tools and practice for web site design. Proc. CHI '00. New York: ACM Press, 510--517. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Long, A.C., Landay, J.A. and Rowe, L.A. (1999) Implications for a gesture design tool. Proc. CHI '99. New York: ACM Press, 40--47. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Mitchell, T.M. (1997) Machine Learning. New York: McGraw-Hill. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Morris, M.R., Huang, A., Paepcke, A. and Winograd, T. (2006) Cooperative gestures: Multi-user gestural interactions for co-located groupware. Proc. CHI '06. New York: ACM Press, 1201--1210. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Myers, B.A., McDaniel, R.G., Miller, R.C., Ferrency, A.S., Faulring, A., Kyle, B.D., Mickish, A., Klimovitski, A. and Doane, P. (1997) The Amulet environment: New models for effective user interface software development. IEEE Trans. Software Engineering 23 (6), 347--365. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Myers, C.S. and Rabiner, L.R. (1981) A comparative study of several dynamic time-warping algorithms for connected word recognition. The Bell System Technical J. 60 (7), 1389--1409.Google ScholarGoogle ScholarCross RefCross Ref
  19. Notowidigdo, M. and Miller, R.C. (2004) Off-line sketch interpretation. AAAI Fall Symposium. Menlo Park, CA: AAAI Press, 120--126.Google ScholarGoogle Scholar
  20. Pittman, J.A. (1991) Recognizing handwritten text. Proc. CHI '91. New York: ACM Press, 271--275. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Plamondon, R. and Srihari, S.N. (2000) On-line and off-line handwriting recognition: A comprehensive survey. IEEE Trans. Pattern Analysis & Machine Int. 22 (1), 63--84. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Press, W.H., Teukolsky, S.A., Vetterling, W.T. and Flannery, B.P. (1992) Numerical Recipes in C. Cambridge Univ. Press.Google ScholarGoogle Scholar
  23. Rubine, D. (1991) Specifying gestures by example. Proc. SIGGRAPH '91. New York: ACM Press, 329--337. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Salvador, S. and Chan, P. (2004) FastDTW: Toward accurate dynamic time warping in linear time and space. 3rd Wkshp. on Mining Temporal and Sequential Data, ACM KDD '04. Seattle, Washington (August 22--25, 2004).Google ScholarGoogle Scholar
  25. Sezgin, T.M. and Davis, R. (2005) HMM-based efficient sketch recognition. Proc. IUI '05. New York: ACM Press, 281--283. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Stojmenovi., M., Nayak, A. and Zunic, J. (2006) Measuring linearity of a finite set of points. Proc. CIS '06. Los Alamitos, CA: IEEE Press, 1--6.Google ScholarGoogle Scholar
  27. Swigart, S. (2005) Easily write custom gesture recognizers for your Tablet PC applications. Tablet PC Technical Articles.Google ScholarGoogle Scholar
  28. Tappert, C.C. (1982) Cursive script recognition by elastic matching. IBM J. of Research & Development 26 (6), 765--771.Google ScholarGoogle Scholar
  29. Tappert, C.C., Suen, C.Y. and Wakahara, T. (1990) The state of the art in online handwriting recognition. IEEE Trans. Pattern Analysis & Machine Int. 12 (8), 787--808. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Vermunt, J.K. (1997) Log-linear Models for Event Histories. Thousand Oaks, CA: Sage Publications.Google ScholarGoogle Scholar
  31. Wilson, A.D. and Shafer, S. (2003) XWand: UI for intelligent spaces. Proc. CHI '03. New York: ACM Press, 545--552. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Zhai, S. and Kristensson, P. (2003) Shorthand writing on stylus keyboard. Proc. CHI '03. New York: ACM Press, 97--104. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Gestures without libraries, toolkits or training: a $1 recognizer for user interface prototypes

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader