Skip to main content

Multiclass SVM/HMM Vowels Recognition System Towards Improving Human Computer Interaction

  • Conference paper
  • First Online:
Dependability Engineering and Complex Systems (DepCoS-RELCOMEX 2016)

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 470))

Included in the following conference series:

  • 1056 Accesses

Abstract

In this paper, we present experimental results on using vowels and one syllable words (VOSW) to activate some input device such as the control of mouse pointer on the screen. This type of Human Computer Interface might be used by a category of disabled person such as upper arm and finger stroke. Our goal is to design a system for the control of mouse cursor based on voice input command, using the pronunciation of certain vowels and some syllable words which can be produced even by persons affected by chronic inflammation of the larynx and vocal fold nodules. Linear predictive coefficients (LPC) and Mel Frequency Cepstral Coefficients (MFCCs) with derivatives are selected as features. Multiclass SVM then Hidden Markov Models (HMMs) have been tested as classifiers for matching components (vowels and short words). Tests and results using different features were are presented on tables, then the two classifiers were experimented independently and results were registered. Finally a verbal user interface has been designed and experienced on web page browser.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Harada, S., Landay, J.A., Malkin, J., Li, X., Bilmes, J.A.: The vocal joystick: evaluation of voice-based cursor control techniques. In: Proceedings of the ASSETS 2006, Portland, Oregon (2006)

    Google Scholar 

  2. Dai, L., Goldman, R., Sears, A., Lozier, J.: Speech-based cursor control: a study of grid-based solutions. SIGACCESS Access. Comput. (77–78), 94–101 (2004)

    Google Scholar 

  3. Igarashi, T., Hughes, J.F.: Voice as sound: using non-verbal voice input for interactive control. In: ACM UIST (2001)

    Google Scholar 

  4. Harada, S., Landay, J., Malkin, J., Li, X, Bilmes, J.: The vocal joystick: evaluation of voice-based cursor control techniques. In: ASSETS’06 (2006)

    Google Scholar 

  5. Sporka, A.J., Kurniawan, S.H., Slavík, P.: Acoustic control of mouse pointer. Univ. Access Inf. Soc. 4(3), 237–245 (2006)

    Article  Google Scholar 

  6. Rabiner, L.: A tutorial on hidden markov models and selected applications in speech recognition. In: Proceedings of IEEE (1989)

    Google Scholar 

  7. Wijoyo, T.S.: Speech recognition using linear predictive coding and artificial neural network for controlling movement of mobile robot. In: International Conference on Information and Electronics Engineering IPCSIT, vol. 6, pp. 179–183 (2011)

    Google Scholar 

  8. Shaneh, M., Taheri, A.: Voice command recognition system based on MFCC and VQ algorithms. World Acad. Sci. Eng. Technol. 57, 534–538 (2009)

    Google Scholar 

  9. Hadri, A., Boughazi, M., Fezari, M.: Improvement of arabic digits recognition rate based in the parameters choice. In: Proceedings of International Conference CISA Annaba (2008)

    Google Scholar 

  10. Bilmes, J.A. et al.: The vocal joystick: a voice-based human-computer interface for individuals with motor impairments. UWEE Technical Report Number UWEETR-2005-0007 (2005)

    Google Scholar 

  11. Abdeen, M., Moshammad, H., Yagoub, M.C.E.: An Architecture for Multi-Lingual Hands Free Desktop Control System for PC Windows, Niagara Falls. IEEE, Canada (2008)

    Google Scholar 

  12. Eggermont, P.P.B., LaRiccia, V.N..: Maximum Penalized Likelihood Estimation Volume I: Density Estimation. Springer, New York (2001)

    Google Scholar 

  13. Salem, C., ZhaS, I.: An isometric tongue pointing device. CHI 97 Electronic Publications: Technical Notes (1997)

    Google Scholar 

  14. Sporka, A.J., Kurniawan, S.H., Mahmud, M., Slavík, P.: Tonal control of mouse cursor (U3I): a usability study with the elderly. In: Proceedings of the HCI International 2005. Lawrence Erlbaum Associates (2005)

    Google Scholar 

  15. Cortes, C., Vapnik, V.: Support vector networks. Mach. Learn. pp. 273–297 (1995)

    Google Scholar 

  16. Burges, C.J.C.: A tutorial on support vector machines for pattern recognition. In: Data Mining and Knowledge Discovery, vol. 2, pp. 121–167 (1998)

    Google Scholar 

  17. Campbell, W.M., Sturim, D.E., Reynolds, D.A.: Support vector machines using gmm super vectors for speaker verification. In: IEEE Signal Processing Letters, vol. 13, pp. 308–311 (2006)

    Google Scholar 

  18. Chapelle, O., Haffner, P., Vapnik, V.: SVMs for Histogram-Based Image Classification (1999)

    Google Scholar 

  19. Putzer, M., Koreman, J.: A German Databse for a Pattern for Vacal Fold Vibration, Phonus 3. Institute of Phonetics, University of the Saarland, pp. 143–153(1997)

    Google Scholar 

  20. Fezari, M., Al-Dahoud, A.: An approach for: improving voice command processor based on better features and classifiers selection. In: The 13th International Arab Conference on Information Technology ACIT’2012, 10–13 Dec 2012, pp. 1–5 (2012)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ali Al-Dahoud .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Al-Dahoud, A., Fezari, M., Wassila, A., Al-Rawashdeh, T. (2016). Multiclass SVM/HMM Vowels Recognition System Towards Improving Human Computer Interaction. In: Zamojski, W., Mazurkiewicz, J., Sugier, J., Walkowiak, T., Kacprzyk, J. (eds) Dependability Engineering and Complex Systems. DepCoS-RELCOMEX 2016. Advances in Intelligent Systems and Computing, vol 470. Springer, Cham. https://doi.org/10.1007/978-3-319-39639-2_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-39639-2_1

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-39638-5

  • Online ISBN: 978-3-319-39639-2

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics