Abstract
In this paper, we present experimental results on using vowels and one syllable words (VOSW) to activate some input device such as the control of mouse pointer on the screen. This type of Human Computer Interface might be used by a category of disabled person such as upper arm and finger stroke. Our goal is to design a system for the control of mouse cursor based on voice input command, using the pronunciation of certain vowels and some syllable words which can be produced even by persons affected by chronic inflammation of the larynx and vocal fold nodules. Linear predictive coefficients (LPC) and Mel Frequency Cepstral Coefficients (MFCCs) with derivatives are selected as features. Multiclass SVM then Hidden Markov Models (HMMs) have been tested as classifiers for matching components (vowels and short words). Tests and results using different features were are presented on tables, then the two classifiers were experimented independently and results were registered. Finally a verbal user interface has been designed and experienced on web page browser.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Harada, S., Landay, J.A., Malkin, J., Li, X., Bilmes, J.A.: The vocal joystick: evaluation of voice-based cursor control techniques. In: Proceedings of the ASSETS 2006, Portland, Oregon (2006)
Dai, L., Goldman, R., Sears, A., Lozier, J.: Speech-based cursor control: a study of grid-based solutions. SIGACCESS Access. Comput. (77–78), 94–101 (2004)
Igarashi, T., Hughes, J.F.: Voice as sound: using non-verbal voice input for interactive control. In: ACM UIST (2001)
Harada, S., Landay, J., Malkin, J., Li, X, Bilmes, J.: The vocal joystick: evaluation of voice-based cursor control techniques. In: ASSETS’06 (2006)
Sporka, A.J., Kurniawan, S.H., SlavÃk, P.: Acoustic control of mouse pointer. Univ. Access Inf. Soc. 4(3), 237–245 (2006)
Rabiner, L.: A tutorial on hidden markov models and selected applications in speech recognition. In: Proceedings of IEEE (1989)
Wijoyo, T.S.: Speech recognition using linear predictive coding and artificial neural network for controlling movement of mobile robot. In: International Conference on Information and Electronics Engineering IPCSIT, vol. 6, pp. 179–183 (2011)
Shaneh, M., Taheri, A.: Voice command recognition system based on MFCC and VQ algorithms. World Acad. Sci. Eng. Technol. 57, 534–538 (2009)
Hadri, A., Boughazi, M., Fezari, M.: Improvement of arabic digits recognition rate based in the parameters choice. In: Proceedings of International Conference CISA Annaba (2008)
Bilmes, J.A. et al.: The vocal joystick: a voice-based human-computer interface for individuals with motor impairments. UWEE Technical Report Number UWEETR-2005-0007 (2005)
Abdeen, M., Moshammad, H., Yagoub, M.C.E.: An Architecture for Multi-Lingual Hands Free Desktop Control System for PC Windows, Niagara Falls. IEEE, Canada (2008)
Eggermont, P.P.B., LaRiccia, V.N..: Maximum Penalized Likelihood Estimation Volume I: Density Estimation. Springer, New York (2001)
Salem, C., ZhaS, I.: An isometric tongue pointing device. CHI 97 Electronic Publications: Technical Notes (1997)
Sporka, A.J., Kurniawan, S.H., Mahmud, M., SlavÃk, P.: Tonal control of mouse cursor (U3I): a usability study with the elderly. In: Proceedings of the HCI International 2005. Lawrence Erlbaum Associates (2005)
Cortes, C., Vapnik, V.: Support vector networks. Mach. Learn. pp. 273–297 (1995)
Burges, C.J.C.: A tutorial on support vector machines for pattern recognition. In: Data Mining and Knowledge Discovery, vol. 2, pp. 121–167 (1998)
Campbell, W.M., Sturim, D.E., Reynolds, D.A.: Support vector machines using gmm super vectors for speaker verification. In: IEEE Signal Processing Letters, vol. 13, pp. 308–311 (2006)
Chapelle, O., Haffner, P., Vapnik, V.: SVMs for Histogram-Based Image Classification (1999)
Putzer, M., Koreman, J.: A German Databse for a Pattern for Vacal Fold Vibration, Phonus 3. Institute of Phonetics, University of the Saarland, pp. 143–153(1997)
Fezari, M., Al-Dahoud, A.: An approach for: improving voice command processor based on better features and classifiers selection. In: The 13th International Arab Conference on Information Technology ACIT’2012, 10–13 Dec 2012, pp. 1–5 (2012)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Al-Dahoud, A., Fezari, M., Wassila, A., Al-Rawashdeh, T. (2016). Multiclass SVM/HMM Vowels Recognition System Towards Improving Human Computer Interaction. In: Zamojski, W., Mazurkiewicz, J., Sugier, J., Walkowiak, T., Kacprzyk, J. (eds) Dependability Engineering and Complex Systems. DepCoS-RELCOMEX 2016. Advances in Intelligent Systems and Computing, vol 470. Springer, Cham. https://doi.org/10.1007/978-3-319-39639-2_1
Download citation
DOI: https://doi.org/10.1007/978-3-319-39639-2_1
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-39638-5
Online ISBN: 978-3-319-39639-2
eBook Packages: EngineeringEngineering (R0)