Unvoiced Speech Recognition Algorithm Based on Myoelectric Signal

Authors: Jianrong He, Xin'an Wang, Xing Zhang, Bo Wang, Qiuping Li, Changpei Qiu

The Key Lab of IMS, School of ECE, Peking University Shenzhen Graduate School, Shenzhen, China

ICMLC '20: Proceedings of the 2020 12th International Conference on Machine Learning and Computing, February 2020, Pages 450–456. https://doi.org/10.1145/3383972.3384029

Published: 26 May 2020

ABSTRACT

This paper presents our study of automatic speech recognition based on the myoelectric signal (MES), applied to the recognition of Chinese pronunciation. The facial myoelectric signal records the activity of the articulatory apparatus and thus allows spoken words to be recognized even when they are produced silently. Communication of this kind is robust to ambient noise and can be used by patients with speech difficulties. Active segments were extracted from the raw data using a moving-average method combined with threshold comparison. Based on the characteristics of the MES, time-domain and frequency-domain coefficients, wavelet energy, and Mel-frequency cepstral coefficients (MFCC) were extracted from the raw data as features for speech recognition. Principal component analysis (PCA) was applied to reduce the dimensionality and generate a 24-dimensional feature vector. We examined different classifiers with optimized parameters and found that the support vector machine (SVM) classifier performed best. Our final system achieved an accuracy of 90.04% on a task of 10 Chinese digit words. These results suggest that myoelectric speech recognition has strong potential for silent speech applications.
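The segmentation step named in the abstract (a moving average combined with a threshold comparison) can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `segment_activity`, the window length, the threshold expressed as a fraction of the peak envelope, and the minimum segment length are all assumptions chosen for illustration, since the abstract does not give these details.

```python
import numpy as np

def segment_activity(signal, window=50, threshold_ratio=0.2, min_length=20):
    """Return (start, end) sample index pairs of active segments.

    Hypothetical parameters (not from the paper):
      window          - moving-average length in samples
      threshold_ratio - threshold as a fraction of the peak envelope
      min_length      - shortest segment (in samples) that is kept
    """
    signal = np.asarray(signal, dtype=float)
    # Remove the DC offset and rectify so that muscle activity is positive.
    rectified = np.abs(signal - signal.mean())
    # Moving average of the rectified signal gives a smooth envelope.
    kernel = np.ones(window) / window
    envelope = np.convolve(rectified, kernel, mode="same")
    # Threshold comparison: samples above the threshold count as active.
    active = envelope > threshold_ratio * envelope.max()
    # Locate rising and falling edges of the boolean activity mask.
    padded = np.concatenate(([False], active, [False]))
    edges = np.diff(padded.astype(int))
    starts = np.flatnonzero(edges == 1)
    ends = np.flatnonzero(edges == -1)  # end index is exclusive
    return [(int(s), int(e)) for s, e in zip(starts, ends) if e - s >= min_length]

# Usage on a synthetic recording: silence with one burst of activity.
demo = np.zeros(1000)
demo[400:600] = np.tile([1.0, -1.0], 100)
print(segment_activity(demo))
```

Because the envelope is smoothed, each detected segment extends slightly beyond the true burst on both sides; a shorter window tightens the boundaries at the cost of a noisier envelope.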

Index Terms

1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing
      1. Speech recognition
  2. Machine learning
    1. Machine learning algorithms
      1. Feature selection
2. Hardware
  1. Communication hardware, interfaces and storage
    1. Signal processing systems
      1. Digital signal processing

Published in

ICMLC '20: Proceedings of the 2020 12th International Conference on Machine Learning and Computing
February 2020
607 pages
ISBN: 9781450376426
DOI: 10.1145/3383972

Copyright © 2020 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
Mel Frequency Cepstral Coefficient (MFCC)
Speech recognition
feature extraction
myoelectric signal (MES)
segmentation
support vector machine (SVM)
Qualifiers
- research-article
- Research
- Refereed limited