Abstract
Smart homes are a rapidly developing area in which voice-controlled devices are receiving special attention from major technology companies and researchers. Although this problem has been studied widely around the world, there has been no formal study for the Vietnamese language. In addition, many existing studies do not offer a solution that can be extended easily in the future. This paper provides software for collecting and processing speech, and shares a labeled and organized dataset of speech commands with the language research community. The study also designs Recurrent Neural Networks and evaluates them on the collected data. The average recognition accuracy on a set of 15 commands for controlling smart home devices is 98.19%. Finally, the paper presents the implementation and performance evaluation of the machine learning model on a Raspberry Pi-based intelligent home control unit.
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Hung, P.D., Giang, T.M., Nam, L.H., Duong, P.M., Van Thang, H., Diep, V.T. (2020). Smarthome Control Unit Using Vietnamese Speech Command. In: Vasant, P., Zelinka, I., Weber, GW. (eds) Intelligent Computing and Optimization. ICO 2019. Advances in Intelligent Systems and Computing, vol 1072. Springer, Cham. https://doi.org/10.1007/978-3-030-33585-4_29
DOI: https://doi.org/10.1007/978-3-030-33585-4_29
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-33584-7
Online ISBN: 978-3-030-33585-4
eBook Packages: Intelligent Technologies and Robotics (R0)