Skip to main content

Smarthome Control Unit Using Vietnamese Speech Command

  • Conference paper
  • First Online:
Intelligent Computing and Optimization (ICO 2019)

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1072))

Included in the following conference series:

Abstract

Smart home is a very hot development area in which voice-based control devices are receiving special attention from major technology companies and researchers. Despite many studies on this problem in the world, there has not been a formal study for the Vietnamese language. In addition, many studies did not offer a solution that can be expanded easily in the future. This paper provides a speech collection and processing software and shares a dataset of speech commands is labeled and organized to the language research community. This study also designs and evaluates Recurrent Neural Networks to apply it to the data collected. The average recognition accuracy on the set of 15 commands for controlling smart home devices is 98.19%. Finally, the paper presents the implementation and performance evaluation of machine learning model on a Raspberry PI-based intelligent home control unit.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. https://www.statista.com/outlook/389/100/smart-appliances/worldwide#market-users. Accessed 10 Apr 2019

  2. Ribeiro, F.C., Carvalho, R.T.S., Cortez, P.C., Albuquerque, V.H.C.D., Filho, P.P.R.: Binary neural networks for classification of voice commands from throat microphone. IEEE Access 6, 70130–70144 (2018). https://doi.org/10.1109/ACCESS.2018.2881199

    Article  Google Scholar 

  3. Sidiq, M., Budi, W.T.A., Sa’adah, S.: Design and implementation of voice command using MFCC and HMMs method. In: 2015 3rd International Conference on Information and Communication Technology (ICoICT), Nusa Dua, pp. 375–380 (2015). https://doi.org/10.1109/icoict.2015.7231454

  4. Guiming, D., Xia, W., Guangyan, W., Yan, Z., Dan, L.: Speech recognition based on convolutional neural networks. In: 2016 IEEE International Conference on Signal and Image Processing (ICSIP), Beijing, pp. 708–711 (2016). https://doi.org/10.1109/siprocess.2016.7888355

  5. Bae, J., Kim, D.: End-to-end speech command recognition with capsule network. In: Proceedings of Interspeech, pp. 776–780 (2018). https://doi.org/10.21437/interspeech.2018-1888

  6. https://github.com/VSC-FU2019/VSC_FU

  7. Smith III., J.O.: Spectral audio signal processing. https://www.dsprelated.com/freebooks/sasp/. Accessed 10 Apr 2019

  8. Josh, P., Adam, G.: Deep Learning, A Practitioner’s Approach, Chap. 4. O’Reilly Media, Inc., Sebastopol (2017)

    Google Scholar 

  9. https://www.tensorflow.org/tutorials/sequences/audio_recognition. Accessed 10 Apr 2019

  10. Andradea, D.C.D., Leob, S., Da Silva Vianac, M.L., Bernkopf, C.: A neural attention model for speech command recognition (2018). https://arxiv.org/pdf/1808.08929.pdf

  11. Tutorial. https://appcodelabs.com/introduction-to-iot-build-an-mqtt-server-using-raspberry-pi. Accessed 10 Apr 2019

  12. Tutorial. https://techtutorialsx.com/2017/04/09/esp8266-onnecting-to-mqtt-broker/. Accessed 10 Apr 2019

  13. Nam, N.T., Hung, P.D.: Pest detection on traps using deep convolutional neural networks. In: Proceedings of the 2018 International Conference on Control and Computer Vision (ICCCV 2018), pp. 33–38. ACM, New York (2018). https://doi.org/10.1145/3232651.3232661

  14. Hung, P.D., Linh, D.Q.: Implementing an Android application for automatic Vietnamese business card recognition. Pattern Recognit. Image Anal. 29, 156 (2019). https://doi.org/10.1134/S1054661819010188

    Article  Google Scholar 

  15. Hung, P.D.: Detection of central sleep apnea based on a single-lead ECG. In: Proceedings of the 2018 5th International Conference on Bioinformatics Research and Applications (ICBRA 2018), pp. 78–83. ACM, New York (2018). https://doi.org/10.1145/3309129.3309132

  16. Hung, P.D.: Central sleep apnea detection using an accelerometer. In: Proceedings of the 2018 International Conference on Control and Computer Vision (ICCCV 2018), pp. 106–111. ACM, New York (2018). https://doi.org/10.1145/3232651.3232660

  17. Nam, N.T., Hung, P.D.: Padding methods in convolutional sequence model: an application in Japanese handwriting recognition. In: Proceedings of the 3rd International Conference on Machine Learning and Soft Computing (ICMLSC 2019), pp. 138–142. ACM, New York (2019). https://doi.org/10.1145/3310986.3310998

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Phan Duy Hung .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Hung, P.D., Giang, T.M., Nam, L.H., Duong, P.M., Van Thang, H., Diep, V.T. (2020). Smarthome Control Unit Using Vietnamese Speech Command. In: Vasant, P., Zelinka, I., Weber, GW. (eds) Intelligent Computing and Optimization. ICO 2019. Advances in Intelligent Systems and Computing, vol 1072. Springer, Cham. https://doi.org/10.1007/978-3-030-33585-4_29

Download citation

Publish with us

Policies and ethics