ABSTRACT
To enable voice operation of the Smart Water Supply System, a noise reduction algorithm for voice signals in industrial environments is studied and simulated. The simulation results show that the algorithm effectively improves the signal-to-noise ratio. With WebRTC voice processing technology as the front end, a voice processing module is integrated into the Smart Water Supply System and tested. The results show that the speech recognition system achieves 98% recognition accuracy in a real operating environment, with a device response time of no more than 1500 ms, which meets the requirements for voice operation of the Smart Water Supply System.
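The abstract reports an improvement in signal-to-noise ratio but does not say how it was measured. As a minimal sketch, assuming a clean reference signal is available in simulation, the improvement could be computed as the difference between the SNR after and before denoising. The function names `snr_db` and `snr_improvement_db` are hypothetical and are not taken from the paper.

```python
import math


def snr_db(clean, observed):
    """SNR in dB, treating (observed - clean) as the noise component.

    Assumes `clean` and `observed` are equal-length sequences of samples.
    """
    n = len(clean)
    signal_power = sum(s * s for s in clean) / n
    noise_power = sum((o - s) ** 2 for s, o in zip(clean, observed)) / n
    if noise_power == 0:
        return math.inf  # observed matches the clean reference exactly
    return 10.0 * math.log10(signal_power / noise_power)


def snr_improvement_db(clean, noisy, denoised):
    """SNR gain in dB achieved by a denoiser (hypothetical helper)."""
    return snr_db(clean, denoised) - snr_db(clean, noisy)


if __name__ == "__main__":
    # Toy samples for illustration only, not data from the paper.
    clean = [1.0, -1.0, 1.0, -1.0]
    noisy = [1.1, -0.9, 1.1, -0.9]
    denoised = [1.01, -0.99, 1.01, -0.99]
    print(f"SNR gain: {snr_improvement_db(clean, noisy, denoised):.1f} dB")
```

In a real deployment no clean reference exists, so the noise power would have to be estimated another way, for example from non-speech segments.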
Index Terms
- Research on the application of voice interaction technology based on WebRTC in the Smart Water Supply System