Speech Recognition Model Inspired on Large Language Model for Smart Grid Dispatching

Author: Zhongtao Chen

PEAI '24: Proceedings of the 2024 International Conference on Power Electronics and Artificial Intelligence

Pages 439 - 442

https://doi.org/10.1145/3674225.3674303

Published: 31 July 2024

Abstract

In recent years, large language models have gained popularity across many domains, and their core component, the Transformer, has drawn particular attention for its performance. This paper aims to improve the accuracy of speech recognition for intelligent power grid dispatching using deep learning, specifically CNN and Transformer architectures. The proposed approach builds a specialized corpus for power dispatch speech recognition that covers dispatch-specific terminology and regional grid dispatch language. The acoustic model is trained with deep neural networks as its basic framework. Inspired by the success of Transformers in large language models, we use a Transformer as the language model to further improve prediction performance. Practical results show that the Transformer-based power dispatch speech recognition outperforms traditional speech recognition frameworks. The system developed with this approach has been deployed and validated in a regional grid control center, confirming its feasibility and effectiveness.
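
For readers unfamiliar with the Transformer mentioned in the abstract, the following is a minimal sketch of scaled dot-product attention, the standard operation at the core of Transformer models. It is not the paper's implementation: the abstract gives no model configuration, so the function names, the single-head setup, and the toy input vectors are all assumptions made for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def scaled_dot_product_attention(queries, keys, values):
    """Return (outputs, weights) for a single attention head.

    queries: list of n vectors of size d
    keys:    list of m vectors of size d
    values:  list of m vectors of size d_v
    """
    d = len(keys[0])
    outputs, weights = [], []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(d).
        scores = [
            sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in keys
        ]
        w = softmax(scores)
        # Output is the attention-weighted average of the value vectors.
        out = [
            sum(wj * v[c] for wj, v in zip(w, values))
            for c in range(len(values[0]))
        ]
        weights.append(w)
        outputs.append(out)
    return outputs, weights


# Toy input (hypothetical numbers): three token vectors attending to themselves.
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
outputs, weights = scaled_dot_product_attention(tokens, tokens, tokens)
```

Each row of `weights` is a probability distribution over the input positions, so every output vector is a context-dependent mixture of the values. In a language model this is how the prediction for one token can draw on the other tokens in the sequence.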

Index Terms

Speech Recognition Model Inspired on Large Language Model for Smart Grid Dispatching
1. Computing methodologies
  1. Artificial intelligence
    1. Control methods
    2. Natural language processing
      1. Speech recognition
  2. Machine learning
    1. Machine learning approaches
      1. Neural networks

Published In

PEAI '24: Proceedings of the 2024 International Conference on Power Electronics and Artificial Intelligence

January 2024

969 pages

ISBN:9798400716638

DOI:10.1145/3674225

Copyright © 2024 ACM.

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Research-article
Research
Refereed limited

Conference

PEAI 2024

PEAI 2024: 2024 International Conference on Power Electronics and Artificial Intelligence

January 19 - 21, 2024

Xiamen, China
