skip to main content
10.1145/3674225.3674303acmotherconferencesArticle/Chapter ViewAbstractPublication PagespeaiConference Proceedingsconference-collections
research-article

Speech Recognition Model Inspired on Large Language Model for Smart Grid Dispatching

Published: 31 July 2024 Publication History

Abstract

In recent years, large language models have gained popularity across various domains, with particular attention given to the impressive performance of their core component, the Transformer. This paper aims to enhance the accuracy of intelligent power grid dispatch speech recognition by leveraging deep learning techniques, specifically CNN and Transformer architectures. The proposed approach involves the creation of a specialized corpus tailored specifically for power dispatch speech recognition, focusing on power dispatch-specific terminology and regional grid dispatch language. The acoustic model training utilizes deep neural networks as the fundamental framework. Inspired by the success of Transformers in large language models, we incorporate Transformers as the language model to further enhance prediction performance. The practical results highlight the superiority of the Transformer-based power dispatch speech recognition compared to traditional speech recognition frameworks. With an impressive accuracy in power dispatch speech recognition, the developed system based on this approach has been successfully deployed and validated in a regional grid control center, affirming its feasibility and effectiveness.

References

[1]
Fang X, Misra S, Xue G, Smart grid—The new and improved power grid: A survey [J]. IEEE communications surveys & tutorials, 2011, 14(4): 944-980.
[2]
Tuballa M L, Abundo M L. A review of the development of Smart Grid technologies [J]. Renewable and Sustainable Energy Reviews, 2016, 59: 710-725.
[3]
Ma R, Chen H H, Huang Y R, Smart grid communication: Its challenges and opportunities [J]. IEEE transactions on Smart Grid, 2013, 4(1): 36-46.
[4]
Ipakchi, Ali, and Farrokh Albuyeh. "Grid of the future." IEEE power and energy magazine 7.2. 2009: 52-62.
[5]
Rolnick D, Donti P L, Kaack L H, Tackling climate change with machine learning [J]. ACM Computing Surveys (CSUR), 2022, 55(2): 1-96.
[6]
Yang L Q, Jiang B C, Wang C Y, Application of Human-computer Speech Interaction in Dispatching System [J]. Advanced Materials Research, 2012, 588: 1204-1207.
[7]
Sun H, Wang Z, Wang J, Data-driven power outage detection by social sensors [J]. IEEE Transactions on Smart Grid, 2016, 7(5): 2516-2524.
[8]
Chua L O, Roska T. The CNN paradigm [J]. IEEE Transactions on Circuits and Systems I: Fundamental Theory and Applications, 1993, 40(3): 147-156.
[9]
He K, Gkioxari G, Dollár P, Mask r-cnn [C]//Proceedings of the IEEE international conference on computer vision. 2017: 2961-2969.
[10]
Girshick R. Fast r-cnn [C]//Proceedings of the IEEE international conference on computer vision. 2015: 1440-1448.
[11]
Paterlini-Brechot P, Benali N L. Circulating tumor cells (CTC) detection: clinical impact and future directions [J]. Cancer letters, 2007, 253(2): 180-204.
[12]
Kim S, Hori T, Watanabe S. Joint CTC-attention based end-to-end speech recognition using multi-task learning [C]//2017 IEEE international conference on acoustics, speech and signal processing (ICASSP). IEEE, 2017: 4835-4839.
[13]
Watanabe S, Hori T, Kim S, Hybrid CTC/attention architecture for end-to-end speech recognition [J]. IEEE Journal of Selected Topics in Signal Processing, 2017, 11(8): 1240-1253.
[14]
Hill-Yardin E L, Hutchinson M R, Laycock R, A Chat (GPT) about the future of scientific publishing [J]. Brain Behav Immun, 2023, 110: 152-154.
[15]
Biswas S S. Role of chat gpt in public health [J]. Annals of biomedical engineering, 2023, 51(5): 868-869.
[16]
Han K, Xiao A, Wu E, Transformer in transformer [J]. Advances in Neural Information Processing Systems, 2021, 34: 15908-15919.
[17]
Rao R M, Liu J, Verkuil R, MSA transformer [C]//International Conference on Machine Learning. PMLR, 2021: 8844-8856.
[18]
Zhao H, Jiang L, Jia J, Point transformer [C]//Proceedings of the IEEE/CVF international conference on computer vision. 2021: 16259-16268.
[19]
Zhang H, Yuan T, Chen J, Paddlespeech: An easy-to-use all-in-one speech toolkit [J]. arXiv preprint arXiv:2205.12007, 2022.
[20]
Pratap V, Hannun A, Xu Q, Wav2letter++: A fast open-source speech recognition system [C]//ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2019: 6460-6464.
[21]
Vaswani A, Shazeer N, Parmar N, Attention is all you need [J]. Advances in neural information processing systems, 2017, 30.
[22]
Yu Y, Si X, Hu C, A review of recurrent neural networks: LSTM cells and network architectures [J]. Neural computation, 2019, 31(7): 1235-1270.
[23]
Devlin J, Chang M W, Lee K, Bert: Pre-training of deep bidirectional transformers for language understanding [J]. arXiv preprint arXiv:1810.04805, 2018.
[24]
Han K, Xiao A, Wu E, Transformer in transformer [J]. Advances in Neural Information Processing Systems, 2021, 34: 15908-15919.
[25]
Rao R M, Liu J, Verkuil R, MSA transformer [C]//International Conference on Machine Learning. PMLR, 2021: 8844-8856.
[26]
Bjorck N, Gomes C P, Selman B, Understanding batch normalization [J]. Advances in neural information processing systems, 2018, 31.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
PEAI '24: Proceedings of the 2024 International Conference on Power Electronics and Artificial Intelligence
January 2024
969 pages
ISBN:9798400716638
DOI:10.1145/3674225
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 31 July 2024

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

PEAI 2024

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 37
    Total Downloads
  • Downloads (Last 12 months)37
  • Downloads (Last 6 weeks)5
Reflects downloads up to 17 Feb 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media