skip to main content
10.1145/3647444.3647871acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicimmiConference Proceedingsconference-collections
research-article

Contextual transcription and Summarization of audio using AI

Published: 13 May 2024 Publication History

Abstract

The field of Natural Language Processing (NLP) has revolutionized the way human language interacts with computer systems. NLP applications span machine translation, information extraction, summarization, and question answering, driven by vast computational resources and big data methodologies. Despite these advancements, NLP tools haven't fully integrated with Internet of Things (IoT) devices, like audio recorders, hindering their accessibility and usability. This paper introduces an innovative solution: a method for audio transcription and contextual summarization using NLP, addressing this gap and enhancing comprehension. Our approach employs cutting-edge NLP techniques, including word embedding methods and knowledge-based graphs, to create a system that efficiently converts audio content into written text and generates coherent summaries. Unlike existing AI tools, our system's summaries are not only accurate but also rich and deep, providing insightful representations of the original content. This depth is achieved through advanced linguistic analysis, surpassing tools like ChatGPT. Furthermore, our system breaks language barriers, enabling multilingual data traversal, enhancing accessibility on a global scale. Our research methodology ensures the system's adherence to industry standards like Request for Comments (RFC) and Constrained Application Protocol (CoAP), guaranteeing interoperability and reliability. By incorporating knowledge-based graphs, our system comprehensively understands audio content, enhancing the accuracy of summarization. This approach addresses the unmet need for seamlessly integrating NLP with IoT devices, making the technology accessible to a broader audience.

References

[1]
Saleh, A. A., & Weigang, L. (2023). TxLASM: A Novel Language Agnostic Summarization Model for Text Documents. Expert Systems with Applications, 121433.
[2]
Feng, X., Zhang, Y., Meng, M. H., Li, Y., Joe, C. E., Wang, Z., & Bai, G. (2023). Detecting contradictions from IoT protocol specification documents based on neural generated knowledge graph. ISA transactions.
[3]
Alkhaled, L., Adewumi, T., & Sabry, S. S. (2023). Bipol: A Novel Multi-Axes Bias Evaluation Metric with Explainability for NLP. arXiv preprint arXiv:2304.04029.
[4]
Fan, J., Tian, X., Lv, C., Zhang, S., Wang, Y., & Zhang, J. (2023). Extractive social media text summarization based on MFMMR-BertSum. Array, 100322.
[5]
Gidiotis, A., & Tsoumakas, G. (2023). Bayesian active summarization. Computer Speech & Language, 83, 101553.
[6]
Bano, S., Khalid, S., Tairan, N. M., Shah, H., & Khattak, H. A. (2023). Summarization of scholarly articles using BERT and BiGRU: Deep learning-based extractive approach. Journal of King Saud University-Computer and Information Sciences, 35(9), 101739.
[7]
Moro, G., & Ragazzi, L. (2023). Align-Then-Abstract Representation Learning for Low-Resource Summarization. Neurocomputing, 126356.
[8]
Bhuyan, S. S., Mahanta, S. K., Pakray, P., & Favre, B. (2023). Textual entailment as an evaluation metric for abstractive text summarization. Natural Language Processing Journal, 4, 100028.
[9]
Morozovskii, D., & Ramanna, S. (2023). Rare words in text summarization. Natural Language Processing Journal, 3, 100014.
[10]
Fan, G., Chen, S., Wu, H., Gao, C., Xiao, J., Xue, X., & Feng, Z. (2023). Dialog summarization for software collaborative platform via tuning pre-trained models. Journal of Systems and Software, 111763.
[11]
Vo, N. N., Vu, Q. T., Vu, N. H., Vu, T. A., Mach, B. D., & Xu, G. (2022). Domain-specific NLP system to support learning path and curriculum design at tech universities. Computers and Education: Artificial Intelligence, 3, 100042.
[12]
Bhardwaj, R., Vaidya, T., & Poria, S. (2022). Towards solving NLP tasks with optimal transport loss. Journal of King Saud University-Computer and Information Sciences, 34(10), 10434-10443.
[13]
Rani, R., & Lobiyal, D. K. (2021). A weighted word embedding based approach for extractive text summarization. Expert Systems with Applications, 186, 115867.
[14]
Belwal, R. C., Rai, S., & Gupta, A. (2021). Text summarization using topic-based vector space model and semantic measure. Information Processing & Management, 58(3), 102536.
[15]
Zhang, D., Xia, X., Yang, Y., Yang, P., Xie, C., Cui, M., & Liu, Q. (2021). A novel word similarity measure method for IoT-enabled healthcare applications. Future Generation Computer Systems, 114, 209-218.

Cited By

View all
  • (2025)Coding Small Group Communication with AI: RNNs and Transformers with ContextSmall Group Research10.1177/10464964251314197Online publication date: 22-Jan-2025

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICIMMI '23: Proceedings of the 5th International Conference on Information Management & Machine Intelligence
November 2023
1215 pages
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 May 2024

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Constrained Application Protocol (CoAP)
  2. IOT
  3. NLP
  4. Request for Comments (RFC)
  5. audio transcription
  6. contextual summarization
  7. data summarization
  8. knowledge based graphs
  9. word embedding methods

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

ICIMMI 2023

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)122
  • Downloads (Last 6 weeks)43
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2025)Coding Small Group Communication with AI: RNNs and Transformers with ContextSmall Group Research10.1177/10464964251314197Online publication date: 22-Jan-2025

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media