loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Authors: Asma Trabelsi ; Laurent Werey ; Sébastien Warichet and Emmanuel Helbert

Affiliation: Alcatel-Lucent Enterprise, ALE International, 32, avenue Kléber 92700 Colombes, Paris, France

Keyword(s): Speech Recognition, ASR, Data Sovereignty, Vosk, Whisper.

Abstract: Transcription has becoming an important task on the field of Artificial Intelligence and Machine Learning. Much research has focused on such a field so that we find a lot of paid and open-source ASR solutions. The choose of the best solution is crucial. Open source ones seems to be appropriate especially for companies that would maintain the aspect of data sovereignty. Vosk and Whisper are ASR open-source tools that have been revolutionized this last period. The first idea of this paper is to compare these two solutions in term of Word Error Rate (WER) to conclude who performs best. In the meantime, a lot of models aroused focusing on removing disturbing noises (such as dog barks, child screams, etc) during remote communication. The second idea of the paper is to study the influence of such models applied prior to the transcription service on the quality of the communication transcription. In our study, we focused on voice mail transcription use case.

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 52.14.76.12

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Trabelsi, A.; Werey, L.; Warichet, S. and Helbert, E. (2024). Is Noise Reduction Improving Open-Source ASR Transcription Engines Quality?. In Proceedings of the 16th International Conference on Agents and Artificial Intelligence - Volume 3: ICAART; ISBN 978-989-758-680-4; ISSN 2184-433X, SciTePress, pages 1221-1228. DOI: 10.5220/0012457100003636

@conference{icaart24,
author={Asma Trabelsi. and Laurent Werey. and Sébastien Warichet. and Emmanuel Helbert.},
title={Is Noise Reduction Improving Open-Source ASR Transcription Engines Quality?},
booktitle={Proceedings of the 16th International Conference on Agents and Artificial Intelligence - Volume 3: ICAART},
year={2024},
pages={1221-1228},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012457100003636},
isbn={978-989-758-680-4},
issn={2184-433X},
}

TY - CONF

JO - Proceedings of the 16th International Conference on Agents and Artificial Intelligence - Volume 3: ICAART
TI - Is Noise Reduction Improving Open-Source ASR Transcription Engines Quality?
SN - 978-989-758-680-4
IS - 2184-433X
AU - Trabelsi, A.
AU - Werey, L.
AU - Warichet, S.
AU - Helbert, E.
PY - 2024
SP - 1221
EP - 1228
DO - 10.5220/0012457100003636
PB - SciTePress