loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Sergey Zablotskiy and Maxim Sidorov

Affiliation: Ulm University, Germany

Keyword(s): Russian, LVCSR, Sub-words.

Abstract: Russian is a synthetic language with a large morpheme-per-word ratio and highly inflective nature. These two peculiarities increase the lexicon size for Russian automatic speech recognition (ASR) by tens of times in comparison to that for English covering the same out-of-vocabulary (OOV) rate. The employment of sub-word units is a widely spread state-of-the-art approach to reduce the abundant lexicon and lower the perplexity (PP) of the language model. The choice of sub-word units affects the accuracy of the entire speech recognition system, its performance as well as the complexity of the spoken phrase synthesis. Here, different recognition units are investigated using pocketsphinx-engine while recognizing the vocabulary of several million word forms. A designed text normalization approach is also briefly presented. This rule-based algorithm allows keeping diverse Russian abbreviations and numerals in the language model (LM) and avoiding the statistics distortion. The approach is di rectly applicable and useful for Russian text-to-speech translation as well. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.145.58.169

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Zablotskiy, S. and Sidorov, M. (2014). Russian Sub-Word Based Speech Recognition Using Pocketsphinx Engine. In Proceedings of the 11th International Conference on Informatics in Control, Automation and Robotics (ICINCO 2014) - Volume 1: ASAAHMI; ISBN 978-989-758-040-6; ISSN 2184-2809, SciTePress, pages 840-844. DOI: 10.5220/0005148008400844

@conference{asaahmi14,
author={Sergey Zablotskiy. and Maxim Sidorov.},
title={Russian Sub-Word Based Speech Recognition Using Pocketsphinx Engine},
booktitle={Proceedings of the 11th International Conference on Informatics in Control, Automation and Robotics (ICINCO 2014) - Volume 1: ASAAHMI},
year={2014},
pages={840-844},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005148008400844},
isbn={978-989-758-040-6},
issn={2184-2809},
}

TY - CONF

JO - Proceedings of the 11th International Conference on Informatics in Control, Automation and Robotics (ICINCO 2014) - Volume 1: ASAAHMI
TI - Russian Sub-Word Based Speech Recognition Using Pocketsphinx Engine
SN - 978-989-758-040-6
IS - 2184-2809
AU - Zablotskiy, S.
AU - Sidorov, M.
PY - 2014
SP - 840
EP - 844
DO - 10.5220/0005148008400844
PB - SciTePress