The GTM-UVIGO System for Albayzin 2018 Speech-to-Text Evaluation

Docío-Fernández, Laura; García-Mateo, Carmen

doi:10.21437/IberSPEECH.2018-58

The GTM-UVIGO System for Albayzin 2018 Speech-to-Text Evaluation

Laura Docío-Fernández, Carmen García-Mateo

This paper describes the Speech-to-Text system developed by the Multimedia Technologies Group (GTM) of the atlanTTic research center at the University of Vigo, for the Albayzin Speech-to-Text Challenge (S2T) organized in the Iberspeech 2018 conference. The large vocabulary automatic speech recognition system is built using the Kaldi toolkit. It uses an hybrid Deep Neural Network - Hidden Markov Model (DNN-HMM) for acoustic modeling, and a rescoring of a trigram based word-lattices, obtained in a first decoding stage, with a fourgram language model or a language model based on a recurrent neural network. The system was evaluated only on the open set training condition.

doi: 10.21437/IberSPEECH.2018-58

Cite as: Docío-Fernández, L., García-Mateo, C. (2018) The GTM-UVIGO System for Albayzin 2018 Speech-to-Text Evaluation. Proc. IberSPEECH 2018, 277-280, doi: 10.21437/IberSPEECH.2018-58

@inproceedings{dociofernandez18_iberspeech,
  author={Laura Docío-Fernández and Carmen García-Mateo},
  title={{The GTM-UVIGO System for Albayzin 2018 Speech-to-Text Evaluation}},
  year=2018,
  booktitle={Proc. IberSPEECH 2018},
  pages={277--280},
  doi={10.21437/IberSPEECH.2018-58}
}