IIITH-ILSC Speech Database for Indain Language Identification

Kumar Vuddagiri, Ravi; Gurugubelli, Krishna; Jain, Priyam; Vydana, Hari Krishna; Kumar Vuppala, Anil

doi:10.21437/SLTU.2018-12

IIITH-ILSC Speech Database for Indain Language Identification

Ravi Kumar Vuddagiri, Krishna Gurugubelli, Priyam Jain, Hari Krishna Vydana, Anil Kumar Vuppala

This work focuses on the development of speech data comprising 23 Indian languages for developing language identification (LID) systems. Large data is a pre-requisite for developing state-of-the-art LID systems. With this motivation, the task of developing multilingual speech corpus for Indian languages has been initiated. This paper describes the composition of the data and the performances of various LID systems developed using this data. In this paper, Mel frequency cepstral feature representation is used for language identification. In this work, various state-of-the-art LID systems are developed using i-vectors, deep neural network (DNN) and deep neural network with attention (DNN-WA) models. The performance of the LID system is observed in terms of the equal error rate for i-vector, DNN and DNN-WA is 17.77%, 17.95%, and 15.18% respectively. Deep neural network with attention model shows a better performance over i-vector and DNN models.

doi: 10.21437/SLTU.2018-12

Cite as: Kumar Vuddagiri, R., Gurugubelli, K., Jain, P., Vydana, H.K., Kumar Vuppala, A. (2018) IIITH-ILSC Speech Database for Indain Language Identification. Proc. 6th Workshop on Spoken Language Technologies for Under-Resourced Languages (SLTU 2018), 56-60, doi: 10.21437/SLTU.2018-12

@inproceedings{kumarvuddagiri18b_sltu,
  author={Ravi {Kumar Vuddagiri} and Krishna Gurugubelli and Priyam Jain and Hari Krishna Vydana and Anil {Kumar Vuppala}},
  title={{IIITH-ILSC Speech Database for Indain Language Identification}},
  year=2018,
  booktitle={Proc. 6th Workshop on Spoken Language Technologies for Under-Resourced Languages (SLTU 2018)},
  pages={56--60},
  doi={10.21437/SLTU.2018-12}
}