Loading [a11y]/accessibility-menu.js
An integrated language identification for code- switched speech using decoded-phonemes and support vector machine | IEEE Conference Publication | IEEE Xplore

An integrated language identification for code- switched speech using decoded-phonemes and support vector machine


Abstract:

Automatic language identification (LID) is a specialized area of Human Language Technology in which the language(s) used in spoken utterances are identified and correctly...Show More

Abstract:

Automatic language identification (LID) is a specialized area of Human Language Technology in which the language(s) used in spoken utterances are identified and correctly classified given a predetermined number of targeted languages. Currently, most multilingual speakers have the ability and tendency for engaging in code-switching - a mixed-language phenomenon that is referred to as the usage of more than one language in utterances. This paper presents the proposed scheme for automatic language identification integrated with an automatic speech recognition system to identify languages used in a mixed-language speech context. The front-end speech recognition system feeds the decoded phonemes into the LID system. We used hidden Markov models to build acoustic models of a combined phoneme set that handles multiple languages within an utterance. A spoken utterance is converted into feature vectors with attributes that represents the statistical occurrences of each acoustic units. A supervised support vector machine (SVM) technique is trained with feature vector sequences of phoneme units. The back-end SVM classifier based on n-gram structures is used to classify/identify the phoneme feature vectors. We conducted experiments with two commonly mixed Northern Sotho and English telephone-based speech corpora. The experimental results showed that, by using shared phonemic vowels in the combined phoneme set, the word error rate (WER) was reduced with 3.6%. Moreover, the proposed approach yields significantly acceptable performance with language identification rate of 85.0% on code-switched speech corpus.
Date of Conference: 16-19 October 2013
Date Added to IEEE Xplore: 02 January 2014
ISBN Information:
Conference Location: Cluj-Napoca, Romania

Contact IEEE to Subscribe

References

References is not available for this document.