Automatic Turn-Level Language Identification for Code-Switched Spanish–English Dialog

Ramanarayanan, Vikram; Pugh, Robert; Qian, Yao; Suendermann-Oeft, David

doi:10.1007/978-981-13-9443-0_5

Part of the book series: Lecture Notes in Electrical Engineering (LNEE, volume 579)

Abstract

We examine the efficacy of text- and speech-based features for turn-level language identification in code-switched human-human dialog. We extract a variety of character- and word-based text features and pass them to multiple learners, including conditional random fields, logistic regressors and deep neural networks. Our best-performing text system significantly outperforms a majority-vote baseline. We further use the popular i-Vector approach to extract features from the speech signal and show that it outperforms both a traditional spectral feature-based front-end and the majority-vote baseline.
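To make the text-feature idea concrete, here is a minimal sketch of character- and word-based n-gram counting for a single turn, together with a majority-vote baseline. It is an illustration only, not the chapter's implementation: the function names, the n-gram ranges, the toy turns and the label names ("eng", "spa", "cs") are all assumptions introduced here.

```python
from collections import Counter


def char_ngrams(turn, n_min=1, n_max=3):
    """Count character n-grams (lengths n_min..n_max) in one dialog turn."""
    # Pad with spaces so that word boundaries become part of the n-grams.
    text = f" {turn.lower()} "
    counts = Counter()
    for n in range(n_min, n_max + 1):
        for i in range(len(text) - n + 1):
            counts[text[i:i + n]] += 1
    return counts


def word_ngrams(turn, n_max=2):
    """Count word n-grams (lengths 1..n_max) in one dialog turn."""
    tokens = turn.lower().split()
    counts = Counter()
    for n in range(1, n_max + 1):
        for i in range(len(tokens) - n + 1):
            counts[" ".join(tokens[i:i + n])] += 1
    return counts


def majority_baseline(train_labels):
    """Return a predictor that always outputs the most frequent training label."""
    majority = Counter(train_labels).most_common(1)[0][0]
    return lambda turn: majority


# Toy, invented data; the label names are hypothetical.
train_turns = ["i think so", "yeah that works", "no sé qué hacer", "pero i don't know"]
train_labels = ["eng", "eng", "spa", "cs"]

features = [char_ngrams(t) + word_ngrams(t) for t in train_turns]
predict = majority_baseline(train_labels)
```

Feature dictionaries like these could then be vectorized and passed to any of the learners named in the abstract. The baseline ignores its input entirely, which is what makes it a useful floor for comparison.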

Notes

1. Crutching refers to language learners relying on one language to fill in gaps in vocabulary or knowledge in the other [5].
2. http://bangortalk.org.uk/.
3. We did experiment with oversampling the code-switched class as well, but this resulted in a degradation in performance. This could probably be due to the relatively few samples in the code-switched class to begin with.
4. In order to roughly test this hypothesis, we ran experiments wherein we used the relatively cleaner Fisher corpora (of both Spanish and English speech) for both training and testing. In this case, the F1 score obtained was 0.96, highlighting both the mismatch between the Fisher and Bangor corpora as well as the effect of noise in the Bangor corpus. Of course, there is the possibility that the 2-class classification of English and Spanish turns from monolingual turns in code-switched speech might pose more challenges than LID in non-code-switched speech. Nevertheless, while this test was not a systematic one (and hence reported only as a footnote), this clearly points toward the effect of dataset quality on system performance.
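The oversampling experiment mentioned in note 3 can be pictured with the sketch below. This page does not say which oversampling method was used, so the code shows plain random oversampling (duplicating minority-class examples) as a simple stand-in. The function name, the seed parameter and the "cs" label are assumptions made for illustration.

```python
import random
from collections import Counter


def oversample_minority(samples, labels, target_label, seed=0):
    """Duplicate random examples of target_label until it matches the largest class.

    Plain random oversampling, used here only as an illustrative stand-in.
    """
    rng = random.Random(seed)
    counts = Counter(labels)
    deficit = max(counts.values()) - counts[target_label]
    pool = [s for s, y in zip(samples, labels) if y == target_label]
    if not pool or deficit <= 0:
        # Nothing to duplicate, or the class is already the largest.
        return list(samples), list(labels)
    extra = [rng.choice(pool) for _ in range(deficit)]
    return list(samples) + extra, list(labels) + [target_label] * deficit


# Toy, invented data: three English turns and one code-switched turn.
turns = ["turn a", "turn b", "turn c", "turn d"]
labels = ["eng", "eng", "eng", "cs"]
new_turns, new_labels = oversample_minority(turns, labels, "cs")
```

With very few minority examples, the duplicated turns are near-copies of one another, which is consistent with the explanation the note offers for the lack of improvement.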

References

Milroy L, Muysken P (1995) One speaker, two languages: cross-disciplinary perspectives on code-switching. Cambridge University Press
Wei L (2000) The bilingualism reader. Psychology Press
MacSwan J (2004) Code switching and grammatical theory. In: The handbook of bilingualism, vol 46, p 283
Myers-Scotton C (2006) Codeswitching with English: types of switching, types of communities. World Engl Crit Concepts Linguist 4(3):214
O'Connor BH, Crawford LJ (2015) An art of being in between: the promise of hybrid language practices. In: Research on preparing inservice teachers to work effectively with emergent bilinguals. Emerald Group Publishing Limited, pp 149–173
Wheeler RS (2008) Code-switching. In: Educational leadership
Jiang Y-LB, García GE, Willis AI (2014) Code-mixing as a bilingual instructional strategy. Biling Res J 37(3):311–326
Ahmad BH, Jusoff K (2009) Teachers code-switching in classroom instructions for low English proficient learners. Engl Lang Teach 2(2):49
Littlewood W, Yu B (2011) First language and target language in the foreign language classroom. Lang Teach 44(1):64–77
Solorio T, Blair E, Maharjan S, Bethard S, Diab M, Ghoneim M, Hawwari A, AlGhamdi F, Hirschberg J, Chang A et al (2014) Overview for the first shared task on language identification in code-switched data. In: Proceedings of the first workshop on computational approaches to code switching. Citeseer, pp 62–72
Bali K, Vyas Y, Sharma J, Choudhury M (2014) I am borrowing ya mixing? An analysis of English-Hindi code mixing in Facebook. In: Proceedings of the first workshop on computational approaches to code switching, EMNLP 2014, p 116
Molina G, Rey-Villamizar N, Solorio T, AlGhamdi F, Ghoneim M, Hawwari A, Diab M (2016) Overview for the second shared task on language identification in code-switched data. In: EMNLP 2016, p 40
Solorio T, Liu Y (2008) Learning to predict code-switching points. In: Proceedings of the conference on empirical methods in natural language processing. Association for Computational Linguistics, pp 973–981
Barman U, Das A, Wagner J, Foster J (2014) Code mixing: a challenge for language identification in the language of social media. In: EMNLP 2014, vol 13
King B, Abney SP (2013) Labeling the languages of words in mixed-language documents using weakly supervised methods. In: HLT-NAACL, pp 1110–1119
Shirvani R, Piergallini M, Gautam GS, Chouikha M (2016) The Howard University system submission for the shared task in language identification in Spanish-English codeswitching. In: Proceedings of the second workshop on computational approaches to code switching, pp 116–120
Samih Y, Maharjan S, Attia M, Kallmeyer L, Solorio T (2016) Multilingual code-switching identification via LSTM recurrent neural networks. In: EMNLP 2016, p 50
Li Y, Yu Y, Fung P (2012) A Mandarin-English code-switching corpus. In: LREC, pp 2515–2519
Lyu D-C, Tan T-P, Chng E-S, Li H (2015) Mandarin-English code-switching speech corpus in South-East Asia: SEAME. Lang Resour Eval 49(3):581–600
Chan JY, Ching P, Lee T (2005) Development of a Cantonese-English code-mixing speech corpus. In: INTERSPEECH, pp 1533–1536
Dey A, Fung P (2014) A Hindi-English code-switching corpus. In: LREC, pp 2410–2413
Donnelly K, Deuchar M (2011) The Bangor Autoglosser: a multilingual tagger for conversational text. ITA11. Wrexham, Wales
Ramanarayanan V, Suendermann-Oeft D (2017) Jee haan, I’d like both, por favor: elicitation of a code-switched corpus of Hindi-English and Spanish-English human-machine dialog. Proc Interspeech 2017:47–51
Vu NT, Lyu DC, Weiner J, Telaar D, Schlippe T, Blaicher F, Chng E-S, Schultz T, Li H (2012) A first speech recognition system for Mandarin-English code-switch conversational speech. In: 2012 IEEE international conference on acoustics, speech and signal processing (ICASSP). IEEE, pp 4889–4892
Yeh C-F, Sun L-C, Huang C-Y, Lee L-S (2011) Bilingual acoustic modeling with state mapping and three-stage adaptation for transcribing unbalanced code-mixed lectures. In: 2011 IEEE international conference on acoustics, speech and signal processing (ICASSP). IEEE, pp 5020–5023
Eyben F, Weninger F, Gross F, Schuller B (2013) Recent developments in openSMILE, the Munich open-source multimedia feature extractor. In: Proceedings of the 21st ACM international conference on multimedia. ACM, pp 835–838
Qian Y, Evanini K, Wang X, Suendermann-Oeft D, Pugh RA, Lange PL, Molloy HR, Soong FK (2017) Improving sub-phone modeling for better native language identification with non-native English speech. Proc Interspeech 2017:2586–2590
Povey D, Ghoshal A, Boulianne G, Burget L, Glembek O, Goel N, Hannemann M, Motlicek P, Qian Y, Schwarz P et al (2011) The Kaldi speech recognition toolkit. In: IEEE 2011 workshop on automatic speech recognition and understanding, no. EPFL-CONF-192584. IEEE Signal Processing Society
Dehak N, Kenny PJ, Dehak R, Dumouchel P, Ouellet P (2011) Front-end factor analysis for speaker verification. IEEE Trans Audio Speech Lang Process 19(4):788–798
Martinez D, Plchot O, Burget L, Glembek O, Matějka P (2011) Language recognition in iVectors space. In: Twelfth annual conference of the international speech communication association
Cieri C, Miller D, Walker K (2004) The Fisher corpus: a resource for the next generations of speech-to-text. In: LREC, vol 4, pp 69–71
Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP (2002) SMOTE: synthetic minority over-sampling technique. J Artif Intell Res 16:321–357

Author information

Authors and Affiliations

Educational Testing Service R&D, 90 New Montgomery St, #1500, San Francisco, CA, USA
Vikram Ramanarayanan, Robert Pugh, Yao Qian & David Suendermann-Oeft

Corresponding author

Correspondence to Vikram Ramanarayanan.

Editor information

Editors and Affiliations

Universidad Politécnica de Madrid, Madrid, Spain
Luis Fernando D'Haro
Nanyang Technological University, Singapore, Singapore
Rafael E. Banchs
Department of Electrical and Computer Engineering, National University of Singapore, Singapore, Singapore
Haizhou Li

About this paper

Cite this paper

Ramanarayanan, V., Pugh, R., Qian, Y., Suendermann-Oeft, D. (2019). Automatic Turn-Level Language Identification for Code-Switched Spanish–English Dialog. In: D'Haro, L., Banchs, R., Li, H. (eds) 9th International Workshop on Spoken Dialogue System Technology. Lecture Notes in Electrical Engineering, vol 579. Springer, Singapore. https://doi.org/10.1007/978-981-13-9443-0_5

DOI: https://doi.org/10.1007/978-981-13-9443-0_5
Published: 25 September 2019
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-9442-3
Online ISBN: 978-981-13-9443-0
eBook Packages: Literature, Cultural and Media Studies (R0)