Grey wolf optimization-extreme learning machine for automatic spoken language identification

Albadr, Musatafa Abbas Abbood; Tiun, Sabrina; Ayob, Masri; Nazri, Mohd Zakree Ahmad; AL-Dhief, Fahad Taha

doi:10.1007/s11042-023-14473-3

Grey wolf optimization-extreme learning machine for automatic spoken language identification

Published: 08 February 2023

Volume 82, pages 27165–27191, (2023)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Musatafa Abbas Abbood Albadr ORCID: orcid.org/0000-0003-2062-688X¹,
Sabrina Tiun¹,
Masri Ayob¹,
Mohd Zakree Ahmad Nazri¹ &
…
Fahad Taha AL-Dhief²

237 Accesses
5 Citations
1 Altmetric
Explore all metrics

Abstract

Natural language classification and determination based on a particular content and dataset is carried out using Spoken Language Identification (LID) which typically involves the extraction of valuable elements in a mature data processing procedure whereby the regular LID features had been developed using the Mel Frequency Cepstral Coefficient (MFCC), Shifted Delta Coefficient (SDC), Gaussian Mixture Model (GMM) and an i-vector framework. However, there remains a need for optimization in terms of the learning process so as to allow for all the knowledge embedded in the extracted features to be captured completely. A powerful machine learning algorithm known as Extreme Learning Machine (ELM) is used for conducting regression and classification and can train single hidden layer neural networks effectively. Yet, ELM’s learning process remains under-optimized owing to the entrenched random weights selection in the input hidden layer. Based on the standard feature extraction, this current study chooses ELM as the learning model for LID. An optimized method known as the Enhanced Self-Adjusting-ELM (ESA-ELM) has been chosen as a benchmark with enhancements via the adoption of an alternate optimization approach i.e., Grey Wolf Optimisation (GWO) rather than Enhanced Ameliorated Teaching Learning-Based Optimization (EATLBO) to ensure higher performance. Ultimately, this enhanced version of the ESA-ELM is referred to as a Grey Wolf Optimisation-Extreme Learning Machine (GWO-ELM). The results generation is carried out based on LID using the exact benchmark dataset that was derived from eight separate languages. The results indicated that the GWO-ELM LID has a much superior performance than the ESA-ELM LID with respective accuracies of 100.00% for the former and merely 96.25% for the latter.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Spoken language identification based on optimised genetic algorithm–extreme learning machine approach

Article 13 July 2019

Spoken Language Identification Based on Particle Swarm Optimisation–Extreme Learning Machine Approach

Article 18 March 2020

Mel-Frequency Cepstral Coefficient Features Based on Standard Deviation and Principal Component Analysis for Language Identification Systems

Article 16 July 2021

References

Albadr MAA, Tiun S (2020) Spoken language identification based on particle swarm optimisation–extreme learning machine approach. Circuits Syst Signal Process 39(9):4596–4622
Google Scholar
Albadr MAA, Tiun S, al-Dhief FT, Sammour MAM (2018) Spoken language identification based on the enhanced self-adjusting extreme learning machine approach. PLoS One 13(4):e0194770
Google Scholar
Albadr MAA, Tiun S, Ayob M, al-Dhief FT (2019) Spoken language identification based on optimised genetic algorithm–extreme learning machine approach. Int J Speech Technol 22(3):711–727
Google Scholar
Albadr MAA, Tiun S, Ayob M, al-Dhief FT, Omar K, Hamzah FA (2020) Optimised genetic algorithm-extreme learning machine approach for automatic COVID-19 detection. PLoS One 15(12):e0242899
Google Scholar
Albadr MA, Tiun S, Ayob M, al-Dhief F (2020) Genetic algorithm based on natural selection theory for optimization problems. Symmetry 12(11):1758
Google Scholar
Albadr MAA et al (2021) Extreme learning machine for automatic language identification utilizing emotion speech data. In: 2021 international conference on electrical, communication, and computer engineering (ICECCE). IEEE
Albadr MAA, Tiun S, Ayob M, Mohammed M, al-Dhief FT (2021) Mel-frequency cepstral coefficient features based on standard deviation and principal component analysis for language identification systems. Cogn Comput 13(5):1136–1153
Google Scholar
Albadr MA et al (2022. In Press) Speech emotion recognition using optimized genetic algorithm-extreme learning machine. Multimed Tools Appl 81:23963–23989
Albadra MAA, Tiuna S (2017) Extreme learning machine: a review. Int J Appl Eng Res 12(14):4610–4623
Google Scholar
AL-Dhief FT et al (2020) Voice pathology detection using machine learning technique. In: 2020 IEEE 5th international symposium on telecommunication technologies (ISTT). IEEE
Al-Dhief FT et al (2020) A survey of voice pathology surveillance systems based on internet of things and machine learning algorithms. IEEE Access 8:64514–64533
Google Scholar
Al-Dhief FT et al (2021) Voice pathology detection and classification by adopting online sequential extreme learning machine. IEEE Access 9:77293–77306
Google Scholar
AL-Dhief FT et al (2021) Voice pathology detection using support vector machine based on different number of voice signals. In: 2021 26th IEEE Asia-Pacific conference on communications (APCC). IEEE, pp 1–6
Alexander V, Annamalai P (2016) An Elitist Genetic Algorithm Based Extreme Learning Machine. In: Computational Intelligence, Cyber Security and Computational Models. Springer, pp 301–309
Google Scholar
Ambikairajah E, Li H, Wang L, Yin B, Sethu V (2011) Language identification: a tutorial. IEEE Circuits Syst Mag 11(2):82–108
Google Scholar
Ben-Reuven E, Goldberger J (2016) A semisupervised approach for language identification based on ladder networks. arXiv preprint arXiv:1604.00317
Deng C, Huang GB, Xu J, Tang JX (2015) Extreme learning machines: new trends and applications. Sci China Inf Sci 58(2):1–16
Google Scholar
Faris H, Aljarah I, al-Betar MA, Mirjalili S (2018) Grey wolf optimizer: a review of recent variants and applications. Neural Comput & Applic 30(2):413–435
Google Scholar
Faris H, Mirjalili S, Aljarah I (2019) Automatic selection of hidden neurons and weights in neural networks using grey wolf optimizer based on a hybrid encoding scheme. Int J Mach Learn Cybern 10(10):2901–2920
Google Scholar
Ganapathy S et al (2014) Robust language identification using convolutional neural network features. In: INTERSPEECH
Gao R et al (2019) Extreme learning machine ensemble for CSI based device-free indoor localization. In: 2019 28th wireless and optical communications conference (WOCC). IEEE
Garg A, Gupta V, Jindal M (2014) A survey of language identification techniques and applications. J Emerg Technol Web Intell 6(4):388–400
Google Scholar
Gazeau V, Varol C (2018) Automatic spoken language recognition with neural networks. Int J Inf Technol Comput Sci (IJITCS) 10(8):11–17
Google Scholar
Hafen RP, Henry MJ (2012) Speech information retrieval: a review. Multimed Syst 18(6):499–518
Google Scholar
Han K, Yu D, Tashev I (2014) Speech emotion recognition using deep neural network and extreme learning machine. In: Fifteenth Annual Conference of the International Speech Communication Association
Huang G-B, Zhu Q-Y, Siew C-K (2006) Extreme learning machine: theory and applications. Neurocomputing 70(1–3):489–501
Google Scholar
Huang G-B, Zhu Q-Y, Siew C-K (2006) Extreme learning machine: theory and applications. Neurocomputing 70(1):489–501
Google Scholar
Huang G-B, Chen L, Siew CK (2006) Universal approximation using incremental constructive feedforward networks with random hidden nodes. IEEE Trans Neural Netw 17(4):879–892
Google Scholar
Huang G-B et al (2011) Extreme learning machine for regression and multiclass classification. IEEE Trans Syst Man Cybern Part B Cybern 42(2):513–529
Google Scholar
Jiang B, Song Y, Wei S, Liu JH, McLoughlin IV, Dai LR (2014) Deep bottleneck features for spoken language identification. PLoS One 9(7):e100795
Google Scholar
Kaya H, Karpov AA (2018) Efficient and effective strategies for cross-corpus acoustic emotion recognition. Neurocomputing 275:1028–1034
Google Scholar
Lan Y, Hu Z, Soh YC, Huang GB (2013) An extreme learning machine approach for speaker recognition. Neural Comput & Applic 22(3–4):417–425
Google Scholar
Lee KA et al (2016) The 2015 NIST language recognition evaluation: the shared view of I2R, Fantastic4 and SingaMS
Li J et al (2015) LSTM time and frequency recurrence for automatic speech recognition. In: Automatic speech recognition and understanding (ASRU), 2015 IEEE workshop on. IEEE
Liang N-Y et al (2006) A fast and accurate online sequential learning algorithm for feedforward networks. IEEE Trans Neural Netw 17(6):1411–1423
Google Scholar
Lopez-Moreno I, Gonzalez-Dominguez J, Martinez D, Plchot O, Gonzalez-Rodriguez J, Moreno PJ (2016) On the use of deep feedforward neural networks for automatic language identification. Comput Speech Lang 40:46–59
Google Scholar
Malik H, Roy N (2019) Extreme Learning Machine-Based Image Classification Model Using Handwritten Digit Database. In: Applications of Artificial Intelligence Techniques in Engineering. Springer, pp 607–618
Google Scholar
Minhas R, Baradarani A, Seifzadeh S, Jonathan Wu QM (2010) Human action recognition using extreme learning machine based on visual vocabularies. Neurocomputing 73(10):1906–1917
Google Scholar
Mirjalili S, Mirjalili SM, Lewis A (2014) Grey wolf optimizer. Adv Eng Softw 69:46–61
Google Scholar
Mohammed AA, Minhas R, Jonathan Wu QM, Sid-Ahmed MA (2011) Human face recognition based on multidimensional PCA and extreme learning machine. Pattern Recogn 44(10):2588–2597
MATH Google Scholar
Muthusamy H, Polat K, Yaacob S (2015) Improved emotion recognition using Gaussian mixture model and extreme learning machine in speech and glottal signals. Math Probl Eng 2015:1–13
Google Scholar
Nayak P et al (2016) Comparison of modified teaching–learning-based optimization and extreme learning machine for classification of multiple power signal disturbances. Neural Comput & Applic 27(7):2107–2122
Google Scholar
Naz A, Javaid N, Javaid S (2018) Enhanced recurrent extreme learning machine using gray wolf optimization for load forecasting. In: 2018 IEEE 21st international multi-topic conference (INMIC). IEEE
Niu P, Ma Y, Li M, Yan S, Li G (2016) A kind of parameters self-adjusting extreme learning machine. Neural Process Lett 44(3):813–830
Google Scholar
Pal M, Maxwell AE, Warner TA (2013) Kernel-based extreme learning machine for remote-sensing image classification. Remote Sens Lett 4(9):853–862
Google Scholar
Peng Y, Wang S, Long X, Lu BL (2015) Discriminative graph regularized extreme learning machine and its application to face recognition. Neurocomputing 149:340–353
Google Scholar
Singh G, Sharma S, Kumar V, Kaur M, Baz M, Masud M (2021) Spoken language identification using deep learning. Comput Intell Neurosci 2021:1–12
Google Scholar
Sokolova M, Japkowicz N, Szpakowicz S (2006) Beyond accuracy, F-score and ROC: a family of discriminant measures for performance evaluation. In: Australasian joint conference on artificial intelligence. Springer
van Heeswijk M (2015) Advances in extreme learning machines
Wang Y, Cao F, Yuan Y (2011) A study on effectiveness of extreme learning machine. Neurocomputing 74(16):2483–2490
Google Scholar
Wang M, Chen H, Li H, Cai Z, Zhao X, Tong C, Li J, Xu X (2017) Grey wolf optimization evolving kernel extreme learning machine: application to bankruptcy prediction. Eng Appl Artif Intell 63:54–68
Google Scholar
Wang Z et al (2019) Breast Cancer detection using extreme learning machine based on feature fusion with CNN deep features. IEEE Access
Wang W, Song W, Chen C, Zhang Z, Xin Y (2019) I-vector features and deep neural network modeling for language recognition. Procedia Comput Sci 147:36–43
Google Scholar
Xu J et al (2015) Regularized minimum class variance extreme learning machine for language recognition. EURASIP J Audio Speech Music Process 2015(1):22
MathSciNet Google Scholar
Yang Z, Zhang T, Zhang D (2016) A novel algorithm with differential evolution and coral reef optimization for extreme learning machine training. Cogn Neurodyn 10(1):73–83
MathSciNet Google Scholar
Zazo R, Lozano-Diez A, Gonzalez-Dominguez J, T. Toledano D, Gonzalez-Rodriguez J (2016) Language identification in short utterances using long short-term memory (LSTM) recurrent neural networks. PLoS One 11(1):e0146917
Google Scholar
Zhou Z, Wang C, Zhu Z, Wang Y, Yang D (2019) Sliding mode control based on a hybrid grey-wolf-optimized extreme learning machine for robot manipulators. Optik 185:364–380
Google Scholar

Download references

Acknowledgments

This project was funded by the Universiti Kebangsaan Malaysia under Dana Impak Perdana grant (Research code: GUP-2020-063).

Author information

Authors and Affiliations

CAIT, Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia, Bangi, Selangor, Malaysia
Musatafa Abbas Abbood Albadr, Sabrina Tiun, Masri Ayob & Mohd Zakree Ahmad Nazri
School of Electrical Engineering, Department of Communication Engineering, Universiti Teknologi Malaysia, UTM Johor Bahru, Johor Bahru, Johor, Malaysia
Fahad Taha AL-Dhief

Authors

Musatafa Abbas Abbood Albadr
View author publications
You can also search for this author in PubMed Google Scholar
Sabrina Tiun
View author publications
You can also search for this author in PubMed Google Scholar
Masri Ayob
View author publications
You can also search for this author in PubMed Google Scholar
Mohd Zakree Ahmad Nazri
View author publications
You can also search for this author in PubMed Google Scholar
Fahad Taha AL-Dhief
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Musatafa Abbas Abbood Albadr.

Ethics declarations

Conflict of interest

The authors declare no conflict of interest.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Albadr, M.A.A., Tiun, S., Ayob, M. et al. Grey wolf optimization-extreme learning machine for automatic spoken language identification. Multimed Tools Appl 82, 27165–27191 (2023). https://doi.org/10.1007/s11042-023-14473-3

Download citation

Received: 19 April 2021
Revised: 18 March 2022
Accepted: 31 January 2023
Published: 08 February 2023
Issue Date: July 2023
DOI: https://doi.org/10.1007/s11042-023-14473-3

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Grey wolf optimization-extreme learning machine for automatic spoken language identification

Abstract

Access this article

Similar content being viewed by others

Spoken language identification based on optimised genetic algorithm–extreme learning machine approach

Spoken Language Identification Based on Particle Swarm Optimisation–Extreme Learning Machine Approach

Mel-Frequency Cepstral Coefficient Features Based on Standard Deviation and Principal Component Analysis for Language Identification Systems

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Grey wolf optimization-extreme learning machine for automatic spoken language identification

Abstract

Access this article

Similar content being viewed by others

Spoken language identification based on optimised genetic algorithm–extreme learning machine approach

Spoken Language Identification Based on Particle Swarm Optimisation–Extreme Learning Machine Approach

Mel-Frequency Cepstral Coefficient Features Based on Standard Deviation and Principal Component Analysis for Language Identification Systems

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation