Skip to main content
Log in

Grey wolf optimization-extreme learning machine for automatic spoken language identification

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Natural language classification and determination based on a particular content and dataset is carried out using Spoken Language Identification (LID) which typically involves the extraction of valuable elements in a mature data processing procedure whereby the regular LID features had been developed using the Mel Frequency Cepstral Coefficient (MFCC), Shifted Delta Coefficient (SDC), Gaussian Mixture Model (GMM) and an i-vector framework. However, there remains a need for optimization in terms of the learning process so as to allow for all the knowledge embedded in the extracted features to be captured completely. A powerful machine learning algorithm known as Extreme Learning Machine (ELM) is used for conducting regression and classification and can train single hidden layer neural networks effectively. Yet, ELM’s learning process remains under-optimized owing to the entrenched random weights selection in the input hidden layer. Based on the standard feature extraction, this current study chooses ELM as the learning model for LID. An optimized method known as the Enhanced Self-Adjusting-ELM (ESA-ELM) has been chosen as a benchmark with enhancements via the adoption of an alternate optimization approach i.e., Grey Wolf Optimisation (GWO) rather than Enhanced Ameliorated Teaching Learning-Based Optimization (EATLBO) to ensure higher performance. Ultimately, this enhanced version of the ESA-ELM is referred to as a Grey Wolf Optimisation-Extreme Learning Machine (GWO-ELM). The results generation is carried out based on LID using the exact benchmark dataset that was derived from eight separate languages. The results indicated that the GWO-ELM LID has a much superior performance than the ESA-ELM LID with respective accuracies of 100.00% for the former and merely 96.25% for the latter.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15

Similar content being viewed by others

References

  1. Albadr MAA, Tiun S (2020) Spoken language identification based on particle swarm optimisation–extreme learning machine approach. Circuits Syst Signal Process 39(9):4596–4622

    Google Scholar 

  2. Albadr MAA, Tiun S, al-Dhief FT, Sammour MAM (2018) Spoken language identification based on the enhanced self-adjusting extreme learning machine approach. PLoS One 13(4):e0194770

    Google Scholar 

  3. Albadr MAA, Tiun S, Ayob M, al-Dhief FT (2019) Spoken language identification based on optimised genetic algorithm–extreme learning machine approach. Int J Speech Technol 22(3):711–727

    Google Scholar 

  4. Albadr MAA, Tiun S, Ayob M, al-Dhief FT, Omar K, Hamzah FA (2020) Optimised genetic algorithm-extreme learning machine approach for automatic COVID-19 detection. PLoS One 15(12):e0242899

    Google Scholar 

  5. Albadr MA, Tiun S, Ayob M, al-Dhief F (2020) Genetic algorithm based on natural selection theory for optimization problems. Symmetry 12(11):1758

    Google Scholar 

  6. Albadr MAA et al (2021) Extreme learning machine for automatic language identification utilizing emotion speech data. In: 2021 international conference on electrical, communication, and computer engineering (ICECCE). IEEE

  7. Albadr MAA, Tiun S, Ayob M, Mohammed M, al-Dhief FT (2021) Mel-frequency cepstral coefficient features based on standard deviation and principal component analysis for language identification systems. Cogn Comput 13(5):1136–1153

    Google Scholar 

  8. Albadr MA et al (2022. In Press) Speech emotion recognition using optimized genetic algorithm-extreme learning machine. Multimed Tools Appl 81:23963–23989

  9. Albadra MAA, Tiuna S (2017) Extreme learning machine: a review. Int J Appl Eng Res 12(14):4610–4623

    Google Scholar 

  10. AL-Dhief FT et al (2020) Voice pathology detection using machine learning technique. In: 2020 IEEE 5th international symposium on telecommunication technologies (ISTT). IEEE

  11. Al-Dhief FT et al (2020) A survey of voice pathology surveillance systems based on internet of things and machine learning algorithms. IEEE Access 8:64514–64533

    Google Scholar 

  12. Al-Dhief FT et al (2021) Voice pathology detection and classification by adopting online sequential extreme learning machine. IEEE Access 9:77293–77306

    Google Scholar 

  13. AL-Dhief FT et al (2021) Voice pathology detection using support vector machine based on different number of voice signals. In: 2021 26th IEEE Asia-Pacific conference on communications (APCC). IEEE, pp 1–6

  14. Alexander V, Annamalai P (2016) An Elitist Genetic Algorithm Based Extreme Learning Machine. In: Computational Intelligence, Cyber Security and Computational Models. Springer, pp 301–309

    Google Scholar 

  15. Ambikairajah E, Li H, Wang L, Yin B, Sethu V (2011) Language identification: a tutorial. IEEE Circuits Syst Mag 11(2):82–108

    Google Scholar 

  16. Ben-Reuven E, Goldberger J (2016) A semisupervised approach for language identification based on ladder networks. arXiv preprint arXiv:1604.00317

  17. Deng C, Huang GB, Xu J, Tang JX (2015) Extreme learning machines: new trends and applications. Sci China Inf Sci 58(2):1–16

    Google Scholar 

  18. Faris H, Aljarah I, al-Betar MA, Mirjalili S (2018) Grey wolf optimizer: a review of recent variants and applications. Neural Comput & Applic 30(2):413–435

    Google Scholar 

  19. Faris H, Mirjalili S, Aljarah I (2019) Automatic selection of hidden neurons and weights in neural networks using grey wolf optimizer based on a hybrid encoding scheme. Int J Mach Learn Cybern 10(10):2901–2920

    Google Scholar 

  20. Ganapathy S et al (2014) Robust language identification using convolutional neural network features. In: INTERSPEECH

  21. Gao R et al (2019) Extreme learning machine ensemble for CSI based device-free indoor localization. In: 2019 28th wireless and optical communications conference (WOCC). IEEE

  22. Garg A, Gupta V, Jindal M (2014) A survey of language identification techniques and applications. J Emerg Technol Web Intell 6(4):388–400

    Google Scholar 

  23. Gazeau V, Varol C (2018) Automatic spoken language recognition with neural networks. Int J Inf Technol Comput Sci (IJITCS) 10(8):11–17

    Google Scholar 

  24. Hafen RP, Henry MJ (2012) Speech information retrieval: a review. Multimed Syst 18(6):499–518

    Google Scholar 

  25. Han K, Yu D, Tashev I (2014) Speech emotion recognition using deep neural network and extreme learning machine. In: Fifteenth Annual Conference of the International Speech Communication Association

  26. Huang G-B, Zhu Q-Y, Siew C-K (2006) Extreme learning machine: theory and applications. Neurocomputing 70(1–3):489–501

    Google Scholar 

  27. Huang G-B, Zhu Q-Y, Siew C-K (2006) Extreme learning machine: theory and applications. Neurocomputing 70(1):489–501

    Google Scholar 

  28. Huang G-B, Chen L, Siew CK (2006) Universal approximation using incremental constructive feedforward networks with random hidden nodes. IEEE Trans Neural Netw 17(4):879–892

    Google Scholar 

  29. Huang G-B et al (2011) Extreme learning machine for regression and multiclass classification. IEEE Trans Syst Man Cybern Part B Cybern 42(2):513–529

    Google Scholar 

  30. Jiang B, Song Y, Wei S, Liu JH, McLoughlin IV, Dai LR (2014) Deep bottleneck features for spoken language identification. PLoS One 9(7):e100795

    Google Scholar 

  31. Kaya H, Karpov AA (2018) Efficient and effective strategies for cross-corpus acoustic emotion recognition. Neurocomputing 275:1028–1034

    Google Scholar 

  32. Lan Y, Hu Z, Soh YC, Huang GB (2013) An extreme learning machine approach for speaker recognition. Neural Comput & Applic 22(3–4):417–425

    Google Scholar 

  33. Lee KA et al (2016) The 2015 NIST language recognition evaluation: the shared view of I2R, Fantastic4 and SingaMS

  34. Li J et al (2015) LSTM time and frequency recurrence for automatic speech recognition. In: Automatic speech recognition and understanding (ASRU), 2015 IEEE workshop on. IEEE

  35. Liang N-Y et al (2006) A fast and accurate online sequential learning algorithm for feedforward networks. IEEE Trans Neural Netw 17(6):1411–1423

    Google Scholar 

  36. Lopez-Moreno I, Gonzalez-Dominguez J, Martinez D, Plchot O, Gonzalez-Rodriguez J, Moreno PJ (2016) On the use of deep feedforward neural networks for automatic language identification. Comput Speech Lang 40:46–59

    Google Scholar 

  37. Malik H, Roy N (2019) Extreme Learning Machine-Based Image Classification Model Using Handwritten Digit Database. In: Applications of Artificial Intelligence Techniques in Engineering. Springer, pp 607–618

    Google Scholar 

  38. Minhas R, Baradarani A, Seifzadeh S, Jonathan Wu QM (2010) Human action recognition using extreme learning machine based on visual vocabularies. Neurocomputing 73(10):1906–1917

    Google Scholar 

  39. Mirjalili S, Mirjalili SM, Lewis A (2014) Grey wolf optimizer. Adv Eng Softw 69:46–61

    Google Scholar 

  40. Mohammed AA, Minhas R, Jonathan Wu QM, Sid-Ahmed MA (2011) Human face recognition based on multidimensional PCA and extreme learning machine. Pattern Recogn 44(10):2588–2597

    MATH  Google Scholar 

  41. Muthusamy H, Polat K, Yaacob S (2015) Improved emotion recognition using Gaussian mixture model and extreme learning machine in speech and glottal signals. Math Probl Eng 2015:1–13

    Google Scholar 

  42. Nayak P et al (2016) Comparison of modified teaching–learning-based optimization and extreme learning machine for classification of multiple power signal disturbances. Neural Comput & Applic 27(7):2107–2122

    Google Scholar 

  43. Naz A, Javaid N, Javaid S (2018) Enhanced recurrent extreme learning machine using gray wolf optimization for load forecasting. In: 2018 IEEE 21st international multi-topic conference (INMIC). IEEE

  44. Niu P, Ma Y, Li M, Yan S, Li G (2016) A kind of parameters self-adjusting extreme learning machine. Neural Process Lett 44(3):813–830

    Google Scholar 

  45. Pal M, Maxwell AE, Warner TA (2013) Kernel-based extreme learning machine for remote-sensing image classification. Remote Sens Lett 4(9):853–862

    Google Scholar 

  46. Peng Y, Wang S, Long X, Lu BL (2015) Discriminative graph regularized extreme learning machine and its application to face recognition. Neurocomputing 149:340–353

    Google Scholar 

  47. Singh G, Sharma S, Kumar V, Kaur M, Baz M, Masud M (2021) Spoken language identification using deep learning. Comput Intell Neurosci 2021:1–12

    Google Scholar 

  48. Sokolova M, Japkowicz N, Szpakowicz S (2006) Beyond accuracy, F-score and ROC: a family of discriminant measures for performance evaluation. In: Australasian joint conference on artificial intelligence. Springer

  49. van Heeswijk M (2015) Advances in extreme learning machines

  50. Wang Y, Cao F, Yuan Y (2011) A study on effectiveness of extreme learning machine. Neurocomputing 74(16):2483–2490

    Google Scholar 

  51. Wang M, Chen H, Li H, Cai Z, Zhao X, Tong C, Li J, Xu X (2017) Grey wolf optimization evolving kernel extreme learning machine: application to bankruptcy prediction. Eng Appl Artif Intell 63:54–68

    Google Scholar 

  52. Wang Z et al (2019) Breast Cancer detection using extreme learning machine based on feature fusion with CNN deep features. IEEE Access

  53. Wang W, Song W, Chen C, Zhang Z, Xin Y (2019) I-vector features and deep neural network modeling for language recognition. Procedia Comput Sci 147:36–43

    Google Scholar 

  54. Xu J et al (2015) Regularized minimum class variance extreme learning machine for language recognition. EURASIP J Audio Speech Music Process 2015(1):22

    MathSciNet  Google Scholar 

  55. Yang Z, Zhang T, Zhang D (2016) A novel algorithm with differential evolution and coral reef optimization for extreme learning machine training. Cogn Neurodyn 10(1):73–83

    MathSciNet  Google Scholar 

  56. Zazo R, Lozano-Diez A, Gonzalez-Dominguez J, T. Toledano D, Gonzalez-Rodriguez J (2016) Language identification in short utterances using long short-term memory (LSTM) recurrent neural networks. PLoS One 11(1):e0146917

    Google Scholar 

  57. Zhou Z, Wang C, Zhu Z, Wang Y, Yang D (2019) Sliding mode control based on a hybrid grey-wolf-optimized extreme learning machine for robot manipulators. Optik 185:364–380

    Google Scholar 

Download references

Acknowledgments

This project was funded by the Universiti Kebangsaan Malaysia under Dana Impak Perdana grant (Research code: GUP-2020-063).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Musatafa Abbas Abbood Albadr.

Ethics declarations

Conflict of interest

The authors declare no conflict of interest.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Albadr, M.A.A., Tiun, S., Ayob, M. et al. Grey wolf optimization-extreme learning machine for automatic spoken language identification. Multimed Tools Appl 82, 27165–27191 (2023). https://doi.org/10.1007/s11042-023-14473-3

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-023-14473-3

Keywords

Navigation