ELM Based Algorithms for Acoustic Template Matching in Home Automation Scenarios: Advancements and Performance Analysis

della Porta, Giulio; Principi, Emanuele; Ferroni, Giacomo; Squartini, Stefano; Hussain, Amir; Piazza, Francesco

doi:10.1007/978-3-319-28109-4_16

Part of the book series: Smart Innovation, Systems and Technologies (SIST, volume 48)

Abstract

Speech and sound recognition in home automation scenarios has attracted increasing interest over the last decade. One approach studied in the literature is based on the template matching paradigm, which is easy to implement and does not depend on large datasets for system training. Building on a recent contribution by some of the authors, in which an Extreme Learning Machine (ELM) algorithm was proposed and evaluated, this work provides a wider performance analysis under diverse operating conditions, together with some relevant improvements. These improvements are enabled by using supervector features as input, which, to the best of the authors’ knowledge, are used here with ELMs for the first time. As already verified in other application contexts and with different learning systems, supervectors provide a more robust characterization of the speech segment to be classified, even in the presence of mismatch between training and testing data. Computer simulations confirm the effectiveness of the approach, with an F\(_1\)-measure of up to 99% in the multicondition case and a computational time reduction by a factor close to 4 with respect to the SVM counterpart.
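
For readers unfamiliar with the ELM family mentioned in the abstract, the following is a minimal sketch of a generic single-hidden-layer ELM classifier, not the authors' implementation. It assumes each utterance has already been mapped to a fixed-length vector (the role a supervector plays); the supervector extraction itself is not shown, and the synthetic Gaussian data, the class name `SimpleELM`, the helper `f1_score_binary`, and all parameter values are illustrative assumptions.

```python
import numpy as np


class SimpleELM:
    """Generic ELM sketch: random, fixed hidden layer and closed-form output weights."""

    def __init__(self, n_hidden=50, reg=1e-3, seed=0):
        self.n_hidden = n_hidden  # number of hidden units (assumed value)
        self.reg = reg            # ridge regularization strength (assumed value)
        self.rng = np.random.default_rng(seed)

    def _hidden(self, X):
        # Hidden activations; W and b are drawn once and never trained.
        return np.tanh(X @ self.W + self.b)

    def fit(self, X, y):
        X = np.asarray(X, dtype=float)
        y = np.asarray(y)
        self.classes_ = np.unique(y)
        # Targets coded as +1 for the true class and -1 otherwise.
        T = np.where(y[:, None] == self.classes_[None, :], 1.0, -1.0)
        d = X.shape[1]
        self.W = self.rng.normal(size=(d, self.n_hidden)) / np.sqrt(d)
        self.b = self.rng.normal(size=self.n_hidden)
        H = self._hidden(X)
        # Regularized least squares: (H^T H + reg * I) beta = H^T T
        A = H.T @ H + self.reg * np.eye(self.n_hidden)
        self.beta = np.linalg.solve(A, H.T @ T)
        return self

    def predict(self, X):
        H = self._hidden(np.asarray(X, dtype=float))
        return self.classes_[np.argmax(H @ self.beta, axis=1)]


def f1_score_binary(y_true, y_pred, positive=1):
    """F1 = 2*TP / (2*TP + FP + FN) for the chosen positive class."""
    tp = np.sum((y_pred == positive) & (y_true == positive))
    fp = np.sum((y_pred == positive) & (y_true != positive))
    fn = np.sum((y_pred != positive) & (y_true == positive))
    denom = 2 * tp + fp + fn
    return float(2 * tp / denom) if denom else 0.0


def run_demo(seed=0):
    # Synthetic stand-ins for fixed-length feature vectors (not real speech data).
    rng = np.random.default_rng(seed)
    X0 = rng.normal(loc=-2.0, size=(100, 8))
    X1 = rng.normal(loc=2.0, size=(100, 8))
    X = np.vstack([X0, X1])
    y = np.array([0] * 100 + [1] * 100)
    idx = rng.permutation(len(y))
    train, test = idx[:150], idx[150:]
    model = SimpleELM().fit(X[train], y[train])
    return f1_score_binary(y[test], model.predict(X[test]))


if __name__ == "__main__":
    print("F1 on synthetic data:", run_demo())
```

Because only the output weights are solved for, and in closed form, training reduces to one linear system. This is the usual reason ELMs are considered cheap to train; the speed-up reported in the abstract refers to the authors' own experiments, not to this sketch.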

Author information

Authors and Affiliations

Department of Information Engineering, Università Politecnica delle Marche, Via Brecce Bianche, 60131, Ancona, Italy
Giulio della Porta, Emanuele Principi, Giacomo Ferroni, Stefano Squartini & Francesco Piazza
Department of Computing Science and Mathematics, University of Stirling, Stirling, FK9 4LA, UK
Amir Hussain

Corresponding author

Correspondence to Stefano Squartini.

Editor information

Editors and Affiliations

Department of Psychology, Seconda Università di Napoli and IIASS, Caserta, Italy
Anna Esposito
(Pompeu Fabra University), Escola Superior Politècnica Tecnocampus, Mataró, Spain
Marcos Faundez-Zanuy
sezione di Napoli Osservatorio, Istituto Nazionale di Geofisica e Vulcan, Napoli, Italy
Antonietta M. Esposito
Department of Psychology, Seconda Università di Napoli and IIASS, Caserta, Italy
Gennaro Cordasco
Boulevard Dolez, University of Mons, TCTS Lab.31, Mons, Belgium
Thomas Drugman
Data and Signal Processing Research Group, University of Vic, Vic, Spain
Jordi Solé-Casals
NeuroLab, Università degli Studi "Mediterranea" di, Reggio Calabria, Italy
Francesco Carlo Morabito

About this chapter

Cite this chapter

della Porta, G., Principi, E., Ferroni, G., Squartini, S., Hussain, A., Piazza, F. (2016). ELM Based Algorithms for Acoustic Template Matching in Home Automation Scenarios: Advancements and Performance Analysis. In: Esposito, A., et al. Recent Advances in Nonlinear Speech Processing. Smart Innovation, Systems and Technologies, vol 48. Springer, Cham. https://doi.org/10.1007/978-3-319-28109-4_16

DOI: https://doi.org/10.1007/978-3-319-28109-4_16
Published: 23 January 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-28107-0
Online ISBN: 978-3-319-28109-4
eBook Packages: Engineering, Engineering (R0)
