Skip to main content

ELM Based Algorithms for Acoustic Template Matching in Home Automation Scenarios: Advancements and Performance Analysis

  • Chapter
  • First Online:
Book cover Recent Advances in Nonlinear Speech Processing

Abstract

Speech and sound recognition in home automation scenarios has been gaining an increasing interest in the last decade. One interesting approach addressed in the literature is based on the template matching paradigm, which is characterized by ease of implementation and independence on large datasets for system training. Moving from a recent contribution of some of the authors, where an Extreme Learning Machine algorithm was proposed and evaluated, a wider performance analysis in diverse operating conditions is provided here, together with some relevant improvements. These are allowed by the employment of supervector features as input, for the first time used with ELMs, up to the authors’ knowledge. As already verified in other application contexts and with different learning systems, this ensures a more robust characterization of the speech segment to be classified, also in presence of mismatch between training and testing data. The accomplished computer simulations confirm the effectiveness of the approach, with F\(_1\)-Measure performance up to 99 % in the multicondition case, and a computational time reduction factor close to 4, with respect to the SVM counterpart.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Angelini, B., Brugnara, F., Falavigna, D., Giuliani, D., Gretter, R., Omologo, M.: Automatic segmentation and labeling of english and italian speech databases. In: Proceedings of Eurospeech, pp. 653–656. Berlin, Germany, 22–25 Sept 1993

    Google Scholar 

  2. Anguera, X.: Information retrieval-based dynamic time warping. In: Proceedings of Interspeech, pp. 1–5. Lyon, France, 25–29 Aug 2013

    Google Scholar 

  3. Chorowski, J., Wang, J., Zurada, J.M.: Review and performance comparison of SVM-and ELM-based classifiers. Neurocomputing 128, 507–516 (2014)

    Article  Google Scholar 

  4. Cortes, C., Vapnik, V.: Support-vector networks. Mach. Learn. 20(3), 273–297 (1995)

    MATH  Google Scholar 

  5. Dileep, A.D., Sekhar, C.C.: Class-specific GMM based intermediate matching kernel for classification of varying length patterns of long duration speech using support vector machines. Speech Commun. 57, 126–143 (2014)

    Article  Google Scholar 

  6. Ganapathiraju, A., Hamaker, J., Picone, J.: Hybrid SVM/HMM architectures for speech recognition. In: Proceedings of ICSLP, pp. 504–507. Beijing, China, 16–20 Oct 2000

    Google Scholar 

  7. Hinton, G., Deng, L., Yu, D., Dahl, G.E., Mohamed, A.r., Jaitly, N., Senior, A., Vanhoucke, V., Nguyen, P., Sainath, T.N., et al.: Deep neural networks for acoustic modeling in speech recognition: the shared views of four research groups. Signal Process. Mag., IEEE 29(6), 82–97 (2012)

    Google Scholar 

  8. Huang, G.B., Zhou, H., Ding, X., Zhang, R.: Extreme learning machine for regression and multiclass classification. IEEE Trans. Syst., Man, Cybern. B 42(2), 513–529 (2012)

    Google Scholar 

  9. Huang, G.B., Zhu, Q.Y., Siew, C.K.: Extreme learning machine: theory and applications. Neurocomputing 70(1), 489–501 (2006)

    Article  Google Scholar 

  10. Jaeger, H.: The “echo state” approach to analysing and training recurrent neural networks. Tech. Rep. 148, German National Research Center for Information Technology, Bonn, Germany (2001)

    Google Scholar 

  11. Kim, C., Seo, K.D.: Robust DTW-based recognition algorithm for hand-held consumer devices. IEEE Trans. Consum. Electron. 51(2), 699–709 (2005)

    Article  MathSciNet  Google Scholar 

  12. Kinnunen, T., Li, H.: An overview of text-independent speaker recognition: from features to supervectors. Speech Commun. 52(1), 12–40 (2010)

    Article  Google Scholar 

  13. Principi, E., Squartini, S., Bonfigli, R., Ferroni, G., Piazza, F.: An integrated system for voice command recognition and emergency detection based on audio signals. Expert Syst. Appl. 42(13), 5668–5683 (2015)

    Article  Google Scholar 

  14. Principi, E., Squartini, S., Cambria, E., Piazza, F.: Acoustic template-matching for automatic emergency state detection: an ELM based algorithm. Neurocomputing 149, 426–434 (2014)

    Article  Google Scholar 

  15. Principi, E., Squartini, S., Piazza, F., Fuselli, D., Bonifazi, M.: A distributed system for recognizing home automation commands and distress calls in the Italian language. In: Proceedings of Interspeech, pp. 2049–2053. Lyon, France, 25–29 Aug 2013

    Google Scholar 

  16. Rabiner, L., Juang, B.H.: Fundamentals of Speech Recognition. Prentice Hall PTR (1993)

    Google Scholar 

  17. Saon, G., Chien, J.T.: Large-vocabulary continuous speech recognition systems: a look at some recent advances. IEEE Signal Process. Mag. 29(6), 18–33 (2012)

    Article  Google Scholar 

  18. Zhang, X., Sun, J., Luo, Z., Li, M.: Confidence Index Dynamic Time Warping for Language-Independent Embedded Speech Recognition. In: Proceedings of ICASSP, pp. 8066–8070. Vancouver, Canada, 26–31 May 2013

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Stefano Squartini .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this chapter

Cite this chapter

della Porta, G., Principi, E., Ferroni, G., Squartini, S., Hussain, A., Piazza, F. (2016). ELM Based Algorithms for Acoustic Template Matching in Home Automation Scenarios: Advancements and Performance Analysis. In: Esposito, A., et al. Recent Advances in Nonlinear Speech Processing. Smart Innovation, Systems and Technologies, vol 48. Springer, Cham. https://doi.org/10.1007/978-3-319-28109-4_16

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-28109-4_16

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-28107-0

  • Online ISBN: 978-3-319-28109-4

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics