Sound Recognition System Using Spiking and MLP Neural Networks

Cerezuela-Escudero, Elena; Jimenez-Fernandez, Angel; Paz-Vicente, Rafael; Dominguez-Morales, Juan P.; Dominguez-Morales, Manuel J.; Linares-Barranco, Alejandro

doi:10.1007/978-3-319-44781-0_43

Elena Cerezuela-Escudero¹⁶,
Angel Jimenez-Fernandez¹⁶,
Rafael Paz-Vicente¹⁶,
Juan P. Dominguez-Morales¹⁶,
Manuel J. Dominguez-Morales¹⁶ &
…
Alejandro Linares-Barranco¹⁶

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 9887))

Included in the following conference series:

International Conference on Artificial Neural Networks

3986 Accesses
2 Citations
1 Altmetric

Abstract

In this paper, we explore the capabilities of a sound classification system that combines a Neuromorphic Auditory System for feature extraction and an artificial neural network for classification. Two models of neural network have been used: Multilayer Perceptron Neural Network and Spiking Neural Network. To compare their accuracies, both networks have been developed and trained to recognize pure tones in presence of white noise. The spiking neural network has been implemented in a FPGA device. The neuromorphic auditory system that is used in this work produces a form of representation that is analogous to the spike outputs of the biological cochlea. Both systems are able to distinguish the different sounds even in the presence of white noise. The recognition system based in a spiking neural networks has better accuracy, above 91 %, even when the sound has white noise with the same power.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Pickles, J.O.: An Introduction to the Physiology of Hearing. Emerald, London (2012)
Google Scholar
Guerrero-turrubiates, J.J., Gonzalez-reyna, S.E., Ledesma-orozco, S.E., Avina-cervantes, J.G.: Pitch estimation for musical note recognition using artificial neural networks. In: International Conference on Electronics, Communications and Computers (CONIELECOMP), pp. 53–58 (2014)
Google Scholar
Nielsen, A.B., Hansen, L.K., Kjems, U.: Pitch based sound classification. In: 2006 Proceedings of International Conference on Acoustics, Speech and Signal Processing (ICASSP 2006), vol. 3, pp. 788–791 (2006)
Google Scholar
Pishdadian, F., Nelson, J.K.: On the transcription of monophonic melodies in an instance-based pitch classification scenario. In: Proceedings of 2013 IEEE Digital Signal Processing and Signal Processing Education Meeting, DSP/SPE 2013, pp. 222–227 (2013)
Google Scholar
Iwasa, K., Kugler, M., Kuroyanagi, S., Iwata, A.: A sound localization and recognition system using pulsed neural networks on FPGA. In: International Joint Conference on Neural Networks, IJCNN 2007, pp. 902–907. IEEE, August 2007
Google Scholar
Newton, M.J., Smith, L.S.: Biologically-inspired neural coding of sound onset for a musical sound classification task. In: Proceedings of the International Joint Conference on Neural Networks, pp. 1386–1393 (2011)
Google Scholar
Liu, S.C.: Event-Based Neuromorphic Systems. Wiley (2015)
Google Scholar
Waibel, A., Hanazawa, T., Hinton, G., Shiano, K., Lang, K.J.: Phoneme recognition using time-delay neural networks. IEEE Trans. Acousti. Speech Sig. Process. 37(3), 328–339 (1989)
Article Google Scholar
Gerstner, W., Kistler, W.M.: Spiking Neuron Models: Single Neurons, Populations, Plasticity. Cambridge University Press, Cambridge (2002)
Book MATH Google Scholar
Robert, A., Eriksson, J.L.: A composite model of the auditory periphery for simulating responses to complex sounds. J. Acoust. Soc. Am. 106(4), 1852–1864 (1999)
Article Google Scholar
Eriksson, J.L., Robert, A.: The representation of pure tones and noise in a model of cochlear nucleus neurons. J. Acoust. Soc. Am. 106(4), 1865–1879 (1999)
Article Google Scholar
Jaeger, H.: Tutorial on training recurrent neural networks, covering BPPT, RTRL, EKF and the “echo state network” approach. GMD Report 159, German National Research Center for Information Technology (2002)
Google Scholar
Mendes, J.A., Robson, R.R., Labidi S., Barros A.K.: Subvocal Speech recognition based on EMG signal using independent component analysis and neural network MLP. In: Congress on Image and Signal Processing, CISP 2008, vol. 1, pp. 221–224 (2008)
Google Scholar
Indiveri, G., Chicca, E., Douglas, R.: A VLSI array of low-power spiking neurons and bistables synapses with spike-timig dependant plasticity. IEEE Trans. Neural Netw. 17(1), 211–221 (2006)
Article Google Scholar
Thorpe, S.J., Brilhault, A., Perez-Carrasco, J.A.: Suggestions for a biologically inspired spiking retina using order-based coding. In: 2010 IEEE International Symposium on Circuits and Systems Nano-Bio Circuit Fabrics and Systems, ISCAS 2010, pp. 265–268 (2010)
Google Scholar
Mahowald, M.: VLSI analogs of neuronal visual processing: a synthesis of form and function, Ph.D. dissertation, California Institute of Technology, Pasadena (1992)
Google Scholar
Boahen, K.A.: Communicating Neuronal Ensembles between Neuromorphic Chips. Neuromorphic Systems. Kluwer Academic Publishers, Boston (1998)
Google Scholar
Lyon, R.F., Mead, C.: An analog electronic cochlea. IEEE Trans. Acoust. Speech Sig. Process. 36, 1119–1134 (1988)
Article MATH Google Scholar
Wen, B.: Boahen, K.A silicon cochlea with active coupling. IEEE Trans. Biomed. Circ. Syst. 3, 444–455 (2009)
Article Google Scholar
Hamilton, T.J., Jin, C., van Schaik, A., Tapson, J.: An active 2-d silicon cochlea. IEEE Trans. Biomed. Circ. Syst. 2, 30–43 (2008)
Article Google Scholar
Liu, S-C., Van Schaik, A., Minch, B.A., Delbruck, T.: Event-based 64-channel binaural silicon cochlea with Q enhancement mechanisms. In: Proceedings of 2010 IEEE International Symposium on Circuits and Systems (ISCAS), 30 May−2 June 2010, pp. 2027–2030 (2010)
Google Scholar
Leong, M.P., Jin, C., Leong, P.: An FPGA-based electronic cochlea. EURASIP J. Appl. Sig. Process. 2003(7), 629–638 (2003)
Article Google Scholar
Dundur, R., Latte, M.V., Kulkarni, S.Y., Venkatesha, M.K.: Digital filter for cochlear implant implemented on a field-programmable gate array. Int. J. Electr. Comput. Energ. Electron. Commun. Eng. 2(7), 468–472 (2008)
Google Scholar
Thakur, C.S., Hamilton, T.J., Tapson, J., van Schaik, A., Lyon, R.F.: FPGA Implementation of the CAR model of the Cochlea. In: IEEE International Symposium on Circuits and Systems (ISCAS), pp. 1853−1856 (2014)
Google Scholar
Domínguez-Morales, M., Jimenez-Fernandez, A., Cerezuela-Escudero, E., Paz-Vicente, R., Linares-Barranco, A., Jimenez, G.: On the designing of spikes band-pass filters for FPGA. In: Honkela, T. (ed.) ICANN 2011, Part II. LNCS, vol. 6792, pp. 389–396. Springer, Heidelberg (2011)
Chapter Google Scholar
Jimenez-Fernandez, A., Linares-Barranco, A., Paz-Vicente, R., Jiménez, G., Civit, A.: Building blocks for spike-based signal processing. In: International Joint Conference on Neural Networks, IJCNN, pp. 1–8 (2010)
Google Scholar
Gomez-Rodriguez, F., Paz, R., Miro, L., Linares-Barranco, A., Jimenez, G., Civit, A.: Two hardware implementations of the exhaustive synthetic AER generation method. In: Cabestany, J., Prieto, A.G., Sandoval, F. (eds.) IWANN 2005. LNCS, vol. 3512, pp. 534–540. Springer, Heidelberg (2005)
Chapter Google Scholar
Cerezuela-Escudero, E., Dominguez-Morales, M.J., Jiménez-Fernández, A., Paz-Vicente, R., Linares-Barranco, A., Jiménez-Moreno, G.: Spikes monitors for FPGAs, an experimental comparative study. In: Rojas, I., Joya, G., Gabestany, J. (eds.) IWANN 2013, Part I. LNCS, vol. 7902, pp. 179–188. Springer, Heidelberg (2013)
Chapter Google Scholar
Rios-Navarro, A., Jimenez-Fernandez, A., Cerezuela-Escudero, E., Rivas, M., Jimenez, G., Linares-Barranco, A.: Live demostration: real-time motor rotation frequency detection by spike-based visual and auditory sensory fusion on AER and FPGA. In: Wermter, S., Weber, C., Duch, W., Honkela, T., Koprinkova-Hristova, P., Magg, S., Palm, G., Villa, A.E.P. (eds.) ICANN 2014. LNCS, vol. 8681, pp. 847–848. Springer, Switzerland (2014)
Google Scholar
Cerezuela-Escudero, E., Jimenez-Fernandez, A., Paz-Vicente, R., Dominguez-Morales, M., Linares-Barranco, A., Jimenez-Moreno, G.: Musical notes classification with Neuromorphic Auditory System using FPGA and a Convolutional Spiking Network. In: Proceedings of the 2015 International Joint Conference on Neural Networks (IJCNN), pp. 1−7 (2015)
Google Scholar
Lass, N.: Contemporary Issues in Experimental Phonetics. Elsevier (2012)
Google Scholar

Download references

Acknowledgements

This work is supported by the Spanish government grant BIOSENSE (TEC2012-37868-C04-02) and by the excellence project from Andalusian Council MINERVA (P12-TIC-1300), both with support from the European Regional Development Fund.

Author information

Authors and Affiliations

Robotic and Technology of Computers Lab, Department of Architecture and Technology of Computers, University of Seville, Seville, Spain
Elena Cerezuela-Escudero, Angel Jimenez-Fernandez, Rafael Paz-Vicente, Juan P. Dominguez-Morales, Manuel J. Dominguez-Morales & Alejandro Linares-Barranco

Authors

Elena Cerezuela-Escudero
View author publications
You can also search for this author in PubMed Google Scholar
Angel Jimenez-Fernandez
View author publications
You can also search for this author in PubMed Google Scholar
Rafael Paz-Vicente
View author publications
You can also search for this author in PubMed Google Scholar
Juan P. Dominguez-Morales
View author publications
You can also search for this author in PubMed Google Scholar
Manuel J. Dominguez-Morales
View author publications
You can also search for this author in PubMed Google Scholar
Alejandro Linares-Barranco
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Elena Cerezuela-Escudero .

Editor information

Editors and Affiliations

University of Lausanne, Lausanne, Switzerland
Alessandro E.P. Villa
University of Lausanne, Lausanne, Switzerland
Paolo Masulli
Universitat Politécnica de Catalunya, Terrrassa, Spain
Antonio Javier Pons Rivero

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Cerezuela-Escudero, E., Jimenez-Fernandez, A., Paz-Vicente, R., Dominguez-Morales, J.P., Dominguez-Morales, M.J., Linares-Barranco, A. (2016). Sound Recognition System Using Spiking and MLP Neural Networks. In: Villa, A., Masulli, P., Pons Rivero, A. (eds) Artificial Neural Networks and Machine Learning – ICANN 2016. ICANN 2016. Lecture Notes in Computer Science(), vol 9887. Springer, Cham. https://doi.org/10.1007/978-3-319-44781-0_43

Download citation

DOI: https://doi.org/10.1007/978-3-319-44781-0_43
Published: 13 August 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-44780-3
Online ISBN: 978-3-319-44781-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics