Speech/Music Classification Based on Distributed Evolutionary Fuzzy Logic for Intelligent Audio Coding

Muñoz Expósito, J. E.; Ruiz Reyes, N.; Garcia Galán, S.; Vera Candeas, P.

doi:10.1007/978-3-540-72849-8_70

J. E. Muñoz Expósito¹,
N. Ruiz Reyes¹,
S. Garcia Galán¹ &
…
P. Vera Candeas¹

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 4478))

Included in the following conference series:

Iberian Conference on Pattern Recognition and Image Analysis

2326 Accesses

Abstract

Automatic Speech/Music Discrimination (SMD) has become a research topic of interest in the last years. This paper present a new approach for such goal, which is mainly based on a distributed expert system that incorporates fuzzy rules into its knowledge base. The proposed SMD scheme consists of two stages: 1) features extraction, 2) classification of parameters. Classification is performed by cascading a GMM-based classifier with an Evolutionary Fuzzy Expert (EFE) system. The EFE system improves the accuracy rate provided by the GMM-based classifier taking into account information of current and past audio frames. Testing the kindness of new fuzzy rules for the expert system has a high computacional cost. For that reason, a distributed learning approach based on web services has been implemented.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Saunders, J.: Real-time discrimination of broacast speech/music. In: Proc. IEEE ICASSP’96, Atlanta, USA, pp. 993–996 (1996)
Google Scholar
O’Shaughnessy, D.: Speech Communcations: Human and Machine, 2nd edn. IEEE Press, Piscataway (2000)
MATH Google Scholar
Rabiner, L.R., Juang, B.H.: Fundamentals of Speech Recognition. Prentice-Hall, Englewood Cliffs (1993)
Google Scholar
Klapuri, A.: Automatic music transcription as we know it today. J. New. Music Research 33, 269–282 (2004)
Article Google Scholar
Greenberg, J.E., Desloge, J.D., Zurek, P.M.: Evaluation of array-processing algorithms for a head-band hearing aid. J. Acoustic Soc. Am. 113, 1646–1657 (2003)
Article Google Scholar
Tancerel, L., Ragot, S., Ruoppila, V.T., Lefebvre, R.: Combined speech and audio coding by discrimination. In: Proc. IEEE Workshop on Speech Coding, pp. 17–20 (2000)
Google Scholar
Cordon, O., Herrera, F., Hoffmann, F., Magdalena, L.: Genetic fuzzy systems: Evolutionary tuning and learning of fuzzy knowledge bases. Advances in fuzzy systems - Applications and theory, vol. 19 (2001)
Google Scholar
Dote, Y., Ovaska, S.: Industrial Applications of Soft Computing: A review. Proceedings of the IEEE (Special Issue on Industrial Innovations using soft Computing) 89(9) (2001)
Google Scholar
Lee, C.C.: Fuzzy Logic in Control Systems: Fuzzy Logic Controller, Part I-II. IEEE Transactions on Systems, Man. and Cybernetics 20(2), 404–435 (1990)
Article MATH Google Scholar
Magdalena, L., Velasco, J.R.: Fuzzy Rule-Based Controllers that Learn by Evolving their Knowledge Base. In: Herrera, F., Verdegay, J.L. (eds.) Genetics Algorithms and Soft Computing, Physica-Verlag, Heidelberg (1996)
Google Scholar
Goldberg, D.E.: Genetic Algorithms in Search, Optimization and Machine Learning. Addison-Wesley, London (1989)
MATH Google Scholar
Alonso, G., Cassati, F., Kuno, H., Machiraju, V.: Web services - Concepts, Architectures and Applications. Springer, Heidelberg (2004)
MATH Google Scholar
Carey, M.J., Parris, E.S., Lloyd-Thomas, H.: A comparison of features for speech, music discrimination. In: Proc. IEEE ICASSP’99, Phoenix, USA, pp. 1432–1435 (1999)
Google Scholar
Vinton, M., Robinson, C.: Automated speech/other discrimination for loudness monitoring. In: 118th AES Convention, Barcelona, Spain, vol. 6437, preprint (2005)
Google Scholar
Logan, B.: Mel frequency cepstral coefficients for music modelling. In: Proc. Int. Symp. Music Information Retrieval (2000)
Google Scholar
Duda, R., Hart, P., Stork, D.: Pattern classification. John Wiley, New York (2000)
Google Scholar

Download references

Author information

Authors and Affiliations

Telecommunication Engineering Department, University of Jaén, Polytechnic School, C/ Alfonso X el Sabio 28, 23700 Linares, Jaén, Spain
J. E. Muñoz Expósito, N. Ruiz Reyes, S. Garcia Galán & P. Vera Candeas

Authors

J. E. Muñoz Expósito
View author publications
You can also search for this author in PubMed Google Scholar
N. Ruiz Reyes
View author publications
You can also search for this author in PubMed Google Scholar
S. Garcia Galán
View author publications
You can also search for this author in PubMed Google Scholar
P. Vera Candeas
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Joan Martí José Miguel Benedí Ana Maria Mendonça Joan Serrat

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Muñoz Expósito, J.E., Ruiz Reyes, N., Garcia Galán, S., Vera Candeas, P. (2007). Speech/Music Classification Based on Distributed Evolutionary Fuzzy Logic for Intelligent Audio Coding. In: Martí, J., Benedí, J.M., Mendonça, A.M., Serrat, J. (eds) Pattern Recognition and Image Analysis. IbPRIA 2007. Lecture Notes in Computer Science, vol 4478. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-72849-8_70

Download citation

DOI: https://doi.org/10.1007/978-3-540-72849-8_70
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-72848-1
Online ISBN: 978-3-540-72849-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics