Abstract
Automatic Speech/Music Discrimination (SMD) has become a topic of research interest in recent years. This paper presents a new approach to SMD, based mainly on a distributed expert system that incorporates fuzzy rules into its knowledge base. The proposed SMD scheme consists of two stages: 1) feature extraction and 2) classification of the extracted parameters. Classification is performed by cascading a GMM-based classifier with an Evolutionary Fuzzy Expert (EFE) system. The EFE system improves the accuracy of the GMM-based classifier by taking into account information from the current and past audio frames. Evaluating the quality of new fuzzy rules for the expert system has a high computational cost. For that reason, a distributed learning approach based on web services has been implemented.
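The cascade described in the abstract can be illustrated with a minimal sketch. It is not the authors' EFE system: the per-frame speech probabilities are assumed to come from some GMM-based classifier, and the membership functions, the three hand-written rules, the `history` length, and the name `smooth_decisions` are all hypothetical choices made for illustration. In the paper the rules are learned by an evolutionary procedure; here they are fixed.

```python
from collections import deque


def high(x):
    """Fuzzy membership of a probability in the set 'high' (assumed shape)."""
    return min(1.0, max(0.0, (x - 0.3) / 0.4))


def low(x):
    """Fuzzy membership in 'low', taken as the complement of 'high'."""
    return 1.0 - high(x)


def smooth_decisions(speech_probs, history=5):
    """Label each frame from its own score and the mean of past scores.

    speech_probs: per-frame speech probabilities, assumed to be produced
    by a first-stage GMM classifier (not implemented here).
    """
    past = deque(maxlen=history)
    labels = []
    for p in speech_probs:
        # With no past frames available, fall back to the current score.
        context = sum(past) / len(past) if past else p
        # Rule 1: IF current is high AND context is high THEN speech.
        # Rule 2: IF current is low AND context is low THEN music.
        # Rule 3: IF current and context disagree THEN follow the context.
        speech = max(min(high(p), high(context)), min(low(p), high(context)))
        music = max(min(low(p), low(context)), min(high(p), low(context)))
        labels.append("speech" if speech >= music else "music")
        past.append(p)
    return labels


if __name__ == "__main__":
    # One isolated low score inside a run of speech-like frames.
    print(smooth_decisions([0.9, 0.9, 0.9, 0.1, 0.9, 0.9]))
```

With these assumed rules, an isolated frame whose first-stage score disagrees with its recent context is relabelled to match that context, which is the kind of correction a second stage with access to past frames can make.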
© 2007 Springer Berlin Heidelberg
About this paper
Cite this paper
Muñoz Expósito, J.E., Ruiz Reyes, N., Garcia Galán, S., Vera Candeas, P. (2007). Speech/Music Classification Based on Distributed Evolutionary Fuzzy Logic for Intelligent Audio Coding. In: Martí, J., Benedí, J.M., Mendonça, A.M., Serrat, J. (eds) Pattern Recognition and Image Analysis. IbPRIA 2007. Lecture Notes in Computer Science, vol 4478. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-72849-8_70
DOI: https://doi.org/10.1007/978-3-540-72849-8_70
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-72848-1
Online ISBN: 978-3-540-72849-8
eBook Packages: Computer Science, Computer Science (R0)