Skip to main content

Speech/Music Classification Based on Distributed Evolutionary Fuzzy Logic for Intelligent Audio Coding

  • Conference paper
Book cover Pattern Recognition and Image Analysis (IbPRIA 2007)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 4478))

Included in the following conference series:

  • 2326 Accesses

Abstract

Automatic Speech/Music Discrimination (SMD) has become a research topic of interest in the last years. This paper present a new approach for such goal, which is mainly based on a distributed expert system that incorporates fuzzy rules into its knowledge base. The proposed SMD scheme consists of two stages: 1) features extraction, 2) classification of parameters. Classification is performed by cascading a GMM-based classifier with an Evolutionary Fuzzy Expert (EFE) system. The EFE system improves the accuracy rate provided by the GMM-based classifier taking into account information of current and past audio frames. Testing the kindness of new fuzzy rules for the expert system has a high computacional cost. For that reason, a distributed learning approach based on web services has been implemented.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Saunders, J.: Real-time discrimination of broacast speech/music. In: Proc. IEEE ICASSP’96, Atlanta, USA, pp. 993–996 (1996)

    Google Scholar 

  2. O’Shaughnessy, D.: Speech Communcations: Human and Machine, 2nd edn. IEEE Press, Piscataway (2000)

    MATH  Google Scholar 

  3. Rabiner, L.R., Juang, B.H.: Fundamentals of Speech Recognition. Prentice-Hall, Englewood Cliffs (1993)

    Google Scholar 

  4. Klapuri, A.: Automatic music transcription as we know it today. J. New. Music Research 33, 269–282 (2004)

    Article  Google Scholar 

  5. Greenberg, J.E., Desloge, J.D., Zurek, P.M.: Evaluation of array-processing algorithms for a head-band hearing aid. J. Acoustic Soc. Am. 113, 1646–1657 (2003)

    Article  Google Scholar 

  6. Tancerel, L., Ragot, S., Ruoppila, V.T., Lefebvre, R.: Combined speech and audio coding by discrimination. In: Proc. IEEE Workshop on Speech Coding, pp. 17–20 (2000)

    Google Scholar 

  7. Cordon, O., Herrera, F., Hoffmann, F., Magdalena, L.: Genetic fuzzy systems: Evolutionary tuning and learning of fuzzy knowledge bases. Advances in fuzzy systems - Applications and theory, vol. 19 (2001)

    Google Scholar 

  8. Dote, Y., Ovaska, S.: Industrial Applications of Soft Computing: A review. Proceedings of the IEEE (Special Issue on Industrial Innovations using soft Computing) 89(9) (2001)

    Google Scholar 

  9. Lee, C.C.: Fuzzy Logic in Control Systems: Fuzzy Logic Controller, Part I-II. IEEE Transactions on Systems, Man. and Cybernetics 20(2), 404–435 (1990)

    Article  MATH  Google Scholar 

  10. Magdalena, L., Velasco, J.R.: Fuzzy Rule-Based Controllers that Learn by Evolving their Knowledge Base. In: Herrera, F., Verdegay, J.L. (eds.) Genetics Algorithms and Soft Computing, Physica-Verlag, Heidelberg (1996)

    Google Scholar 

  11. Goldberg, D.E.: Genetic Algorithms in Search, Optimization and Machine Learning. Addison-Wesley, London (1989)

    MATH  Google Scholar 

  12. Alonso, G., Cassati, F., Kuno, H., Machiraju, V.: Web services - Concepts, Architectures and Applications. Springer, Heidelberg (2004)

    MATH  Google Scholar 

  13. Carey, M.J., Parris, E.S., Lloyd-Thomas, H.: A comparison of features for speech, music discrimination. In: Proc. IEEE ICASSP’99, Phoenix, USA, pp. 1432–1435 (1999)

    Google Scholar 

  14. Vinton, M., Robinson, C.: Automated speech/other discrimination for loudness monitoring. In: 118th AES Convention, Barcelona, Spain, vol. 6437, preprint (2005)

    Google Scholar 

  15. Logan, B.: Mel frequency cepstral coefficients for music modelling. In: Proc. Int. Symp. Music Information Retrieval (2000)

    Google Scholar 

  16. Duda, R., Hart, P., Stork, D.: Pattern classification. John Wiley, New York (2000)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Joan Martí José Miguel Benedí Ana Maria Mendonça Joan Serrat

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer Berlin Heidelberg

About this paper

Cite this paper

Muñoz Expósito, J.E., Ruiz Reyes, N., Garcia Galán, S., Vera Candeas, P. (2007). Speech/Music Classification Based on Distributed Evolutionary Fuzzy Logic for Intelligent Audio Coding. In: Martí, J., Benedí, J.M., Mendonça, A.M., Serrat, J. (eds) Pattern Recognition and Image Analysis. IbPRIA 2007. Lecture Notes in Computer Science, vol 4478. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-72849-8_70

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-72849-8_70

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-72848-1

  • Online ISBN: 978-3-540-72849-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics