CLEAR Evaluation of Acoustic Event Detection and Classification Systems

Temko, Andrey; Malkin, Robert; Zieger, Christian; Macho, Dušan; Nadeu, Climent; Omologo, Maurizio

doi:10.1007/978-3-540-69568-4_29

Andrey Temko¹,
Robert Malkin²,
Christian Zieger³,
Dušan Macho¹,
Climent Nadeu¹ &
…
Maurizio Omologo³

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 4122))

Included in the following conference series:

International Evaluation Workshop on Classification of Events, Activities and Relationships

1608 Accesses
58 Citations

Abstract

In this paper, we present the results of the Acoustic Event Detection (AED) and Classification (AEC) evaluations carried out in February 2006 by the three participant partners from the CHIL project. The primary evaluation task was AED of the testing portions of the isolated sound databases and seminar recordings produced in CHIL. Additionally, a secondary AEC evaluation task was designed using only the isolated sound databases. The set of meeting-room acoustic event classes and the metrics were agreed by the three partners and ELDA was in charge of the scoring task. In this paper, the various systems for the tasks of AED and AEC and their results are presented.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Wang, D., Brown, G.: Computational Auditory Scene Analysis: Principles, Algorithms and Applications. IEEE Press, Los Alamitos (2006)
Google Scholar
CHIL - COMPUTERS IN THE HUMAN INTERACTION LOOP. http://chil.server.de/
Rabiner, L., Juang, B.: Fundamentals of Speech Recognition. Prentice Hall, Englewood Cliffs (1993)
Google Scholar
Schölkopf, B., Smola, A.: Learning with Kernels. MIT Press, Cambridge (2002)
Google Scholar
Temko, A., Macho, D., Nadeu, C., Segura, C.: UPC-TALP Database of Isolated Acoustic Events. Internal UPC report (2005)
Google Scholar
Zieger, C., Omologo, M.: Acoustic Event Detection - ITC-irst AED database. Internal ITC report (2005)
Google Scholar
Casas, J., Stiefelhagen, R., et al.: Multi-camera/multi-microphone system design for continuous room monitoring. CHIL-WP4-D4.1-V2.1-2004-07-08-CO, CHIL Consortium Deliverable D4.1, July (2004)
Google Scholar
Platt, J., et al.: Large Margin DAGs for Multiclass Classification. In: Proc. Advances in Neural Information Processing Systems, vol. 12 , pp. 547–553 (2000)
Google Scholar
Temko, A., Nadeu, C.: Classification of meeting-room acoustic events with Support Vector Machines and Confusion-based Clustering. In: Proc. ICASSP’05, pp. 505–508 (2005)
Google Scholar
Nadeu, C., et al.: On the decorrelation of filter-bank energies in speech recognition. In: Proc. Eurospeech’95, pp. 1381–1384 (1995)
Google Scholar
Reyes-Gomez, M., Ellis, D.: Selection, Parameter Estimation, and Discriminative Training of Hidden Markov Models for General Audio Modeling. In: Proc. ICME’03 (2003)
Google Scholar
Leggetter, C., Woodland, P.: Speaker Adaptation of Continuous Density HMMs using Multivariate Regression. In: Proc. ICSLP’94 (1994)
Google Scholar
Gales, M.: Maximum Likelihood Linear Transformations for HMM-based Speech Recognition. In: Computer Speech and Language (1998)
Google Scholar

Download references

Author information

Authors and Affiliations

TALP Research Center, UPC Campus Nord, Ed. D5, Jordi Girona 1-3, 08034 Barcelona, Spain
Andrey Temko, Dušan Macho & Climent Nadeu
interACT, Carnegie Mellon University, 407 S. Craig St, Pittsburgh PA 15213, USA
Robert Malkin
ITC-irst, via Sommarive 18, 38050, Povo (TN), Italy
Christian Zieger & Maurizio Omologo

Authors

Andrey Temko
View author publications
You can also search for this author in PubMed Google Scholar
Robert Malkin
View author publications
You can also search for this author in PubMed Google Scholar
Christian Zieger
View author publications
You can also search for this author in PubMed Google Scholar
Dušan Macho
View author publications
You can also search for this author in PubMed Google Scholar
Climent Nadeu
View author publications
You can also search for this author in PubMed Google Scholar
Maurizio Omologo
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Rainer Stiefelhagen John Garofolo

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Temko, A., Malkin, R., Zieger, C., Macho, D., Nadeu, C., Omologo, M. (2007). CLEAR Evaluation of Acoustic Event Detection and Classification Systems. In: Stiefelhagen, R., Garofolo, J. (eds) Multimodal Technologies for Perception of Humans. CLEAR 2006. Lecture Notes in Computer Science, vol 4122. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-69568-4_29

Download citation

DOI: https://doi.org/10.1007/978-3-540-69568-4_29
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-69567-7
Online ISBN: 978-3-540-69568-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics