TUT Acoustic Event Detection System 2007

Heittola, Toni; Klapuri, Anssi

doi:10.1007/978-3-540-68585-2_35

TUT Acoustic Event Detection System 2007

Toni Heittola¹ &
Anssi Klapuri¹

Conference paper

1276 Accesses
4 Citations

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 4625))

Abstract

This paper describes a system used in acoustic event detection task of the CLEAR 2007 evaluation. The objective of the task is to detect acoustic events (door slam, steps, paper wrapping etc.) using acoustic data from a multiple microphone set up in the meeting room environment. A system based on hidden Markov models and multi-channel audio data was implemented. Mel-Frequency Cepstral Coefficients are used to represent the power spectrum of the acoustic signal. Fully-connected three-state hidden Markov models are trained for 12 acoustic events and one-state models are trained for speech, silence, and unknown events.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Temko, A., Malkin, R., Zieger, C., Macho, D., Nadeu, C., Omologo, M.: CLEAR Evaluation of Acoustic Event Detection and Classification Systems. In: Stiefelhagen, R., Garofolo, J.S. (eds.) CLEAR 2006. LNCS, vol. 4122, pp. 311–322. Springer, Heidelberg (2007)
Chapter Google Scholar
Temko, A., Nadeu, C.: Classification of acoustic events using SVM-based clustering schemes. Pattern Recognition 39(4), 682–694 (2006)
Article MATH Google Scholar
Gaunard, P., Mubikangiey, C., Couvreur, C., Fontaine, V.: Automatic Classification of Environmental Noise Events by Hidden Markov Models. Applied Acoustics 54(3), 187–206 (1998)
Article Google Scholar
Eronen, A., Tuomi, J., Klapuri, A., Fagerlund, S., Sorsa, T., Lorho, G., Huopaniemi, J.: Audio-based context recognition. IEEE Transactions on Audio, Speech, and Language Processing 14(1), 321–329 (2006)
Article Google Scholar
Rabiner, L., Juang, B.H.: Fundamentals of Speech Recognition. PTR Prentice-Hall Inc, New Jersey (1993)
Google Scholar
CLEAR: AED Evaluation Plan (2007), http://isl.ira.uka.de/clear07/download=CLEAR_2007_AED_EvaluationPlan.pdf
NIST: Spring (RT-05S) Rich Transcription Meeting Recognition Evaluation Plan (2005), http://nist.gov/speech/tests/rt/rt2005/spring/rt05smeetingeval-plan-V1.pdf

Download references

Author information

Authors and Affiliations

Tampere University of Technology, P.O. Box 553, 33101, Tampere, Finland
Toni Heittola & Anssi Klapuri

Authors

Toni Heittola
View author publications
You can also search for this author in PubMed Google Scholar
Anssi Klapuri
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Rainer Stiefelhagen Rachel Bowers Jonathan Fiscus

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Heittola, T., Klapuri, A. (2008). TUT Acoustic Event Detection System 2007. In: Stiefelhagen, R., Bowers, R., Fiscus, J. (eds) Multimodal Technologies for Perception of Humans. RT CLEAR 2007 2007. Lecture Notes in Computer Science, vol 4625. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-68585-2_35

Download citation

DOI: https://doi.org/10.1007/978-3-540-68585-2_35
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-68584-5
Online ISBN: 978-3-540-68585-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics