Toward a Sound Analysis System for Telemedicine

Nguyen, Cong Phuong; Pham, Thi Ngoc Yen; Eric, Castelli

doi:10.1007/11540007_44

Cong Phuong Nguyen²⁰,
Thi Ngoc Yen Pham²⁰ &
Castelli Eric²⁰

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3614))

Included in the following conference series:

International Conference on Fuzzy Systems and Knowledge Discovery

901 Accesses
4 Citations

Abstract

Our work is within the framework of studying a sound analysis system in a telemedicine project. The task of this system is to detect situations of distress in a patient’s room basing on sound analysis. In this paper we present our studies on the constructions of a speech/non-speech discriminator and of a speech/scream-groan discriminator. The first discriminator’s task is to distinguish speech signal from non speech signal in a room such as sounds of broken glass, door shutting, chair falling, water in toilette, etc. The second one’s task is to detect sounds of scream-groan from speech signal. Results show that these discriminators are applicable to our sound analysis system.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Istrate, D., Vacher, M., Castelli, E., Besacier, L., Sérignat, J.F.: Distress Situation Identification though Sound Processing. In: An Application to Medical Telemonitoring. European Conference on Computational Biology, Paris (2003)
Google Scholar
Équipe GEOD Dan Istrate: C.-I. Base de données. Sons de la vie courante (2001)
Google Scholar
Gauvain, J.L., Lamel, L.F., Eskenazi, M.: Design Considerations and Text Selection for BREF, a large French read-speech corpus. In: International Conference on Spoken Language Processing 1990, Kobe Japan (1990)
Google Scholar
Saunders, J.: Real-Time Discrimination of Broadcast Speech/Music. In: International Conference on Acoustics, Speech and Signal Processing (1996)
Google Scholar
Scheirer, E., Slaney, M.: Construction and Evaluation of a Robust Multifeature Music/Speech Discriminator. In: International Conference on Acoustics, Speech and Signal Processing (1997)
Google Scholar
Carey, M.J., Parris, E.S., Lloyd-Thomas, H.: A Comparison of Features for Speech, Music Discrimination. In: International Conference on Acoustics, Speech and Signal Processing, Phoenix AZ (1999)
Google Scholar
El-Maleh, K., Samouelian, A., Kabal, P.: Frame-Level Noise Classification in Mobile Environments. In: International Conference on Acoustics, Speech and Signal Processing, Phoenix AZ (1999)
Google Scholar
Wegmann, S., Zhan, P., Gillick, L.: Progress in Broadcast News Transcription at Dragon Systems. In: International Conference on Acoustics, Speech and Signal Processing, Phoenix AZ, vol. I, pp. 33–36 (1999)
Google Scholar
Moreno, P.J., Rifkin, R.: Using the Fisher Kernel Method for Web Audio Classification. In: International Conference on Acoustics, Speech and Signal Processing, vol. 4, pp. 2417–2420 (2000)
Google Scholar
Lu, L., Zhang, H.-J., Jiang, H.: Content Analysis for Audio Classification and Segmentation. IEEE Transaction on speech and audio processing 10(7) (2002)
Google Scholar
Ajmera, J., McCowan, I., Bourlard, H.: Speech/Music Segmentation Using Entropy and Dynamism Features in a HMM Classification Framework. Speech Communication 40, 351–363 (2003)
Article Google Scholar
McKinney, M.F., Breebaart, J.: Features for Audio and Music Classification. In: 4^th International Conference on Music Information, Maryland USA (2003)
Google Scholar
Liu, M., Wang, C., Wang, L.P.: Content-Based Audio Classification and Retrieval Using a Fuzzy Logic System: Towards Multimedia Search Engines. Soft Computing 6, 357–364 (2002)
MATH Google Scholar
Schmandt, C., Vallejo, G.: “Listenin” to Domestic Environments from Remote Locations. In: International Conference on Auditory Display, Boston USA (2003)
Google Scholar
http://www.homeguardion.com/

Download references

Author information

Authors and Affiliations

International Research Center MICA, 1 Daicoviet, Hanoi, Vietnam
Cong Phuong Nguyen, Thi Ngoc Yen Pham & Castelli Eric

Authors

Cong Phuong Nguyen
View author publications
You can also search for this author in PubMed Google Scholar
Thi Ngoc Yen Pham
View author publications
You can also search for this author in PubMed Google Scholar
Castelli Eric
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Electrical and Electronic Engineering, Nanyang Technological University, Block S1, Nanyang Avenue, 639798, Singapore
Lipo Wang
Honda Research Institute Europe GmbH, Offenbach/Main, Germany
Yaochu Jin

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Nguyen, C.P., Pham, T.N.Y., Eric, C. (2005). Toward a Sound Analysis System for Telemedicine. In: Wang, L., Jin, Y. (eds) Fuzzy Systems and Knowledge Discovery. FSKD 2005. Lecture Notes in Computer Science(), vol 3614. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11540007_44

Download citation

DOI: https://doi.org/10.1007/11540007_44
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28331-7
Online ISBN: 978-3-540-31828-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics