Detecting Broad Phonemic Class Boundaries from Greek Speech in Noise Environments

Mporas, Iosif; Zervas, Panagiotis; Fakotakis, Nikos

doi:10.1007/11846406_60

Detecting Broad Phonemic Class Boundaries from Greek Speech in Noise Environments

Iosif Mporas²¹,
Panagiotis Zervas²¹ &
Nikos Fakotakis²¹

Conference paper

1043 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4188))

Abstract

In this work, we present the performance evaluation of an implicit approach for the automatic segmentation of continuous speech signals into broad phonemic classes as encountered in Greek language. Our framework was evaluated with clear speech and speech with white, pink, bubble, car and machine gun additive noise. Our framework’s results were very promising since an accuracy of 76.1% was achieved for the case of clear speech (for distances less than 25 msec to the actual segmentation point), without presenting over-segmentation on the speech signal. An average reduction of 4% in the total accuracy of our segmentation framework was observed in the case of wideband distortion additive noise environment.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Young, S., Kershaw, D., Odell, J., Ollason, D., Valtchev, V., Woodland P.: The HTK Book, Revised for HTK Version 3.0 (July 2000)
Google Scholar
Zissman, M.: Comparison of four Approaches to Automatic Language Identification of Telephone Speech. IEEE Trans. Speech and Audio Proc. SAP-4, 31–44 (1996)
Article Google Scholar
Dutoit, T.: An Introduction to Text-To-Speech Synthesis. In: Text, Speech and Language Technology, vol. 3. Kluwer Academic Publishers, Dordrecht (1997)
Google Scholar
van Hemert, J.: Automatic Segmentation of Speech. IEEE Transactions on Signal Processing 39(4) (April 1991)
Google Scholar
Aversano, G., Esposito, A., Esposito, A., Marinaro, M.: A new text-independent method for phoneme segmentation. In: Proc. of 44th IEEE Midwest Symp. Circuits and Systems, vol. 2, pp. 516–519 (2001)
Google Scholar
Suh, Y., Lee, Y.: Phoneme segmentation of continuous speech using multi-layer perceptron. In: Proc. of ICSLP 1996, pp. 1297–1300 (1996)
Google Scholar
Svendsen, T., Kvale, K.: Automatic alignment of phonemic labels in continuous speech. In: Proc. of ICSLP 1990, Kobe, Japan (1990)
Google Scholar
Svendsent, T., Soong, F.K.: On the automatic segmentation of speech signals. In: Proc. of ICASSP 1987, Dallas, pp. 77–80 (April 1987)
Google Scholar
Grayden, D., Scordilis, M.: Phonemic segmentation of fluent speech. In: Proc. of ICASSP 1994, pp. 73–76 (1994)
Google Scholar
Essa, O.: Using prosody in automatic segmentation of speech. In: Proc. of 36th ACM Southeast Regional Conference (1998)
Google Scholar
Pellom, B., Hansen, J.: Automatic segmentation of speech recorded in unknown noisy channel characteristics. Speech Communication 25, 97–116 (1998)
Article Google Scholar
Reddy, D.R.: Pitch Period Determination of Speech Sounds. Communication of the ACM 10, 343–348 (1967)
Google Scholar
Tsagalidis, A.: http://www.media.uoa.gr/language/
Boersma, P.: Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound. In: Proc. of IFA,, vol. 17, pp. 97–110 (1993)
Google Scholar
Boersma, P., Weenink, D.: Praat: doing phonetics by computer (2005), Retrieved from: http://www.praat.org/
Deller, J., Proakis, J., Hansen, J.: Discrete-time processing of speech signals. MacMillan Series. Prentice-Hall Publishers, New York (1993)
Google Scholar
Zervas, P., Fakotakis, N., Kokkinakis, G.: Development of a prosodic database for Greek speech synthesis. In: Proc. of SPECOM 2005, Patras, Greece, pp. 603–606 (2005)
Google Scholar
Varga, A., Steenneken, H., J., M., Tomlinson, M., Jones, D.: The NOISEX 1992 study on the effect of additive noise on automatic speech recognition (1992)
Google Scholar

Download references

Author information

Authors and Affiliations

Wire Communications Laboratory, Electrical and Computer Engineering Department, University of Patras, 261 10 Rion, Patras, Greece
Iosif Mporas, Panagiotis Zervas & Nikos Fakotakis

Authors

Iosif Mporas
View author publications
You can also search for this author in PubMed Google Scholar
Panagiotis Zervas
View author publications
You can also search for this author in PubMed Google Scholar
Nikos Fakotakis
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Faculty of Informatics, Masaryk University, Brno, Czech Republic
Petr Sojka
Faculty of Informatics, Masaryk University, Botanická 68a, CZ-602 00, Brno, Czech Republic
Ivan Kopeček
Faculty of Informatics, Department of Computer Graphics and Design, Masaryk University, Botanická 68a, 60200, Brno, Czech Republic
Karel Pala

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Mporas, I., Zervas, P., Fakotakis, N. (2006). Detecting Broad Phonemic Class Boundaries from Greek Speech in Noise Environments. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2006. Lecture Notes in Computer Science(), vol 4188. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11846406_60

Download citation

DOI: https://doi.org/10.1007/11846406_60
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-39090-9
Online ISBN: 978-3-540-39091-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics