Genetic Programming for Automatic Stress Detection in Spoken English

Xie, Huayang; Zhang, Mengjie; Andreae, Peter

doi:10.1007/11732242_41

Huayang Xie²⁹,
Mengjie Zhang²⁹ &
Peter Andreae²⁹

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 3907))

Included in the following conference series:

Workshops on Applications of Evolutionary Computation

1616 Accesses

Abstract

This paper describes an approach to the use of genetic programming (GP) for the automatic detection of rhythmic stress in spoken New Zealand English. A linear-structured GP system uses speaker independent prosodic features and vowel quality features as terminals to classify each vowel segment as stressed or unstressed. Error rate is used as the fitness function. In addition to the standard four arithmetic operators, this approach also uses several other arithmetic, trigonometric, and conditional functions in the function set. The approach is evaluated on 60 female adult utterances with 703 vowels and a maximum accuracy of 92.61% is achieved. The approach is compared with decision trees (DT) and support vector machines (SVM). The results suggest that, on our data set, GP outperforms DT and SVM for stress detection, and GP has stronger automatic feature selection capability than DT and SVM.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Speech Features for Discriminating Stress Using Branch and Bound Wrapper Search

Stress Detection from Speech Using Spectral Slope Measurements

Approach for Spectral Analysis in Detection of Selected Pronunciation Pathologies

References

Ladefoged, P.: Three Areas of experimental phonetics. Oxford University Press, London (1967)
Google Scholar
Ladefoged, P.: A Course in Phonetics, 3rd edn. Harcourt Brace Jovanovich, New York (1993)
Google Scholar
Waibel, A.: Recognition of lexical stress in a continuous speech system - a pattern recognition approach. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, Tokyo, Japan, pp. 2287–2290 (1986)
Google Scholar
Jenkin, K.L., Scordilis, M.S.: Development and comparison of three syllable stress classifiers. In: Proceedings of the International Conference on Spoken Language Processing, Philadelphia, USA, pp. 733–736 (1996)
Google Scholar
van Kuijk, D., Boves, L.: Acoustic characteristics of lexical stress in continuous speech. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, Munich, Germany, vol. 3, pp. 1655–1658 (1999)
Google Scholar
Xie, H., Andreae, P., Zhang, M., Warren, P.: Detecting stress in spoken English using decision trees and support vector machines. Australian Computer Science Communications (Data Mining, CRPIT 32) 26, 145–150 (2004)
Google Scholar
Conrads, M., Nordin, P., Banzhaf, W.: Speech sound discrimination with genetic programming. In: Proceedings of the First European Workshop on Genetic Programming, pp. 113–129 (1998)
Google Scholar
Francone, F.D.: Discipulus owner’s manual (2004)
Google Scholar
Xie, H., Andreae, P., Zhang, M., Warren, P.: Learning models for English speech recognition. Australian Computer Science Communications (Computer Science, CRPIT 26) 26, 323–330 (2004)
Google Scholar
Quinlan, J.: C4.5: Programs for machine learning. Morgan Kaufmann, San Francisco (1993)
Google Scholar
Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines (2003), http://www.csie.ntu.edu.tw/_cjlin/papers/libsvm.pdf
Koza, J.R.: Genetic Programming — On the Programming of Computers by Means of Natural Selection. MIT Press, Cambridge (1992)
MATH Google Scholar
Dy, J.G., Brodley, C.E.: Feature selection for unsupervised learning. Journal of Machine Learning Research 5, 845–889 (2004)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Mathematics, Statistics and Computer Science, Victoria University of Wellington, P.O. Box 600, Wellington, New Zealand
Huayang Xie, Mengjie Zhang & Peter Andreae

Authors

Huayang Xie
View author publications
You can also search for this author in PubMed Google Scholar
Mengjie Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Peter Andreae
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Johannes Gutenberg University, Mainz, Germany
Franz Rothlauf
Institute AIFB, University of Karlsruhe, 76128, Karlsruhe, Germany
Jürgen Branke
Dipartimento di Ingegneria dell’Informazione, Università di Parma,
Stefano Cagnoni
Centre of Informatics and Systems of the University of Coimbra,
Ernesto Costa
Dept. LCC, Universidad de Málaga, Spain
Carlos Cotta
Institute of Computer Science, University of Bremen, 28359, Bremen, Germany
Rolf Drechsler
INRIA Saclay - Ile-de-France, Parc Orsay Université, 4, rue Jacques Monod, 91893, ORSAY Cedex, France
Evelyne Lutton
CISUC, Department of Informatics Engineering, University of Coimbra, Polo II of the University of Coimbra, 3030, Coimbra, Portugal
Penousal Machado
Dartmouth College, Lebanon, NH, USA
Jason H. Moore
Universidade de A Coruña, CP 15071, A Coruña, Spain
Juan Romero
School of Computing Sciences, UEA Norwich, University of East Anglia, NR4 7TJ, Norwich, UK
George D. Smith
Dipartimento di Automatica e Informatica, Politecnico di Torino, Italy
Giovanni Squillero
Kyushu University, Japan
Hideyuki Takagi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Xie, H., Zhang, M., Andreae, P. (2006). Genetic Programming for Automatic Stress Detection in Spoken English. In: Rothlauf, F., et al. Applications of Evolutionary Computing. EvoWorkshops 2006. Lecture Notes in Computer Science, vol 3907. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11732242_41

Download citation

DOI: https://doi.org/10.1007/11732242_41
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-33237-4
Online ISBN: 978-3-540-33238-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics