Abstract
Detecting filled pause in spontaneous speech recognition is very important since most of the speech is spontaneous and the most frequent phenomenon in Indonesian spontaneous speech is filled pause. This paper discusses the detection of filled pauses in spontaneous speech of Indonesian by utilizing acoustic features of the speech signal. The detection was conducted by employing statistical method using Naïve Bayes, Classification Tree, and Multilayer Perceptron algorithm. To build the model, speech data were collected from an entertainment program. Word parts in the data were labeled and its features were extracted. These include the formant and pitch stability, energy-drop, and duration. Half an hour of sentences contains 295 filled pause and 2082 non-filled pause words were employed as training data. Using 25 sentences as testing data, Naïve Bayes gave best detection correctness, 74.35 % on a closed data set and 71.43 % on an open data set.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Audhkhasi, K., Kandhway, K., Deshmukh, O., Verma, A.: Formant-based technique for automatic filled-pause detection in spontaneous spoken English. In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2009, pp. 4857–4860 (2009)
Barras, C., Geoffrois, E., Wu, Z., Liberman, M.: Transcriber: a free tool for segmenting, labeling and transcribing speech. In: First international conference on language resources and evaluation (LREC), pp. 1373–1376 (1998)
Batliner, A., Kießling, A., Burger, S., Nöth, E.: Filled pauses in spontaneous speech (2011)
Boersma, P., Weenink, D.: PRAAT: A system for doing phonetics by computer, in Report of the Institute of Phonetic Sciences of the University of Amsterdam 132 (1996)
Fitzgerald, E., Hall, K., Jelinek, F.: Reconstructing false start errors in spontaneous speech text. In: Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics, pp. 255–263 (2009)
Garner, S.R.: Weka: The waikato environment for knowledge analysis. In: Proceedings of the New Zealand computer science research students conference, pp. 57–64 (1995)
Goto, M., Itou, K., Hayamizu, S.: A real-time filled pause detection system for spontaneous speech recognition. In: Proceedings of the 6th European Conference on Speech Communication and Technology (Eurospeech 1999), pp. 227–230 (1999)
Kaushik, M., Trinkle, M., Hashemi-Sakhtsari, A.: Automatic detection and removal of disfluencies from spontaneous speech. In: Australasian International Conference on Speech Science and Technology, Melbourne Victoria (2010)
Liu, Y., Shriberg, E., Stolcke, A.: Automatic disfluency identification in conversational speech using multiple knowledge sources. In: Proceedings of Eurospeech, vol. 1, pp. 957–960 (2003)
O’Shaughnessy, D.: Recognition of hesitations in spontaneous speech. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1992, vol. 1, pp. 521–524 (1992)
Shriberg, E., Bates, R., Stolcke, A.: A prosody-only decision-tree model for disfluency detection. In: Proceedings of Eurospeech, vol. 5, pp. 2383–2386 (1997)
Shriberg, E.: Spontaneous speech: How people really talk and why engineers should care. In: Proceedings of. European Conference on Speech Communication and Technology (Eurospeech) (2005)
Stolcke, A., Shriberg, E.: Automatic linguistic segmentation of conversational speech. In: Proceedings Fourth International Conference on Spoken Language, ICSLP 1996, IEEE, vol. 2, pp. 1005–1008 (1996)
Stolcke, A., Shriberg, E., Bates, R.A., Ostendorf, M., Hakkani, D., Plauche, M., Lu, Y.: Automatic detection of sentence boundaries and disfluencies based on recognized words. In: ICSLP (1998)
Stouten, F., Martens, J.P.: A feature-based filled pause detection system for Dutch. In: IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2003, pp. 309–314 (2003)
Swerts, M., Wichmann, A., Beun, R.J.: Filled pauses as Markers of Discourse Structure (1996)
Ward, W.: Understanding spontaneous speech. In: Proceedings of the workshop on Speech and Natural Language of Association for Computational Linguistics, pp. 137–141 (1989)
Žgank, A., Rotovnik, T., Sepesy Maučec, M.: Slovenian spontaneous speech recognition and acoustic modeling of filled pauses and onomatopoeas. In: WSEAS Transactions on Signal Processing (2008)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer Science+Business Media Singapore
About this paper
Cite this paper
Sani, A., Lestari, D.P., Purwarianti, A. (2016). Filled Pause Detection in Indonesian Spontaneous Speech. In: Hasida, K., Purwarianti, A. (eds) Computational Linguistics. PACLING 2015. Communications in Computer and Information Science, vol 593. Springer, Singapore. https://doi.org/10.1007/978-981-10-0515-2_4
Download citation
DOI: https://doi.org/10.1007/978-981-10-0515-2_4
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-0514-5
Online ISBN: 978-981-10-0515-2
eBook Packages: Computer ScienceComputer Science (R0)