Abstract
Research fields such as speech recognition require a large amount of speech data with phoneme label information uttered by various speakers.Ho wever, phoneme labeling by visual inspection segmentation of input speech data into corresponding parts of given phoneme by human inspection is a time-consuming job.An automatic phoneme labeling system is required.Cur rently, several automatic phoneme labeling system based on Hidden Markov Model(HMM) were proposed. The performance of these systems depends on the used phoneme models.In this paper, at first, we propose an acquisition algorithm of accurate phoneme model with the optimum architecture, and then the obtained phoneme models is applied to segment an input speech without phoneme label information into the part corresponding to each phoneme label
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Takami, J. and Sagayama, S.: A Successive State Splitting Algorithm for Efficient Allophone Modeling. Proc. of ICASSP’92. (1992) 573–576
Suzuki, M., Makino, S., Ito, A., Aso, H. and Shimodaira, H.: A New HMnet Construction Algorithm Requiring No Contextual Factors. IEICE Trans. Inf. & Syst., E78-D,6. (1995) 662–668
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1998 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Suzuki, M., Maeda, T., Mori, H., Makino, S. (1998). Automatic Acquisition of Phoneme Models and Its Application to Phoneme Labeling of a Large Size of Speech Corpus. In: Arikawa, S., Motoda, H. (eds) Discovey Science. DS 1998. Lecture Notes in Computer Science(), vol 1532. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-49292-5_59
Download citation
DOI: https://doi.org/10.1007/3-540-49292-5_59
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-65390-5
Online ISBN: 978-3-540-49292-4
eBook Packages: Springer Book Archive