Automatic Acquisition of Phoneme Models and Its Application to Phoneme Labeling of a Large Size of Speech Corpus

Suzuki, Motoyuki; Maeda, Teruhiko; Mori, Hiroki; Makino, Shozo

doi:10.1007/3-540-49292-5_59

Motoyuki Suzuki^3,4,
Teruhiko Maeda⁴,
Hiroki Mori⁵ &
…
Shozo Makino^3,4

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1532))

Included in the following conference series:

International Conference on Discovery Science

569 Accesses

Abstract

Research fields such as speech recognition require a large amount of speech data with phoneme label information uttered by various speakers.Ho wever, phoneme labeling by visual inspection segmentation of input speech data into corresponding parts of given phoneme by human inspection is a time-consuming job.An automatic phoneme labeling system is required.Cur rently, several automatic phoneme labeling system based on Hidden Markov Model(HMM) were proposed. The performance of these systems depends on the used phoneme models.In this paper, at first, we propose an acquisition algorithm of accurate phoneme model with the optimum architecture, and then the obtained phoneme models is applied to segment an input speech without phoneme label information into the part corresponding to each phoneme label

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Takami, J. and Sagayama, S.: A Successive State Splitting Algorithm for Efficient Allophone Modeling. Proc. of ICASSP’92. (1992) 573–576
Google Scholar
Suzuki, M., Makino, S., Ito, A., Aso, H. and Shimodaira, H.: A New HMnet Construction Algorithm Requiring No Contextual Factors. IEICE Trans. Inf. & Syst., E78-D,6. (1995) 662–668
Google Scholar

Download references

Author information

Authors and Affiliations

Computer Center, Tohoku Univ, 980-8578, Japan
Motoyuki Suzuki & Shozo Makino
Graduate School of Information Sciences, Tohoku Univ, Japan
Motoyuki Suzuki, Teruhiko Maeda & Shozo Makino
Graduate School of Engineering, Tohoku Univ, Japan
Hiroki Mori

Authors

Motoyuki Suzuki
View author publications
You can also search for this author in PubMed Google Scholar
Teruhiko Maeda
View author publications
You can also search for this author in PubMed Google Scholar
Hiroki Mori
View author publications
You can also search for this author in PubMed Google Scholar
Shozo Makino
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Informatics, Kyushu University, Fukuoka, 812-8581, USA
Setsuo Arikawa
Institute of Scientific and Industrial Research Devision of Intelligent Systems Science, Osaka University, 8-1 Mihogaoka, Ibaraki, Osaka, 567-0047, Japan
Hiroshi Motoda

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Suzuki, M., Maeda, T., Mori, H., Makino, S. (1998). Automatic Acquisition of Phoneme Models and Its Application to Phoneme Labeling of a Large Size of Speech Corpus. In: Arikawa, S., Motoda, H. (eds) Discovey Science. DS 1998. Lecture Notes in Computer Science(), vol 1532. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-49292-5_59

Download citation

DOI: https://doi.org/10.1007/3-540-49292-5_59
Published: 14 January 2003
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-65390-5
Online ISBN: 978-3-540-49292-4
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics