Learning phonetic rules in a speech recognition system

Alexin, Zoltán; Csirik, János; Gyimóthy, Tibor; Jelasity, Mark; Tóth, László

doi:10.1007/3540635149_33

Zoltán Alexin¹,
János Csirik²,
Tibor Gyimóthy³,
Mark Jelasity³ &
…
László Tóth³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1297))

Included in the following conference series:

International Conference on Inductive Logic Programming

108 Accesses
1 Citations

Abstract

Current speech recognition systems can be categorized into two broad classes; the knowledge-based approach and the stochastic one. In this paper we present a rule-based method for the recognition of Hungarian vowels. A spectrogram model was used as a front-end module and some acoustic features were extracted (e.g. locations, intensities and shapes of local maxima) from spectrograms by using a genetic algorithm method. On the basis of these features we developed a rule set for the recognition of isolated Hungarian vowels. These rules represented by Prolog clauses were refined by the IMPUT Inductive Logic Programming method.

This work was supported by the grants ESPRIT 20237 and PHARE TDQM 9305-02/1022 (“ILP2/HUN”).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Alexin, Z., Gyimóthy, T., Boström, H.: Integrating Algorithmic Debugging and Unfolding Transformation in an Interactive Learner in Proceedings of the 12th European Conference on Artificial Intelligence ECAI-96 ed. Wolfgang Wahlster, Budapest, Hungary (1996) 403–407 John Wiley & Son's Ltd. (1996)
Google Scholar
Bolla, Kálmán: Magyar fonetikai atlasz, A szegmentálás hangszerkezet elemei Nemzeti Tankönyvkiadó Rt Budapest (1995) in Hungarian
Google Scholar
Cooke, M. (eds.): Visual Representations of Speech Signals, Wiley, 1993
Google Scholar
Fohr, D., Haton, J., Laprie, Y.: Knowledge-Based Techniques in Acoustic-Phonetic Decoding of Speech: Interest and Limitations, International Journal on Pattern Recognition and Artificial Intelligence Vol. 8, No. 1, (1994) 133–153
Google Scholar
Huebener, K., Carson-Berndsen, J.: Phoneme Recognition Using Acoustic Events, Verbmobil Technical Report No. 15, June 1994
Google Scholar
Jelasity, M., Dombi, J.: GAS, an Approach on Modeling Species in Genetic Algorithms. Proceedings of the EA'95 (1995)
Google Scholar
Kókai, G., Alexin, Z., Kocsis, F.: The IDT System and its Application for Learning Prolog Programs. Proc. of the Sixth International Conference on Artificial Intelligence and Information Control Systems of Robots (AIICSR-94) Smolenice Castle Slovakia September 12–16. (1994) 315–320
Google Scholar
Lamel, L. F.: A knowledge-based system for stop consonant identification based on speech spectrogram reading, Computer Speech and Language (1993) 2, 169–191
Google Scholar
Lavrač, N., Džeroski, S.: Inductive Logic Programming: Techniques and Applications Ellis Horwood, (1994)
Google Scholar
Mammachandran, R. P., Mammone, R. J. (eds.): Modern Methods of Speech Processing, Kluwer Academic, 1995
Google Scholar
Mercier, G. et al.: Recognition of speaker-dependent continuous speech with KEAL, IEE Proceedings Vol. 136, Pt.1, No.2, April (1989)
Google Scholar
Muggleton, S., De Raedt, L.: Inductive Logic Programming: Theory and methods, Journal of Logic Programming 19 (20) (1994) 629–679
Google Scholar
Rabiner, L. R., Schafer, R. W.: Digital Processing of Speech Signals Prentice Hall (1978)
Google Scholar
Walbel, A., Lee, K. (eds.): Readings in Speech Recognition, Morgan Kaufmann (1990)
Google Scholar
Zue, V. W.: The Use of Speech Knowledge in Automatic Speech Recognition, Proc. IEEE, Vol. 73, No. 11, (1985) 1602–1615
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Applied Informatics, József Attila University, Árpád tér 2, H-6720, Szeged, Hungary
Zoltán Alexin
Department of Computer Science, József Attila University, Árpád tér 2, H-6720, Szeged, Hungary
János Csirik
Research Group on Artificial Intelligence, Hungarian Academy of Sciences, Aradi vértanuk tere 1, H-6720, Szeged, Hungary
Tibor Gyimóthy, Mark Jelasity & László Tóth

Authors

Zoltán Alexin
View author publications
You can also search for this author in PubMed Google Scholar
János Csirik
View author publications
You can also search for this author in PubMed Google Scholar
Tibor Gyimóthy
View author publications
You can also search for this author in PubMed Google Scholar
Mark Jelasity
View author publications
You can also search for this author in PubMed Google Scholar
László Tóth
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Nada Lavrač Sašo Džeroski

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Alexin, Z., Csirik, J., Gyimóthy, T., Jelasity, M., Tóth, L. (1997). Learning phonetic rules in a speech recognition system. In: Lavrač, N., Džeroski, S. (eds) Inductive Logic Programming. ILP 1997. Lecture Notes in Computer Science, vol 1297. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3540635149_33

Download citation

DOI: https://doi.org/10.1007/3540635149_33
Published: 10 July 2005
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-63514-7
Online ISBN: 978-3-540-69587-5
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics