Abstract
The prime objective of this paper is to conduct phoneme categorization experiments for Indian languages. In this direction a major effort has been made to categorize Hindi phonemes using a time delay neural network (TDNN), and compare the recognition scores with other languages. A total of six neural nets aimed at the major coarse of phonetic classes in Hindi were trained. Evaluation of each net on 350 training tokens and 40 test tokens revealed a 99% recognition rate for vowel classes, 87% for unvoiced stops, 82% for voiced stops, 94.7% for semi vowels, 98.1% for nasals and 96.4% for fricatives. A new feature vector normalisation technique has been proposed to improve the recognition scores.
References
Bourlard H, Morgan N (1994) Connectionist speech recognition: a hybrid approach. Kluwer, Dordrecht, pp 155–183
Lippmann RP (1987) An introduction to computing with neural nets. In: IEEE ASSP Magazine, IEEE, New York pp 8–20
Waibel AH, Sawai H, Shikano K (1989) Consonant recognition by modular construction of large phonemic time-delay neural networks. In: Proc. IEEE, ICASSP-89, Glasgow, Scotland, vol. I, IEEE, New York, pp 112–115
Yang R, Majaniemi M, Haavisto P (1995) Dynamic parameter compensation for speech recognition in noise. In: Proc. Eurospeech’95, vol. I, ESCA, France, pp 469–472
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Dev, A., Agrawal, S.S. & Choudhury, D.R. Categorization of Hindi phonemes by neural networks. AI & Soc 17, 375–382 (2003). https://doi.org/10.1007/s00146-003-0263-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00146-003-0263-0