Abstract
This paper explores technology available for prototyping and product development of child speech recognition applications, and factors relevant for eventual delivery of good user experience. Since child voices are known to be more challenging than adults, and less commercial investment has been dedicated to child voice modeling, it is significant that recognition accuracy for very constrained distinct vocabulary grammars is encouragingly high using out-of-the box technology to recognize suburban varieties of English. While accuracy on more demanding recognition tasks seems insufficient for immediate product development without focused support by a research laboratory, it is high enough to suggest that the class of feasible child applications may soon widen as new resources for child recognition become available. Preliminary results reported here contribute to a roadmap in pursuit of broad demographic coverage for a widening set of child applications.
Similar content being viewed by others
References
Adams, M. (1995). Beginning to Read,Cambridge, MA: MIT Press.
Fairweather, P., Nix, D., Oblinger,D., Adams, B., and Laffra, C. Overcoming Technical Barriers to a Speech-Enabled Children's Reading Tutor. IBM Research, undated.
Mostow, J.and Aist, G. (2001). Evaluating tutors that listen: An overview of project LISTEN. In K. Forbus and P. Feltovich (Eds.), Smart Machines in Education. AAAI Press/MIT Press.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Smith, L., Pham, T., Bower, J. et al. Toward Voice Applications for Children. International Journal of Speech Technology 5, 321–329 (2002). https://doi.org/10.1023/A:1020909008347
Issue Date:
DOI: https://doi.org/10.1023/A:1020909008347