skip to main content
research-article

Development of a Remote Therapy Tool for Childhood Apraxia of Speech

Published:14 November 2015Publication History
Skip Abstract Section

Abstract

We present a multitier system for the remote administration of speech therapy to children with apraxia of speech. The system uses a client-server architecture model and facilitates task-oriented remote therapeutic training in both in-home and clinical settings. The system allows a speech language pathologist (SLP) to remotely assign speech production exercises to each child through a web interface and the child to practice these exercises in the form of a game on a mobile device. The mobile app records the child's utterances and streams them to a back-end server for automated scoring by a speech-analysis engine. The SLP can then review the individual recordings and the automated scores through a web interface, provide feedback to the child, and adapt the training program as needed. We have validated the system through a pilot study with children diagnosed with apraxia of speech, their parents, and SLPs. Here, we describe the overall client-server architecture, middleware tools used to build the system, speech-analysis tools for automatic scoring of utterances, and present results from a clinical study. Our results support the feasibility of the system as a complement to traditional face-to-face therapy through the use of mobile tools and automated speech analysis algorithms.

References

  1. L. Anthony, Q. Brown, J. Nias, B. Tate, and S. Mohan. 2012. Interaction and recognition challenges in interpreting children's touch and gesture input on mobile devices. In ACM Conference on Interactive Tabletops and Surfaces. 225--234. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. ArtikPix. Retrieved October 1, 2015 from http://rinnapps.com/artikpix/.Google ScholarGoogle Scholar
  3. ASHA Ad Hoc Committee on Apraxia of Speech in Children. American Speech-Language-Hearing Association. 2007. Childhood apraxia of speech {Technical Report}. Available from www.asha.org/policy.Google ScholarGoogle Scholar
  4. K. J. Ballard, D. A. Robin, P. McCabe, and J. McDonald. 2010. A treatment for dysprosody in childhood apraxia of speech. Journal of Speech, Language, and Hearing Research 53, 1227--1245.Google ScholarGoogle ScholarCross RefCross Ref
  5. K. J. Ballard, H. D. Smith, D. Paramatmuni, P. McCabe, D. G. Theodoros, and B. E. Murdoch. 2012. Amount of kinematic feedback affects learning of speech motor skills. Motor Control 16, 106--119.Google ScholarGoogle ScholarCross RefCross Ref
  6. P. Boersma. 2002. Praat, a system for doing phonetics by computer. Glot International 5, 341--345.Google ScholarGoogle Scholar
  7. H. T. Bunnell, D. M. Yarrington, and J. B. Polikoff. 2000. STAR: Articulation training for young children. In International Conference on Spoken Language Processing. 85--88.Google ScholarGoogle Scholar
  8. N. Chen and Q. He. 2007. Using nonlinear features in automatic English lexical stress detection. In International Conference on Computational Intelligence and Security Workshops, 2007 (CISW’07). 328--332. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. G. A. Constantinescu, D. G. Theodoros, T. G. Russell, E. C. Ward, S. J. Wilson, and R. Wootton. 2010. Home-based speech treatment for Parkinson's disease delivered remotely: A case report. Journal of Telemedicine and Telecare 16, 100--4.Google ScholarGoogle ScholarCross RefCross Ref
  10. A. L. Delaney and R. D. Kent. 2004. Developmental profiles of children diagnosed with apraxia of speech. Presented at the Annual Convention of the American-Speech-Language-Hearing Association.Google ScholarGoogle Scholar
  11. M. de Sá, L. Carriço, J. Faria, and I. Sá. 2012. Children psychotherapy with mobile devices. In Human-Computer Interaction: The Agency Perspective, Studies in Computational Intelligence, M. Zacarias and J. V. de Oliveira, eds. Springer, 85--109.Google ScholarGoogle Scholar
  12. J. Fletcher. 2010. The prosody of speech: Timing and rhythm. In The Handbook of Phonetic Sciences (2nd ed.),W. J. Hardcastle, J. Laver, F. E. Gibbon, eds. Wiley, Hoboken, NJ, 521--602.Google ScholarGoogle Scholar
  13. K. Forrest. 2003. Diagnostic criteria of developmental apraxia of speech used by clinical speech-language pathologists. American Journal of Speech-Language Pathology 12, 376--380.Google ScholarGoogle ScholarCross RefCross Ref
  14. J. Froehlich, J. Wobbrock, and S. Kane. 2007. Barrier pointing: Using physical edges to assist target acquisition on mobile device touch screens. In ACM SIGACCESS Conference on Computers and Accessibility. 19--26. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. R. Gaines, C. Missiuna, M. Egan, and J. McLean. 2008. Educational outreach and collaborative care enhances physician's perceived knowledge about Developmental Coordination Disorder. BMC Health Services Research 8, 1--9.Google ScholarGoogle ScholarCross RefCross Ref
  16. A. Georgeadis, D. M. Brennan, L. N. Barker, and C. R. Baron. 2003. Telerehabilitation and its effect on story retelling by adults with neurogenic communication disorders. In Clinical Aphasiology Conference. 639--652.Google ScholarGoogle Scholar
  17. A. M. Harrison, W.-K. Lo, X. Qian, and H. Meng. 2009. Implementation of an extended recognition network for mispronunciation detection and diagnosis in computer-assisted pronunciation training. In SLaTE. 45--48.Google ScholarGoogle Scholar
  18. D. G. Jamieson, G. Kranjc, K. Yu, and W. E. Hodgetts. 2004. Speech intelligibility of young school-aged children in the presence of real-life classroom noise. Journal of the American Academy of Audiology 15, 7, 508--517.Google ScholarGoogle ScholarCross RefCross Ref
  19. Y.-J. Kim and M. C. Beutnagel. 2011. Automatic assessment of American English lexical stress using machine learning algorithms. In SLaTE. 93--96.Google ScholarGoogle Scholar
  20. H. Kolles and W. Feiden. 1995. Computer-assisted speech recognition in diagnostic pathology. Development of the DragonDictate. Pathologe 16, 6, 439--442.Google ScholarGoogle Scholar
  21. T. Kristjansson, S. Deligne, and P. Olsen. 2005. Voicing features for robust speech detection. Entropy 2, 3.Google ScholarGoogle Scholar
  22. S. Kwon and S. S. Narayanan. 2002. Speaker change detection using a new weighted distance measure. In International Conference on Spoken Language Processing (ICSLP’02). 2537--2540.Google ScholarGoogle Scholar
  23. K. Li, X. Qian, S. Kang, and H. Meng. 2013. Lexical stress detection for L2 English speech using deep belief networks. In INTERSPEECH. 1811--1815.Google ScholarGoogle Scholar
  24. E. Maas, D. A. Robin, S. N. A. Hula, S. E. Freedman, G. Wulf, K. J. Ballard, et al. 2008. Principles of motor learning in treatment of motor speech disorders. American Journal of Speech-Language Pathology 17, 277--298.Google ScholarGoogle ScholarCross RefCross Ref
  25. A. Maier, T. Haderlein, F. Stelzle, E. Nöth, E. Nkenke, and F. Rosanowski, et al. 2010. Automatic speech recognition systems for the evaluation of voice and speech disorders in head and neck cancer. EURASIP Journal on Audio, Speech, and Music Processing 1, Article ID: 926951. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. P. McCabe, A. G. Macdonald-D’Silva, L. J. van Rees, K. J. Ballard, and J. Arciuli. 2014. Orthographically sensitive treatment for dysprosody in children with Childhood Apraxia of Speech using ReST intervention. Developmental Neurorehabilitation 17, 137--145.Google ScholarGoogle ScholarCross RefCross Ref
  27. O. Mich. 2009. Evaluation of software tools with deaf children. In International ACM SIGACCESS Conference on Computers and Accessibility. 235--236. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. J. Moore and M. Churchward. 2010. Moodle 1.9 Extension Development. Packt Publishing, Birmingham, UK. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. R. J. Moran, R. B. Reilly, P. de Chazal, and P. D. Lacy. 2006. Telephony-based voice pathology assessment using automated speech analysis. IEEE Transactions on Biomedical Engineering 53, 468--477.Google ScholarGoogle ScholarCross RefCross Ref
  30. E. Murray, P. McCabe, and K. J. Ballard. 2012. A comparison of two treatments for childhood apraxia of speech: Methods and treatment protocol for a parallel group randomised control trial. BMC Pediatrics 12, 112.Google ScholarGoogle ScholarCross RefCross Ref
  31. D. Newbury and A. Monaco. 2010. Genetic advances in the study of speech and language disorders. Neuron 68, 309--320.Google ScholarGoogle ScholarCross RefCross Ref
  32. K. Newell, M. Carlton, and A. Antoniou. 1990. The interaction of criterion and feedback information in learning a drawing task. Journal of Motor Behavior 22, 536--552.Google ScholarGoogle ScholarCross RefCross Ref
  33. A. M. Oster, D. House, A. Protopapas, and A. Hatzis. 2002. Presentation of a new EU project for speech therapy: Ortho-Logo-Paedia. Presented at the Proceedings of TMH-QPSR, Fonetik.Google ScholarGoogle Scholar
  34. A. Parnandi, V. Karappa, Y. Son, M. Shahin, J. McKechnie, K. Ballard, et al. 2013. Architecture of an automated therapy tool for childhood apraxia of speech. In 15th International ACM SIGACCESS Conference on Computers and Accessibility. 5. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. M. A. Rahurkar, J. H. Hansen, J. Meyerhoff, G. Saviolakis, and M. Koenig. 2002. Frequency band analysis for stress detection using a teager energy operator based feature. In INTERSPEECH.Google ScholarGoogle Scholar
  36. J. Rick, A. Harris, P. Marshall, R. Fleck, N. Yuill, and Y. Rogers. 2009. Children designing together on a multi-touch tabletop: an analysis of spatial orientation and user interactions. In Conference on Interaction Design and Children. 106--114. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. S. Rvachew and F. Brosseau-Lapre. 2006. Speech perception intervention. In Interventions for Speech Sound Disorders in Children, S. McLeod, (ed.). Brookes Publishing, Baltimore, MD.Google ScholarGoogle Scholar
  38. R. A. Schmidt and T. Lee. 2005. Motor Control and Learning, 4th ed. Human Kinetics, Champaign, IL.Google ScholarGoogle Scholar
  39. M. A. Shahin, B. Ahmed, and K. J. Ballard. 2012. Automatic classification of unequal lexical stress patterns using machine learning algorithms. In 2012 IEEE Spoken Language Technology Workshop (SLT). 388--391.Google ScholarGoogle Scholar
  40. K. Shobaki, J. P. Hosom, and R. A. Cole. 2000. The OGI kids’ speech corpus and recognizers. In International Conference on Spoken Language Processing.Google ScholarGoogle Scholar
  41. L. D. Shriberg, T. F. Campbell, H. B. Karlsson, R. L. Brown, J. L. Mcsweeny, and C. J. Nadler. 2003. A diagnostic marker for childhood apraxia of speech: The lexical stress ratio. Clinical Linguistics & Phonetics 17, 549--574.Google ScholarGoogle ScholarCross RefCross Ref
  42. J. Tepperman and S. Narayanan. 2005. Automatic Syllable Stress Detection Using Prosodic Features for Pronunciation Evaluation of Language Learners. In ICASSP (1), 937--940.Google ScholarGoogle Scholar
  43. T. K. Veale. 1999. Targeting temporal processing deficits through fast ForWord® language therapy with a new twist. Language, Speech, and Hearing Services in Schools 30, 353--362.Google ScholarGoogle ScholarCross RefCross Ref
  44. D. Vilozni, M. Barker, H. Jellouschek, G. Heimann, and H. Blau. 2001. An interactive computer-animated system (SpiroGame) facilitates spirometry in preschool children. American Journal of Respiratory and Critical Care Medicine 164, 2200--2205.Google ScholarGoogle ScholarCross RefCross Ref
  45. M. Waite, L. Cahill, D. Theodoros, S. Busuttin, and T. Russell. 2006. A pilot study of online assessment of childhood speech disorders. Journal of Telemedicine and Telecare 92--94.Google ScholarGoogle ScholarCross RefCross Ref
  46. A. Williams. 2006. Multiple oppositions intervention. In Interventions for Speech Sound Disorders in Children, A. L. Williams, S. McLeod, R. J. McCauley, et al. (eds.). Brookes Publishing, Baltimore, MD.Google ScholarGoogle Scholar
  47. P. Williams and H. Stephens. 2010. Nuffield Centre Dyspraxia Programme. In Interventions for Speech Sound Disorders in Children, A. L. Williams, S. McLeod, R. J. McCauley, et al. (eds.). Brookes Publishing, Baltimore, MD.Google ScholarGoogle Scholar
  48. Y. Wren, S. Roulstone, and A. L. Williams. 2006. Computer-Based Interventions. In Interventions for Speech Sound Disorders in Children, S. McLeod (ed.), Brookes Publishing, Baltimore, MD.Google ScholarGoogle Scholar
  49. S.-C. Yin, R. Rose, O. Saz, and E. Lleida. 2009. A study of pronunciation verification in a speech therapy application. In IEEE International Conference on Acoustics, Speech and Signal Processing, 4609--4612. Google ScholarGoogle ScholarDigital LibraryDigital Library
  50. S. J. Young, G. Evermann, M. J. F. Gales, T. Hain, D. Kershaw, and G. Moore, et al. 2006. The HTK Book, version 3.4. Cambridge University, Cambridge, UK.Google ScholarGoogle Scholar
  51. E. Zwicker. 1961. Subdivision of the audible frequency range into critical bands (Frequenzgruppen). The Journal of the Acoustical Society of America 33, 248--248.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Development of a Remote Therapy Tool for Childhood Apraxia of Speech

            Recommendations

            Comments

            Login options

            Check if you have access through your login credentials or your institution to get full access on this article.

            Sign in

            Full Access

            • Published in

              cover image ACM Transactions on Accessible Computing
              ACM Transactions on Accessible Computing  Volume 7, Issue 3
              Special Issue (Part 2) of Papers from ASSETS 2013
              November 2015
              79 pages
              ISSN:1936-7228
              EISSN:1936-7236
              DOI:10.1145/2836329
              Issue’s Table of Contents

              Copyright © 2015 ACM

              Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

              Publisher

              Association for Computing Machinery

              New York, NY, United States

              Publication History

              • Published: 14 November 2015
              • Accepted: 1 May 2015
              • Revised: 1 March 2015
              • Received: 1 June 2014
              Published in taccess Volume 7, Issue 3

              Permissions

              Request permissions about this article.

              Request Permissions

              Check for updates

              Qualifiers

              • research-article
              • Research
              • Refereed

            PDF Format

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader