
1 Introduction

Autism spectrum disorder (ASD) is a neurodevelopmental disorder characterized mainly by repetitive and restricted behaviors and by deficits in verbal communication, social interaction and emotion recognition [3, 21]. Children with ASD are less willing to communicate with others, including in classrooms where formal education takes place; hence, compared with typically developing (TD) children, they have more limited channels through which to acquire new knowledge. In addition, although autistic children demonstrate delays in expressive and receptive language, the extent of these delays varies widely across individuals and contexts [18]. On the other hand, young brains are highly malleable, so early intervention that capitalizes on this great learning potential can help limit some developmental impairments [7, 10, 17, 21], including in early language and nonverbal skills [18]. Thanks to technological advances, numerous technology-based intervention applications have been developed (see [19] for a brief discussion). However, the overwhelming majority of this previous work has focused on teaching individuals with autism social communication skills [20], and few studies have addressed the feasibility and efficacy of early language intervention; in addition, much prior work targets learning in formal, established environments such as classrooms and clinical centers.

On the other hand, AR uniquely combines multiple instructional channels, including static and dynamic visual stimuli and auditory stimuli, with the kinesthetic movement of handling a mobile phone; it lets users interact with the real world in an enhanced way. Supporting learning in different ways according to learners' different needs is paramount; this principle is grounded in the foundational Universal Design for Learning (UDL) framework [16], which has served as a launching point for AR-based learning tools. A number of AR-based teaching tools have appeared over the past few years; however, few have been built for children with autism, which motivates our work.

In particular, in this paper we present a lightweight augmented reality (AR)-based mobile word-learning application that allows users to capture a photo in which up to four objects can be recognized and spoken aloud in both English and Chinese. The core of the application is an offline, deep-learning-based object recognition module capable of recognizing objects from any angle. This unique feature offers valuable learning opportunities not only for children with autism but also for those with other special needs, as confirmed in two small-scale pilot studies.

In particular, a small-scale feasibility and usability pilot test was conducted during a public show with typically developing (TD) children and adults (including parents who tried our application with their young children), with very satisfying results. Based on their comments, we simplified the system design. We then demonstrated the enhanced version at a special education school in one of the biggest cities in southern China, interviewed teachers and let several children play with it; the very positive feedback provides valuable input for further adjusting the system design.

The remainder of this paper is organized as follows. Previous work is reviewed in Section 2, and the system is described in Section 3. Observations and discussions from the two pilot studies appear in Section 4. We conclude by distilling the early yet valuable insights from children, their parents and special education teachers that can guide further development of our system.

1.1 Motivation

The current literature on AR technology for Chinese word learning by children with special needs is still in its infancy and thus offers limited insight into the therapeutic efficacy, feasibility and applicability of individualized intervention for autistic individuals, particularly children. Our work, although preliminary, offers early insight into the usability and usefulness of such an AR-based mobile learning application.

2 Previous Works on AR-Based Technology for Children with Autism

There is abundant prior work on adopting AR technology for therapeutic use and education, particularly for individuals with developmental disorders [15]; the major advantage of an AR-enabled environment is that it greatly facilitates the cognitive mapping between what is in users' prior knowledge and what they observe in the real world [12]. Such authentic opportunities can promote knowledge transfer and offer more opportunistic learning. In this section, we focus on the use of the technology for tailored, personalized intervention for children with autism. Its application for Chinese-speaking autistic individuals is also discussed to motivate the development of our application.

The majority of AR-based applications have targeted interventions for enhancing children's social and communication skills. For example, [9] described an Object Identification System that allows teachers to superimpose digital content on top of physical objects; a five-week study revealed that the AR-based application could increase the sustained and selective attention of children with autism and elicit positive emotions, thereby promoting engagement during therapy. However, because the application requires specially trained therapists, it cannot easily be used outside the clinic, which restricts its usefulness, as most of an autistic child's learning occurs outside the classroom. McMahon et al. [15] applied AR to teaching science vocabulary and strongly advocated the authentic opportunities AR affords children with developmental disorders, including autism. Improved attention and enhanced social skills training were observed in [5], where AR was used to visually conceptualize social stories for children with high-functioning autism. An AR-based application has also been developed to train autistic children's emotion expression and social skills [4]. Enhancing pretend play was the focus of [2, 8, 11], and results from these studies indicate that AR offers advantages over traditional intervention techniques; among the three, [8] examined such an AR-based play setting in a classroom. [1] implemented an audio-augmented paper that supports audio recording on standard sheets of paper in a storytelling activity; its unique feature is that it is built with tangible physical tools that can be shared between therapist and child. AR-based Google Glass has been studied for home-based social affective learning in children with autism [6]: in a series of at-home studies with parents on facial affect recognition tasks, reported increases in eye contact and social acuity led the researchers to endorse its therapeutic use [6]. Liu et al. [13] systematically explored, for the first time, the feasibility of autism-focused AR-based smart glasses for training social communication and behavioral coaching, and concluded that AR can significantly increase children's engagement and fun, which might in turn improve the targeted skills during intervention. However, a recent study of AR for social skills intervention failed to find significant improvement between groups [14]; further ecological studies are necessary. Despite this, given the unique, immersive learning environment an AR-based system can offer, we expect AR to feature in many future systems.

Of note, we failed to uncover any published English-language articles on AR-based applications for Chinese-speaking users, which is both the motivation for and the uniqueness of our work.

3 System Overview and Offline Object Recognition

3.1 The Offline Object Recognition Module Built on the TensorFlow Platform

The offline object recognition module was implemented within Google's TensorFlow machine learning framework. Since our system aims to facilitate teaching and learning at any time and in any place (particularly in outdoor settings and at home) without relying on online training, we did not modify the sample code; instead, we took advantage of pre-trained models covering around one hundred object classes to cold-start our system. We realized that, in the absence of a large amount of training data, such an algorithm may suffer from inaccurate recognition, which we observed repeatedly during in-lab testing. To alleviate this problem, we integrated a small module that allows the user to correct the result manually; its discussion is beyond the scope of this paper.
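For concreteness, the following minimal sketch illustrates the kind of offline inference pass such a module performs. It assumes a pre-trained TensorFlow Lite SSD-style detection model; the model file name, output-tensor ordering and score threshold shown here are illustrative assumptions, not a description of our exact implementation.

```python
# A minimal sketch of an offline detection pass, assuming a pre-trained
# TensorFlow Lite SSD-style model. The model file name, output-tensor
# ordering and score threshold are illustrative assumptions.
import numpy as np
import tensorflow as tf

interpreter = tf.lite.Interpreter(model_path="detect.tflite")
interpreter.allocate_tensors()
input_details = interpreter.get_input_details()
output_details = interpreter.get_output_details()

def detect(image, score_threshold=0.5):
    """Run one fully offline detection pass.

    `image` is an HxWx3 uint8 array already resized to the model's
    expected input size (e.g. 300x300 for a typical SSD model).
    """
    interpreter.set_tensor(input_details[0]["index"], image[np.newaxis, ...])
    interpreter.invoke()
    # Typical SSD post-processing outputs: boxes, class ids, scores.
    boxes = interpreter.get_tensor(output_details[0]["index"])[0]
    classes = interpreter.get_tensor(output_details[1]["index"])[0]
    scores = interpreter.get_tensor(output_details[2]["index"])[0]
    # Keep at most four confident detections, mirroring the app's
    # up-to-four-objects-per-photo limit.
    keep = [i for i in np.argsort(-scores) if scores[i] >= score_threshold][:4]
    return [(int(classes[i]), float(scores[i]), boxes[i]) for i in keep]
```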

At present, our system recognizes around one hundred typical daily objects fully offline, which offers a tremendous advantage, particularly for rural users who have far less access to therapists and to the internet [19].

3.2 System Development Phases and User Interfaces: System Versions Without Arduino Sensors

Our system went through several design iterations aimed at ease of use for autistic children and their loved ones, particularly in rural areas. Two types of AR-based system were implemented, one with Arduino sensors and one without (targeting younger children). In this paper, we focus on the latter. Figure 1 presents sample user interfaces from these design iterations; in the earlier versions the system could recognize only one object per use (the first and middle images in Fig. 1).

Fig. 1. Three sample user interfaces from the system development iterations (in the rightmost image, the system is seen recognizing four objects in one scene).

To differentiate and highlight the items in a scene, a colored border wraps each object, and each border color is matched to a colored button; when the user presses a colored button, the Chinese word for the corresponding item is shown and spoken, as shown in Fig. 2. Notice that in Fig. 2 the user targeted a photo in a browser, which shows that our system can support learning anywhere and at any time for these children.
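The sketch below illustrates this color-to-button mapping logic. The color palette, the small bilingual label table and the `speak` callback are hypothetical placeholders rather than our actual implementation, which relies on the mobile platform's text-to-speech facilities.

```python
# An illustrative sketch of the color-coding logic: each recognized object
# is wrapped in a colored border whose color matches one of the buttons,
# and tapping a button shows and speaks the word in English and Chinese.
# The palette, label table and `speak` callback are hypothetical.
PALETTE = ["red", "green", "blue", "yellow"]  # one color per object, max four

BILINGUAL_LABELS = {  # English label -> Chinese word (small excerpt)
    "bicycle": "自行车",
    "cup": "杯子",
    "elephant": "大象",
}

def assign_colors(detections):
    """Pair each detection (english_label, bounding_box) with a border color."""
    return {color: det for color, det in zip(PALETTE, detections)}

def on_button_pressed(color, color_map, speak):
    """Handle a tap on a colored button: show and speak both words."""
    english, _box = color_map[color]
    chinese = BILINGUAL_LABELS.get(english, english)
    speak(english, lang="en")  # e.g. the platform text-to-speech engine
    speak(chinese, lang="zh")
    return english, chinese
```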

Fig. 2. A sample of word recognition in which both the Chinese and English words appear and all the bicycles are highlighted.

Notice that, as shown in Fig. 2, the button design has been simplified by removing text labels and using plain colored buttons, to accommodate younger autistic children with limited vocabulary.

4 Observations from Pilot Studies and Discussions

Since there was no prior work to draw on in designing such an AR-based word-learning mobile application, we conducted two small pilot studies to obtain feedback from TD children and their parents, from children with autism and their loved ones, and from special education teachers. The main observations and discussions are presented in the following sections.

4.1 Pilot Study One and Main Observations

The first small-scale usability test was conducted with TD children and adults at a public show on the university campus. A further goal of the test was to assess the accuracy of the offline object recognition algorithm and its portability. Figure 3 shows the moment when a young girl, with her mother at her side, was shown the application; Fig. 4 shows a group of adults being shown the offline object recognition.

Fig. 3. A young girl being shown by the researcher how the application recognizes an object, with her mother behind her.

Fig. 4. A group of adult attendees being shown by the researcher how the application recognizes an object.

The simple, lightweight mobile application received very good reviews from both adults and children; parents and young audience members in particular remarked that such AR-based applications are very rare in China, and children showed high interest in trying it over several rounds. Parents were particularly satisfied with the application's audio module, which could facilitate teaching and learning at home and largely relieve them of repetitive teaching.

Surprisingly, the accuracy of the application was satisfying, although a few items supplied by audience members were wrongly recognized. Another reason accuracy did not appear to be a big issue in this pilot study may be the limited time each child spent playing with the application.

We hypothesize that the objects used in teaching and learning with young children are typical ones that the algorithm can mostly recognize accurately. However, as children grow up, their vocabularies expand and their environments become more sophisticated, and the algorithm's performance will decline.

4.2 Pilot Study Two and Discussions

General Description and Goals.

We conducted a second pilot test at a private special education center in Hangzhou, one of China's biggest cities. The main goals were twofold and the same as those of study one.

Study Participants and Environment.

Five children in two age groups tried the application: one group consisted of children under five years old, and the other of children between six and eight years old. The test objects included a set of toy animals that we brought (see Figs. 5 and 6) and objects available in the center. Besides the children, accompanying parents and teachers also tried the application and were interviewed by us.

Fig. 5. The toy animals brought to the center for the children to try.

Fig. 6. The application accurately recognizing the elephant toy.

General Observation with Children with Autism and Discussions.

Overall, the group of younger children had difficulty using the application correctly; they did show excitement and surprise when the application pronounced an object's name, but they quickly lost interest after hearing the voice several times. By comparison, the application was very well received among the older children, who not only showed high interest in learning with it but also used it without any training or difficulty. Figure 7 shows one testing moment with a child and the toy-animal set, while Fig. 8 shows testing on multiple cups.

Fig. 7. The application being tested by a child and his parent at the private education center.

Fig. 8. A boy testing the application on multiple cups with the researcher at the private education center.

Feed-backs from Teachers.

Very satisfying feedback was received from the teachers regarding the application's usefulness and usability. In particular, they were pleased by one advantage of a technology-based application that significantly reduces their effort: its repeatability. Its ease of use and its audio features also attracted their attention, in that children with special needs could learn pronunciations from it as well. However, the performance of the offline algorithm was not satisfactory; for example, the application could not recognize some books because of their unusual and varied covers, shapes and colors. The teachers said it would make an excellent learning companion if the performance could be significantly improved.

5 Concluding Remarks and Future Works

We developed a lightweight AR-based word-learning application that lets children with autism learn words at any time and in any place, particularly outside the classrooms where many AR-based applications targeting TD children work effectively.

Overall, the feasibility and usability results from our two pilot studies align with those of previous work, particularly [8, 13], in that the application greatly attracts children's attention, which might promote learning at their own pace outside the classroom. Feedback from both special education teachers and parents highlighted the importance of learning while playing and of learning at any time and in any place, not only for children with autism but also for those with other special needs.

However, the accuracy of the offline object recognition model significantly compromised the acceptability and general applicability of our application; how to balance accuracy against a lightweight footprint is therefore crucial. Our current plan is to integrate a reinforcement learning module that takes user inputs, such as the manual corrections mentioned in Section 3.1, so as to further train the existing model.
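As an illustration of one possible direction, the sketch below logs user corrections as labeled examples that could later be used to fine-tune the recognizer or to derive reward signals; the file layout and field names are hypothetical, not a committed design.

```python
# A rough sketch of the feedback loop we are considering: user corrections
# are logged on the device as labeled examples that could later be used to
# fine-tune the recognizer (or as reward signals in a reinforcement
# learning setup). The file layout and field names are hypothetical.
import json
import time

CORRECTIONS_LOG = "corrections.jsonl"

def log_correction(image_path, predicted_label, corrected_label):
    """Append one user-supplied correction to an on-device log."""
    record = {
        "timestamp": time.time(),
        "image": image_path,
        "predicted": predicted_label,
        "corrected": corrected_label,
    }
    with open(CORRECTIONS_LOG, "a", encoding="utf-8") as f:
        f.write(json.dumps(record, ensure_ascii=False) + "\n")
```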

Despite this, to the best of our knowledge our application is among the first of its kind and offers valuable insights into the design of such mobile learning applications, particularly those that facilitate learning at any time and in any place.