Abstract
Specific reading disorders are conditions caused by neurological dysfunctions that affect the linguistic processing of printed text. Many people go untreated due to the lack of specific tools and the high cost of using proprietary software; however, new audio signal processing technologies can help identify genetic pathologies. The methodology developed by medical specialists extracts characteristics from the reading of a text aloud and returns evidence of dyslexia. This work proposes an improvement of the research presented in [25], extracting new features and improvements serving as a tool for dyslexia indication efficiently. The analysis is done in recordings of the reading of pre-defined texts with school-age children. Direct and indirect characteristics of the audio signal are extracted. The direct ones are obtained through the methodology of separation of pauses and syllables. Simultaneously, the indirect characteristics are extracted through the alignment of audio signals, the Hidden Markov Model, and some heuristics of improvement. The indication of the probability of dyslexia is performed using a machine learning algorithm. The tests were compared with the specialist’s classification, obtaining high accuracy on the evidence of dyslexia. The difference between the values of the characteristics collected automatically and manually was below 20% for most features. Finally, the results show a promising research area for audio signal processing concerning the aid to specialists in the decision making related to language pathologies.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Al-Barhamtoshy, H.M., Motaweh, D.M.: Diagnosis of dyslexia using computation analysis. In: 2017 International Conference on Informatics, Health & Technology (ICIHT), pp. 1–7. IEEE (2017)
Alghabban, W.G., Salama, R.M., Altalhi, A.H.: Mobile cloud computing: an effective multimodal interface tool for students with dyslexia. Comput. Hum. Behav. 75, 160–166 (2017)
Alves, L.M.: A prosódia na leitura da criança disléxica. Ph.D. thesis, Universidade Federal de Minas Gerais - Faculdade de Letras, Belo Horizonte, May 2007. www.bibliotecadigital.ufmg.br/dspace/bitstream/
Alves, L.M., da Conceição Reis, C.A., Pinheiro, Â.M.V., Capellini, S.A.: Aspectos prosódicos temporais da leitura de escolares com dislexia do desenvolvimento. Revista da Sociedade Brasileira de Fonoaudiologia 14(2), 197–204 (2009). http://www.scielo.br/pdf/rsbf/v14n2/10.pdf
Van den Audenaeren, L., et al.: DYSL-X: design of a tablet game for early risk detection of dyslexia in preschoolers. In: Schouten, B., Fedtke, S., Bekker, T., Schijven, M., Gekker, A. (eds.) Games for Health, pp. 257–266. Springer, Wiesbaden (2013). https://doi.org/10.1007/978-3-658-02897-8_20
Barbedo, J.G.A., Lopes, A.: Discriminador voz/música baseado na estimação de múltiplas frequências fundamentais. IEEE Lat. Am. Trans. 5(5), 294–300 (2007)
Bartolomé, N.A., Zorrilla, A.M., Zapirain, B.G.: Dyslexia diagnosis in reading stage though the use of games at school. CGmaes 2012: The 17th International Conference on Computer Games, pp. 12–16 (2012)
Behlau, M.P.: Voz: o livro do especialista, vol. 1. Revinter (2001)
Breznitz, Z., Leikin, M.: Effects of accelerated reading rate on processing words’ syntactic functions by normal and dyslexic readers: event related potentials evidence. J. Genet. Psychol. 162, 276–296 (2001)
Brognaux, S., Drugman, T.: HMM-based speech segmentation: improvements of fully automatic approaches. IEEE/ACM Trans. Audio Speech Lang. Proces. 24(1), 5–15 (2016)
Cano, P., Loscos, A., Bonada, J.: Score performance matching using HMMs. In: Proceedings of the International Computer Music Conference, San Francisco, pp. 441–444 (1999)
Deuschle, V.P., Cechella, C.: O déficit em consciência fonológica e sua relação com a dislexia: diagnóstico e intervenção. Revista CEFAC - Speech Lang. Hear. Sci. Educ. J. 11(Supl 2), 194–200 (2009)
Drigas, A.S., Politi-Georgousi, S.: ICTs as a distinct detection approach for dyslexia screening: a contemporary view. IJOE: Int. J. Online Biomed. Eng. 15(13), 46–60 (2019)
Fellow, L.R.R.: A tutorial on hidden Markov models and selected applications in speech recognition. IEEE 77(2), 257–286 (1989)
Geurts, L., et al.: DIESEL-X: a game-based tool for early risk detection of dyslexia in preschoolers. In: Torbeyns, J., Lehtinen, E., Elen, J. (eds.) Describing and Studying Domain-Specific Serious Games. AGL, pp. 93–114. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-20276-1_7
Gusso, G., Lopes, J.M.C.: Tratado de Medicina de FamÃlia e Comunidade: PrincÃpios, Formação e Prática, vol. 2. Artmed (2012)
Landis, J.R., Koch, G.G.: The measurement of observer agreement for categorical data. Biometrics 33(1), 159–174 (1977)
Leon, P., Pucher, M., Yamagishi, J., Hernaez, I., Saratxaga, I.: Evaluation of speaker verification security and detection of HMM-based synthetic speech. IEEE Trans. Audio Speech Lang. Process. 20(8), 2280–2290 (2012)
Marinus, J.V.M.L., Araújo, J.M.F.R., Gomes, H.M., Costa, S.C.: On the use of cepstral coefficients and multilayer perceptron networks for vocal fold edema diagnosis. In: ITAB 2009–9th International Conference on Information Technology and Applications in Biomedicine, pp. 1–4 (2009)
Jothi Prabha, A., Bhargavi, R.: Prediction of dyslexia using machine learning—a research travelogue. In: Nath, V., Mandal, J.K. (eds.) Proceedings of the Third International Conference on Microelectronics, Computing and Communication Systems. LNEE, vol. 556, pp. 23–34. Springer, Singapore (2019). https://doi.org/10.1007/978-981-13-7091-5_3
Prates, L.P.C.S., Martins, V.O.: Distúrbios da fala e da linguagem na infância. Revista de Medicina de Minas Gerais 21(4), 54–60 (2011)
Rahman, A., Hassanain, E., Rashid, M., Barnes, S.J., Hossain, M.S.: Spatial blockchain-based secure mass screening framework for children with dyslexia. IEE Access: Spec. Sect. Mob. Multimed. Healthc. 6, 61876–61885 (2018)
Rahman, M.A., Hassanain, E., Rashid, M.M., Barnes, S.J., Hossain, M.S.: Spatial blockchain-based secure mass screening framework for children with dyslexia. IEEE Access: Multidiscip. Open Access J. 6, 61876–61885 (2018)
Rello, L., Romero, E., Ali, A., Williams, K., Rauschenberger, M., Bigham, J.P., White, N.C.: Screening dyslexia for English using HCI measures and machine learning. In: DH 2018: 2018 International Digital Health Conference, pp. 23–26 (2018)
Ribeiro, F.M., Pereira Jr., A.R., Paiva, D.M.B., Alves, L.M., Bianchi, A.G.C.: Early dyslexia evidences using speech features. In: Proceedings of the 22nd International Conference on Enterprise Information Systems, ICEIS, vol. 1, pp. 640–647. INSTICC, SciTePress (2020). https://doi.org/10.5220/0009574906400647
Santos, M.C.S.: Disvoice: Aplicativo de apoio à Fonoaudiologia para dispositivos móveis. Mathesis, Fundação de Ensino EurÃpides Soares da Rocha - UNIVEM (2013)
Shaywitz, S.: Entendendo a dislexia : um novo e completo programa para todos os nÃveis de problemas de leitura. Artmed, Porto Alegre, 1 edn. (2006). trad. sob a direção de Vinicius Figueira
Shrestha, S., Murano, P.: An algorithm for automatically detecting dyslexia on the fly. Intl. J. Comput. Sci. Inf. Technol. (IJCSIT) 10(3), 1–18 (2018)
Sidhu, M.S., Manzura, E.: An effective conceptual multisensory multimedia model to support dyslexic children in learning. IJICTE - Int. J. Inf. Commun. Technol. Educ. 7(3), 34–50 (2011)
Silva, E.L.F., Oliveira, H.M.: Implementação de um algoritmo de divisão silábica automática para arquivos de fala na lÃngua portuguesa. Anais do XIX Congresso Brasileiro de Automática, CBA 2012, pp. 4161–4166 (2012). www2.ee.ufpe.br/codec/CBA2012_vf.pdf
Zarim, A., Azimah, N.: Android based dyslexia early screening test. Ph.D. thesis, UTeM (2016)
Zavaleta, J., Costa, R.J.M., da Cruz, S.M.S., Manhaes, M., Alfredo, L., Mousinho, R.: Dysdtool: Uma ferramenta inteligente para a avaliação e intervenção no apoio ao diagnóstico da dislexia. CSBC (2012) XXXII Congresso da Sociedade Brasileira de Computacao: XII WorKshop de Informatica Medica, WIM 2012 (2012)
Acknowledgements
This study was financed by Fundação de Amparo à Pesquisa do Estado de Minas Gerais (FAPEMIG) and Universidade Federal de Ouro Preto (UFOP).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Ribeiro, F.M., Pereira, A.R., Paiva, D.M.B., Alves, L.M., Bianchi, A.G.C. (2021). Extraction of Speech Features and Alignment to Detect Early Dyslexia Evidences. In: Filipe, J., Śmiałek, M., Brodsky, A., Hammoudi, S. (eds) Enterprise Information Systems. ICEIS 2020. Lecture Notes in Business Information Processing, vol 417. Springer, Cham. https://doi.org/10.1007/978-3-030-75418-1_15
Download citation
DOI: https://doi.org/10.1007/978-3-030-75418-1_15
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-75417-4
Online ISBN: 978-3-030-75418-1
eBook Packages: Computer ScienceComputer Science (R0)