ABSTRACT
The MathML language has made it possible to improve the accessibility of mathematics in digital media for blind and low-vision people. Synthetic speech technologies that use MathML have advanced significantly; however, the standard reading style of speech synthesizers is still not suitable for mathematics, and making their reading of mathematics more natural and expressive remains a challenge. As previous research has shown, models are therefore needed to produce appropriate prosody in the synthesized speech of mathematical content. This article proposes a model to improve prosody in the synthesized speech of mathematical expressions based on MathML. A corpus of mathematical expressions spoken by mathematics teachers was created to support the model's development. The Fujisaki intonation model was adopted for intonation control: accent and phrase commands were extracted from the corpus, and adjustments were made to manipulate the prosodic parameters of spoken mathematical expressions in correlation with the MathML tree. Additionally, a pattern for controlling pauses is being developed.
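For readers unfamiliar with the Fujisaki intonation model named in the abstract, the following is a minimal sketch of its standard formulation, not the authors' implementation. It computes the fundamental frequency as a base frequency plus the summed responses to phrase commands (impulses) and accent commands (pedestals) in the log domain. The function names, the default constants (alpha, beta, gamma), and the command values in the usage lines are illustrative assumptions and are not taken from the article or its corpus.

```python
import math

def phrase_response(t, alpha=2.0):
    """Impulse response of the phrase component, Gp(t); zero before onset."""
    if t < 0:
        return 0.0
    return alpha * alpha * t * math.exp(-alpha * t)

def accent_response(t, beta=20.0, gamma=0.9):
    """Step response of the accent component, Ga(t), capped at gamma."""
    if t < 0:
        return 0.0
    return min(1.0 - (1.0 + beta * t) * math.exp(-beta * t), gamma)

def fujisaki_f0(t, fb, phrase_cmds, accent_cmds,
                alpha=2.0, beta=20.0, gamma=0.9):
    """Return F0 in Hz at time t (seconds).

    fb          -- base frequency in Hz
    phrase_cmds -- list of (onset_time, amplitude) pairs
    accent_cmds -- list of (onset_time, offset_time, amplitude) triples
    alpha, beta, gamma are assumed typical constants, not fitted values.
    """
    log_f0 = math.log(fb)
    for t0, ap in phrase_cmds:
        log_f0 += ap * phrase_response(t - t0, alpha)
    for t1, t2, aa in accent_cmds:
        rise = accent_response(t - t1, beta, gamma)
        fall = accent_response(t - t2, beta, gamma)
        log_f0 += aa * (rise - fall)
    return math.exp(log_f0)

# Hypothetical commands: one phrase command at 0 s and one accent
# command between 0.2 s and 0.5 s (values chosen for illustration only).
phrase_cmds = [(0.0, 0.5)]
accent_cmds = [(0.2, 0.5, 0.4)]
contour = [fujisaki_f0(i * 0.01, 100.0, phrase_cmds, accent_cmds)
           for i in range(100)]
```

In a model such as the one the abstract describes, the timing and amplitude of these commands would be derived from the corpus and tied to nodes of the MathML tree; here they are simply hard-coded.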