research-article

Towards a Prosodic Model for Synthesized Speech of Mathematical Expressions in MathML

Authors:
Adriana Souza

Doctoral Program in Digital Media Faculty of Engineering - University of Porto Porto Portugal Bahia Federal Institute of Education Science and Technology Porto Seguro Brazil, BR

Doctoral Program in Digital Media Faculty of Engineering - University of Porto Porto Portugal Bahia Federal Institute of Education Science and Technology Porto Seguro Brazil, BR
View Profile

,
Diamantino Freitas

Department of Electrical and Computer EngineeringFaculty of Engineering - University of PortoPorto Portugal, PT

Department of Electrical and Computer EngineeringFaculty of Engineering - University of PortoPorto Portugal, PT
View Profile

DSAI '20: Proceedings of the 9th International Conference on Software Development and Technologies for Enhancing Accessibility and Fighting Info-exclusionDecember 2020Pages 105–110https://doi.org/10.1145/3439231.3440617

Published:09 June 2021Publication History

DSAI '20: Proceedings of the 9th International Conference on Software Development and Technologies for Enhancing Accessibility and Fighting Info-exclusion

Pages 105–110

ABSTRACT

The use of the MathML language made possible to improve the accessibility of mathematics for blind or low-vision persons in digital media. Synthetic speech technologies have advanced significantly using MathML, however, the speech synthesizers' standard reading style is still not suitable for mathematics. Making mathematical reading of the speech synthesizers more natural and expressive is still a challenge. The creation of models to produce the appropriate prosody in the synthesized speech of math content is therefore necessary, as shown in previous research. This article presents a proposal for a model to improve prosody in the synthesized speech of mathematical expressions based on MathML. A corpus of mathematical expressions spoken by Mathematics teachers was created to support the model's development. The Fujisaki intonation model was adopted for intonation control, accent and phrase commands have been extracted from the corpus, and some adjustments have been made to manipulate prosodic parameters in the speech of mathematical expression in correlation with the MathML tree; additionally, a pattern of pauses control is being created.

References

Helder Ferreira and Diamantino Freitas. 2004. Enhancing the accessibility of mathematics for blind people: The audiomath project. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 3118: 678–685. https://doi.org/10.1007/978-3-540-27817-7_101Google Scholar
Neil Soiffer. 2018. The Benetech math editor: An inclusive multistep math editor for solving problems. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 565–572. https://doi.org/10.1007/978-3-319-94277-3_88Google ScholarDigital Library
Islam Elkabani and Rached Zantout. 2016. A framework for helping the visually impaired learn and practice math. In 2015 5th International Conference on Information and Communication Technology and Accessibility, ICTA 2015. https://doi.org/10.1109/ICTA.2015.7426909Google Scholar
Adriana Souza and Diamantino Freitas. 2018. Tecnologias Assistivas para Apoiar o Ensino e Aprendizagem de Pessoas com Deficiência Visual na Matemática: Uma Revisão Sistemática da Literatura. In Anais do XXIX Simpósio Brasileiro de Informática na Educação (SBIE 2018), 923. https://doi.org/10.5753/cbie.sbie.2018.923Google ScholarCross Ref
Enda Bates and Dónal Fitzpatrick. 2010. Spoken mathematics using prosody, earcons and spearcons. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 407–414. https://doi.org/10.1007/978-3-642-14100-3_61Google Scholar
Jinfu Ni, Shinsuke Sakai, Tohru Shimizu, and Satoshi Nakamura. 2008. Frequency modulation technique for prosodic modification. In Proceedings - 2008 6th International Symposium on Chinese Spoken Language Processing, ISCSLP 2008, 117–120. https://doi.org/10.1109/CHINSL.2008.ECP.41Google ScholarCross Ref
Adriana Souza and Diamantino Freitas. 2019. Technologies in Mathematics teaching: A transcript of the voices of visually impaired students, braille teachers, and screen readers. In 2019 International Symposium on Computers in Education, SIIE 2019. https://doi.org/10.1109/SIIE48397.2019.8970140Google ScholarCross Ref
Hiroya Fujisaki. 2004. Information, prosody, and modeling-with emphasis on tonal features of speech. In Speech Prosody 2004, International ConferenceGoogle Scholar
Hansjörg Mixdorff. 2015. Extraction, Analysis and Synthesis of Fujisaki model Parameters. . Springer, Berlin, Heidelberg, 35–47. https://doi.org/10.1007/978-3-662-45258-5_3Google Scholar
Raquel Meister, Ko Freitag, and Luciana Lucente. 2017. Prosódia da fala: pesquisa e ensino. Editora: Edgard Blücher.Google Scholar
Alessandro Mazzei, Michele Monticone, and Cristian Bernareggi. 2019. Using NLG for speech synthesis of mathematical sentences. In INLG 2019 - 12th International Conference on Natural Language Generation, Proceedings of the Conference, 463–472. https://doi.org/10.18653/v1/w19-8658Google ScholarCross Ref
Lois Frankel, Beth Brownstein, Neil Soiffer, and Eric Hansen. 2016. Development and Initial Evaluation of the ClearSpeak Style for Automated Speaking of Algebra. ETS Research Report Series 2016, 2: 1–43. https://doi.org/10.1002/ets2.12103Google ScholarCross Ref
Marcus Vinicius Moreira Martins and Waldemar Ferreira Netto. 2017. Os Limiares de Diferenciação Tonal do Português Brasileiro. Revista do GEL 14, 2: 157–182. https://doi.org/10.21165/gel.v14i2.1762Google Scholar

Recommendations

A Dynamic Model for Pauses in the Synthesized Speech of Mathematical Expressions in MathML
ICETC '22: Proceedings of the 14th International Conference on Education Technology and Computers

Voice synthesizers still present several challenges in the speech of mathematical content, as spoken mathematics has quite peculiar rules. In the synthesized speech, pauses help blind and visually impaired students identify the limits of mathematical ...
Read More
Evaluating prosodic cues as a means to disambiguate algebraic expressions: an empirical study
Assets '09: Proceedings of the 11th international ACM SIGACCESS conference on Computers and accessibility

The automatic translation of written mathematical expressions to their spoken equivalent is a difficult task. Written mathematics makes use of specialized symbols and a 2-dimensional layout that is hard to translate into clear and unambiguous spoken ...
Read More
Helping Those with Visual Impairments Read Mathematics: A Spatial Approach
PETRA '22: Proceedings of the 15th International Conference on PErvasive Technologies Related to Assistive Environments

Though many tools have been designed for low vision mathematics readers, mathematical literacy among individuals with blindness or severe visual impairment (IBSVI) remains astonishingly low. In this paper we present a novel system to facilitate access ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in

DSAI '20: Proceedings of the 9th International Conference on Software Development and Technologies for Enhancing Accessibility and Fighting Info-exclusion
December 2020
245 pages
ISBN:9781450389372
DOI:10.1145/3439231

Copyright © 2020 ACM
Publication rights licensed to ACM. ACM acknowledges that this contribution was authored or co-authored by an employee, contractor or affiliate of a national government. As such, the Government retains a nonexclusive, royalty-free right to publish or reproduce this article, or to allow others to do so, for Government purposes only.
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 9 June 2021
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
Accessibility
Mathematics
Visual Impairment
Qualifiers
- research-article
- Research
- Refereed limited
Conference

Acceptance Rates
Overall Acceptance Rate17of23submissions,74%
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 92
  Total Downloads
- Downloads (Last 12 months)35
- Downloads (Last 6 weeks)1
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format

Towards a Prosodic Model for Synthesized Speech of Mathematical Expressions in MathML

DSAI '20: Proceedings of the 9th International Conference on Software Development and Technologies for Enhancing Accessibility and Fighting Info-exclusion

ABSTRACT

References

Cited By

Recommendations

A Dynamic Model for Pauses in the Synthesized Speech of Mathematical Expressions in MathML

Evaluating prosodic cues as a means to disambiguate algebraic expressions: an empirical study

Helping Those with Visual Impairments Read Mathematics: A Spatial Approach

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

HTML Format

Caption

Towards a Prosodic Model for Synthesized Speech of Mathematical Expressions in MathML

DSAI '20: Proceedings of the 9th International Conference on Software Development and Technologies for Enhancing Accessibility and Fighting Info-exclusion

ABSTRACT

References

Cited By

Recommendations

A Dynamic Model for Pauses in the Synthesized Speech of Mathematical Expressions in MathML

Evaluating prosodic cues as a means to disambiguate algebraic expressions: an empirical study

Helping Those with Visual Impairments Read Mathematics: A Spatial Approach

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

HTML Format

Share this Publication link

Share on Social Media