skip to main content
10.1145/3439231.3440617acmotherconferencesArticle/Chapter ViewAbstractPublication PagesdsaiConference Proceedingsconference-collections
research-article

Towards a Prosodic Model for Synthesized Speech of Mathematical Expressions in MathML

Authors Info & Claims
Published:09 June 2021Publication History

ABSTRACT

The use of the MathML language made possible to improve the accessibility of mathematics for blind or low-vision persons in digital media. Synthetic speech technologies have advanced significantly using MathML, however, the speech synthesizers' standard reading style is still not suitable for mathematics. Making mathematical reading of the speech synthesizers more natural and expressive is still a challenge. The creation of models to produce the appropriate prosody in the synthesized speech of math content is therefore necessary, as shown in previous research. This article presents a proposal for a model to improve prosody in the synthesized speech of mathematical expressions based on MathML. A corpus of mathematical expressions spoken by Mathematics teachers was created to support the model's development. The Fujisaki intonation model was adopted for intonation control, accent and phrase commands have been extracted from the corpus, and some adjustments have been made to manipulate prosodic parameters in the speech of mathematical expression in correlation with the MathML tree; additionally, a pattern of pauses control is being created.

References

  1. Helder Ferreira and Diamantino Freitas. 2004. Enhancing the accessibility of mathematics for blind people: The audiomath project. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 3118: 678–685. https://doi.org/10.1007/978-3-540-27817-7_101Google ScholarGoogle Scholar
  2. Neil Soiffer. 2018. The Benetech math editor: An inclusive multistep math editor for solving problems. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 565–572. https://doi.org/10.1007/978-3-319-94277-3_88Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Islam Elkabani and Rached Zantout. 2016. A framework for helping the visually impaired learn and practice math. In 2015 5th International Conference on Information and Communication Technology and Accessibility, ICTA 2015. https://doi.org/10.1109/ICTA.2015.7426909Google ScholarGoogle Scholar
  4. Adriana Souza and Diamantino Freitas. 2018. Tecnologias Assistivas para Apoiar o Ensino e Aprendizagem de Pessoas com Deficiência Visual na Matemática: Uma Revisão Sistemática da Literatura. In Anais do XXIX Simpósio Brasileiro de Informática na Educação (SBIE 2018), 923. https://doi.org/10.5753/cbie.sbie.2018.923Google ScholarGoogle ScholarCross RefCross Ref
  5. Enda Bates and Dónal Fitzpatrick. 2010. Spoken mathematics using prosody, earcons and spearcons. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 407–414. https://doi.org/10.1007/978-3-642-14100-3_61Google ScholarGoogle Scholar
  6. Jinfu Ni, Shinsuke Sakai, Tohru Shimizu, and Satoshi Nakamura. 2008. Frequency modulation technique for prosodic modification. In Proceedings - 2008 6th International Symposium on Chinese Spoken Language Processing, ISCSLP 2008, 117–120. https://doi.org/10.1109/CHINSL.2008.ECP.41Google ScholarGoogle ScholarCross RefCross Ref
  7. Adriana Souza and Diamantino Freitas. 2019. Technologies in Mathematics teaching: A transcript of the voices of visually impaired students, braille teachers, and screen readers. In 2019 International Symposium on Computers in Education, SIIE 2019. https://doi.org/10.1109/SIIE48397.2019.8970140Google ScholarGoogle ScholarCross RefCross Ref
  8. Hiroya Fujisaki. 2004. Information, prosody, and modeling-with emphasis on tonal features of speech. In Speech Prosody 2004, International ConferenceGoogle ScholarGoogle Scholar
  9. Hansjörg Mixdorff. 2015. Extraction, Analysis and Synthesis of Fujisaki model Parameters. . Springer, Berlin, Heidelberg, 35–47. https://doi.org/10.1007/978-3-662-45258-5_3Google ScholarGoogle Scholar
  10. Raquel Meister, Ko Freitag, and Luciana Lucente. 2017. Prosódia da fala: pesquisa e ensino. Editora: Edgard Blücher.Google ScholarGoogle Scholar
  11. Alessandro Mazzei, Michele Monticone, and Cristian Bernareggi. 2019. Using NLG for speech synthesis of mathematical sentences. In INLG 2019 - 12th International Conference on Natural Language Generation, Proceedings of the Conference, 463–472. https://doi.org/10.18653/v1/w19-8658Google ScholarGoogle ScholarCross RefCross Ref
  12. Lois Frankel, Beth Brownstein, Neil Soiffer, and Eric Hansen. 2016. Development and Initial Evaluation of the ClearSpeak Style for Automated Speaking of Algebra. ETS Research Report Series 2016, 2: 1–43. https://doi.org/10.1002/ets2.12103Google ScholarGoogle ScholarCross RefCross Ref
  13. Marcus Vinicius Moreira Martins and Waldemar Ferreira Netto. 2017. Os Limiares de Diferenciação Tonal do Português Brasileiro.  Revista do GEL 14, 2: 157–182. https://doi.org/10.21165/gel.v14i2.1762Google ScholarGoogle Scholar

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Other conferences
    DSAI '20: Proceedings of the 9th International Conference on Software Development and Technologies for Enhancing Accessibility and Fighting Info-exclusion
    December 2020
    245 pages
    ISBN:9781450389372
    DOI:10.1145/3439231

    Copyright © 2020 ACM

    Publication rights licensed to ACM. ACM acknowledges that this contribution was authored or co-authored by an employee, contractor or affiliate of a national government. As such, the Government retains a nonexclusive, royalty-free right to publish or reproduce this article, or to allow others to do so, for Government purposes only.

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 9 June 2021

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article
    • Research
    • Refereed limited

    Acceptance Rates

    Overall Acceptance Rate17of23submissions,74%
  • Article Metrics

    • Downloads (Last 12 months)35
    • Downloads (Last 6 weeks)1

    Other Metrics

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format