Grapheme-to-phoneme (G2P) conversion plays an important role in speech processing applications and other fields of computational linguistics. Sinhala must have a grapheme-to-phoneme conversion for speech processing because Sinhala writing system does not always reflect its actual pronunciations. This paper describes a rule based G2P conversion method to convert Sinhala text strings into phonemic representations. We use a previously defined rule set and enhance it to get a more accurate G2P conversion. The performance of our rule-based system shows that the rule based sound patterns are effective on Sinhala G2P conversion.
Cite as: Nadungodage, T., Liyanage, C., Prerera, A., Pushpananda, R., Weerasinghe, R. (2018) Sinhala G2P Conversion for Speech Processing. Proc. 6th Workshop on Spoken Language Technologies for Under-Resourced Languages (SLTU 2018), 112-116, doi: 10.21437/SLTU.2018-24
@inproceedings{nadungodage18_sltu, author={Thilini Nadungodage and Chamila Liyanage and Amathri Prerera and Randil Pushpananda and Ruvan Weerasinghe}, title={{Sinhala G2P Conversion for Speech Processing}}, year=2018, booktitle={Proc. 6th Workshop on Spoken Language Technologies for Under-Resourced Languages (SLTU 2018)}, pages={112--116}, doi={10.21437/SLTU.2018-24} }