loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Authors: Dávid Sztahó 1 ; Miklós Gábriel Tulics 1 ; Jinzi Qi 2 ; Hugo Van Hamme 2 and Klára Vicsi 1

Affiliations: 1 Department of Telecommunication and Media Informatics, Budapest University of Technology and Economics, Budapest, Hungary ; 2 KU Leuven, Department Electrical Engineering ESAT-PSI, Leuven, Belgium

Keyword(s): Dysphonia, Speech, Machine Learning, Speech Disorder, SVM.

Abstract: Dysphonic voices can be detected using features derived from speech samples. Works aiming at this topic usually deal with mono-lingual experiments using a speech dataset in a single language. The present paper targets extension to a cross-lingual scenario. A Hungarian and a Dutch speech dataset are used. Automatic binary separation of normal and dysphonic speech and dysphonia severity level estimation are performed and evaluated by various metrics. Various speech features are calculated specific to an entire speech sample and to a given phoneme. Feature selection and model training is done on Hungarian and evaluated on the Dutch dataset. The results show that cross-lingual detection of dysphonic speech may be possible on the applied corpora. It was found that cross-lingual detection of dysphonic speech is indeed possible with acceptable generalization ability, while features calculated on phoneme-level parts of speech can improve the results. Considering cross-lingual classification test sets, 0.86 and 0.81 highest F1-scores can be achieved for feature sets with the vowel /E/ included and excluded, respectively and 0.72 and 0.65 highest Pearson correlations can be achieved or severity prediction using features sets with the vowel /E/ included and excluded, respectively. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 13.59.231.155

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Sztahó, D.; Tulics, M.; Qi, J.; Van Hamme, H. and Vicsi, K. (2022). Cross-lingual Detection of Dysphonic Speech for Dutch and Hungarian Datasets. In Proceedings of the 15th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2022) - BIOSIGNALS; ISBN 978-989-758-552-4; ISSN 2184-4305, SciTePress, pages 215-220. DOI: 10.5220/0010890200003123

@conference{biosignals22,
author={Dávid Sztahó. and Miklós Gábriel Tulics. and Jinzi Qi. and Hugo {Van Hamme}. and Klára Vicsi.},
title={Cross-lingual Detection of Dysphonic Speech for Dutch and Hungarian Datasets},
booktitle={Proceedings of the 15th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2022) - BIOSIGNALS},
year={2022},
pages={215-220},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010890200003123},
isbn={978-989-758-552-4},
issn={2184-4305},
}

TY - CONF

JO - Proceedings of the 15th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2022) - BIOSIGNALS
TI - Cross-lingual Detection of Dysphonic Speech for Dutch and Hungarian Datasets
SN - 978-989-758-552-4
IS - 2184-4305
AU - Sztahó, D.
AU - Tulics, M.
AU - Qi, J.
AU - Van Hamme, H.
AU - Vicsi, K.
PY - 2022
SP - 215
EP - 220
DO - 10.5220/0010890200003123
PB - SciTePress