Abstract
Using dataset analysis as a research method is becoming more popular among many researchers with diverse data collection and analysis backgrounds. This paper provides the first publicly available dataset consisting of audio segments and appropriate textual transcription in the Macedonian language. It is appropriately preprocessed and prepared for direct utilization in the automatic speech recognition pipelines. The dataset was created by students at the Faculty of Computer Science and Engineering as part of the elective course, ‘Digital Libraries’, with the audio segments sourced from a YouTube channel.
Supported by Faculty of Computer Science and Engineering, Skopje, N. Macedonia.
M. Mishev, B. Penkova, M. Mitreska, M. Kostoska, A, Todorovska, M. Simjanoska, and K. Mishev —Equal Contribution.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Ardila, R., et al.: Common voice: a massively-multilingual speech corpus. arXiv preprint arXiv:1912.06670 (2019)
Besacier, L., Barnard, E., Karpov, A., Schultz, T.: Automatic speech recognition for under-resourced languages: a survey. Speech Commun. 56, 85–100 (2014)
Datasets, H.F.: Oscar: a large-scale multilingual benchmark for evaluating cross-lingual transfer (2021). https://paperswithcode.com/dataset/oscar. Accessed: [date]
Forsberg, M.: Why is speech recognition difficult. Chalmers University of Technology (2003)
Friedman, V.A.: Macedonian: a south slavic language. In: Kortmann, B., van der Auwera, J. (eds.) The Slavic Languages, pp. 441–483. Cambridge University Press (2019)
Reddy, D.R.: Speech recognition by machine: a review. Proc. IEEE 64(4), 501–531 (1976)
Rousseau, A., Deléglise, P., Esteve, Y.: Ted-lium: an automatic speech recognition dedicated corpus. In: LREC, pp. 125–129 (2012)
Sarikaya, R., Hinton, G.E., Weninger, F.F.: Deep neural network language models for speech recognition. IEEE/ACM Trans. Audio, Speech, Lang. Process. 22(4), 800–814 (2014)
Valk, J., Alumäe, T.: Voxlingua107: a dataset for spoken language recognition. In: 2021 IEEE Spoken Language Technology Workshop (SLT), pp. 652–658. IEEE (2021)
Acknowledgements
This work was partially financed by the Faculty of Computer Science and Engineering at the Ss. Cyril and Methodius University in Skopje.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Mishev, M. et al. (2024). MakedonASRDataset - A Dataset for Speech Recognition in the Macedonian Language. In: Mihova, M., Jovanov, M. (eds) ICT Innovations 2023. Learning: Humans, Theory, Machines, and Data. ICT Innovations 2023. Communications in Computer and Information Science, vol 1991. Springer, Cham. https://doi.org/10.1007/978-3-031-54321-0_2
Download citation
DOI: https://doi.org/10.1007/978-3-031-54321-0_2
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-54320-3
Online ISBN: 978-3-031-54321-0
eBook Packages: Computer ScienceComputer Science (R0)