Abstract
Automatic speech recognition (ASR) has become a familiar input component for voice user faces, such as texting, preparing structured documents, searching, and voice commands. Inclusive ASR is a premise for a sustainable HCI that supports all types of languages and users. ASR is quite robust for majority languages but still needs to be adequate for smaller languages. Moreover, specific user groups cannot use ASR due to, e.g., speech impediments or accents. This paper discusses the possibilities and limitations of ASR in its current form within a Nordic context raising the following questions: i) What is the state of ASR in terms of usability for smaller and minority languages, and ii) How is ASR discussed regarding future development and inclusion? The current research discusses such topics within the lens of ability-based participatory design established on exploratory data collection. Among other findings, the paper stresses that small and minority languages are neglected in ASR development from an international perspective. Consequently, these languages risk exclusion from the digital development of language technology.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
UN: Universal declaration of human rights. United Nations, New York (2015)
Directorate-General of the UNESCO: Intersectional Mid-Term Strategy on Languages and Multilingualism. United Nations, New York (2007)
Sutton, S.J., et al.: Voice as a design material: sociophonetic inspired design strategies in human-computer interaction. In Brewster, S., Fitzpatrick, G. (eds.) Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems. Paper 603, Association for Computing Machinery, Glasgow (2019)
Dourish, P.: Where the Action is: The Foundations of Embodied Interaction. MIT Press, Cambridge (2004)
De Smedt, K., et al.: The Norwegian Language in the Digital Age = Norsk i den digitale tidalderen. Springer, Berlin (2012)
Jing, C.T., Chen, C.J.: A research review: How technology helps to improve the learning process of learners with dyslexia. J. Cognit. Sci. Hum. Dev. 2(2), 26–43 (2017)
Alapetite, A., Boje Andersen, H., Hertzum, M.: Acceptance of speech recognition by physicians: a survey of expectations, experiences, and social influence. Int. J. Hum. Comput. Stud. 67(1), 36–49 (2009)
Kirchmeier, S., et al.: World class language technology: developing a language technology strategy for Danish. In: Calzolari, N. (ed.) Proceedings of the 12th Language Resources and Evaluation Conference 2020, pp. 3297–3301. European Language Association, Marseille (2020)
Nordic Council of Ministers: Declaration on a Nordic language policy. Nordisk ministerråd, København (2006)
Christensen, L.H., Christensen, R.Z.: Dansk grammatik [Danish grammar]. Syddansk Universitetsforlag, Odense (2019)
UN: Convention on the rights of the child. UN, New York (1990)
Council of Europe: The European charter for regional and minority languages. Council of Europe, Strasbourg (1998)
Hansen, E., Heltoft, L.: Grammatik over det danske sprog [Grammar of the Danish language]. Syddansk Universitetsforlag, Odense (2011)
Language Council: Det danske ordforråd [The Danish vocabulary]. https://dsn.dk/sprogets-udvikling/det-danske-ordforrad/. Accessed 05 June 2022
Frostad, B.H., Schall, V., Holten, S.M.: Towards ASR that supports linguistic diversity in Norway. In: Choukri, K., Mariani, J., Sakti, S. (eds.) Proceedings of the Language Technologies for All (LT4All) 2019, pp. 328–331. Unesco, Paris (2019)
Holmen, A.: Hvorfor styrke og revitalisere små sprog? [Why strengthen and revitalize minor languages?]. In: Niia, K.K. (ed.) Framgång för små språk [Progress for minor languages], pp. 10–15. Uppsala Arkivcentrum, Uppsala (2021)
Antonsen, L., et al: Machine translation with North Saami as a pivot language. In: Tiedemann, J., Tahmasebi, N. (eds.) Proceedings of the 21st Nordic Conference of Computational Linguistics 2017, pp. 123–131. Linköping University Electronic Press, Gothenburg (2017)
Laxström, N., Wilcock, G., Jokinen, K.: Internationalisation and localisation of spoken dialogue systems. In: Jokinen, K., Wilcock, G. (eds.) Dialogues with Social Robots. Lecture Notes in Electrical Engineering, vol. 427, pp. 207–219. Springer, Singapore (2017). https://doi.org/10.1007/978-981-10-2585-3_16
Fournier, H.: New trends in HCI research and development: a cautionary tale. In: Carliner, S. (ed.) E-Learn: World Conference on E-Learning in Corporate, Government, Healthcare, and Higher Education, 2018, pp. 1349–1355. Association for the Advancement of Computing in Education, Waynesville (AACE) (2018)
Curry, A., Flett, P., Hollingsworth, I.: Managing Information & Systems: The Business Perspective. Routledge, London (2006)
Lewis, J.R.: Practical Speech User Interface Design. CRC Press, Boca Rotan (2016)
Rosen, K., Yampolsky, S.: Automatic speech recognition and a review of its functioning with dysarthric speech. Augment. Altern. Commun. 16(1), 48–60 (2000)
Ogundokun, R.O., et al.: Speech recognition system: overview of the state-of-the-arts. Int. J. Eng. Res. Technol. 13(3), 384–392 (2020)
Kirkedal, A.: Danish Stød and Automatic Speech Recognition. Copenhagen Business School, Copenhagen (2016)
Hjortdal, A., Frid, J., Roll, M.: Phonetic and phonological cues to prediction: neurophysiology of Danish stød. J. Phon. 94, 1–15 (2022)
Vase, S.: How workarounds occur in relation to automatic speech recognition at Danish hospitals. In: Kurosu, M. (ed.) HCII 2021. LNCS, vol. 12764, pp. 458–472. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-78468-3_31
Nass, C.I., Brave, S.: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship. MIT Press, Cambridge (2005)
Koenecke, A., et al.: Racial disparities in automated speech recognition. Proc. Natl. Acad. Sci. 117(14), 7684–7689 (2020)
Singh, A.P., Nath, R., Kumar, S.: A survey: speech recognition approaches and techniques. In: Yadav, D. (ed.) 2018 5th IEEE Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON) 2018, pp. 563–567. IEEE, New Jersey (2018)
Xiong, W., et al.: The Microsoft 2017 conversational speech recognition system. In: Hayes, M., Hanseok, K. (eds.) 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2018, pp. 5934–5938. IEEE, New Jersey (2018)
Ministry of Local Government and Modernization: One digital public sector: Digital strategy for the public sector 2019–2025. Ministry of Local Government and Modernization, Oslo (2019)
Teknologirådet: Kunstig intelligens: Muligheter, utfordringer og en plan for Norge [Artificial intelligence: Opportunities, challenges and a plan for Norway]. Teknologirådet, Oslo (2018)
Lithari, E.: Fractured academic identities: dyslexia, secondary education, self-esteem and school experiences. Int. J. Incl. Educ. 23(3), 280–296 (2019)
Adams, W.R.: High-accuracy detection of early Parkinson’s disease using multiple characteristics of finger movement while typing. PLoS ONE 12(11), e0188226 (2017)
Alesi, M., Battaglia, G.: Chapter six: motor development and down syndrome. In: Lanfranchi, S. (ed.) International Review of Research in Developmental Disabilities, vol. 56, pp. 169–211. Academic Press, Cambridge, Massachusetts (2019)
Rocha, T., et al.: Usability evaluation of navigation tasks by people with intellectual disabilities: a google and SAPO comparative study regarding different interaction modalities. Univ. Access Inf. Soc. 16(3), 581–592 (2017)
Sanchez, M.M., et al.: Neurobiological elements of cognitive dysfunction in down syndrome: exploring the role of APP. Biol. Psychiatry 71(5), 403–409 (2012)
Dashtipour, K., et al.: Speech disorders in Parkinson’s disease: pathophysiology, medical management and surgical approaches. Neurodegener. Dis. Manag. 8(5), 337–348 (2018)
Dua, T., et al.: Speech recognition technology for hearing disabled community. Int. J. Adv. Comput. Res. 4(3), 882–887 (2014)
Balaji, V., Sadashivappa, G.: Speech disabilities in adults and the suitable speech recognition software tools: a review. In: Westphall, C.B., Murugesan, S., Ramesh, T. (eds.) 2015 International Conference on Computing and Network Communications (CoCoNet) 2015, pp. 559–564. IEEE, New Jersey (2015)
Cave, R., Bloch, S.: The use of speech recognition technology by people living with amyotrophic lateral sclerosis: a scoping review. Disabil. Rehabil. Assist. Technol. 2021, 1–13 (2021)
Wobbrock, J.O., et al.: Ability-based design: Concept, principles and examples. ACM Trans. Access. Comput. 3(3), Article no. 9 (2011)
Desmond, D., et al.: Assistive technology and people: a position paper from the first global research, innovation and education on assistive technology (GREAT) summit. Disabil. Rehabil. Assist. Technol. 13(5), 437–444 (2018)
Stramondo, J.A.: The distinction between curative and assistive technology. Sci. Eng. Ethics 25(4), 1125–1145 (2019)
Steinfeld, E., Maisel, J.L.: Universal Design: Creating Inclusive Environments. Wiley, New Jersey (2012)
Derman, Y.D., Arenovich, T., Strauss, J.: Speech recognition software and electronic psychiatric progress notes: physicians’ ratings and preferences. BMC Med. Inform. Decis. Mak. 10(1), 44 (2010)
Pedersen, B.S., et al.: The Danish language in the digital age. In: Rehm, G., Uszkoreitm H. (eds.) META-NET White Paper Series: Europe’s Languages in the Digital Age. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-30627-3
Arksey, H., O’Malley, L.: Scoping studies: towards a methodological framework. Int. J. Soc. Res. Methodol. 8(1), 19–32 (2005)
Alvesson, M., Sköldberg, K.: Reflexive Methodology: New Vistas for Qualitative Research. Sage, Los Angeles (2017)
Kommunal- og distriktsdepartementet: Samisk språk og IT: Språkteknologi [Sami language and IT: Language technology]. https://www.regjeringen.no/no/tema/urfolk-og-minoriteter/samepolitikk/samiske-sprak/samisk-sprak-og-it/id86947/. Accessed 14 Apr 2022
Government: Strategi for vækst gennem deleøkonomi [Strategy for growth through a sharing economy]. Ministry of Industry, Business and Financial Affairs, Copenhagen (2017)
Kommunal- og moderniseringsdepartementet.: Nasjonal strategi for kunstig intelligens [National strategy for artificial intelligence]. Kommunal- og digitaliseringsdepartementet, Oslo (2020)
Government, LGD, The Danish Regions: A coherent and trustworthy health network for all: Digital health strategy 2018–2022. Ministry of Health, Ministry of Finance, Local Government Denmark and the Regions, Copenhagen (2018)
Agency for Digitalization and GovTech-Program.: Challenge statement. https://challenges.dk/sites/default/files/2019-09/DIGST%20GovTech-Program%20One%20Pager%20ENG.pdf. Accessed 16 Sept 2022
Kvarenes, M.: Klar, men aldri ferdig: En praktisk veileder i klarspråksarbeid [Ready, but never finished: A practical guide to clear language]. Språkrådet og Direktoratet for forvaltning og IKT, Oslo (2010)
Abdul, A., et al.: Trends and trajectories for explainable, accountable and intelligible systems: an HCI research agenda. In: Mandryk, R. Hancock, M. (eds.) CHI 2018: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems 2018, Paper 582. Association for Computing Machinery, Montreal (2018)
Berget, G.: Search and Find?: An Accessibility Study of Dyslexia and Information Retrieval. University of Oslo, Oslo (2016)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Vase, S., Berget, G. (2023). An Exploration of Automatic Speech Recognition Within a Nordic Context. In: Antona, M., Stephanidis, C. (eds) Universal Access in Human-Computer Interaction. HCII 2023. Lecture Notes in Computer Science, vol 14020. Springer, Cham. https://doi.org/10.1007/978-3-031-35681-0_19
Download citation
DOI: https://doi.org/10.1007/978-3-031-35681-0_19
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-35680-3
Online ISBN: 978-3-031-35681-0
eBook Packages: Computer ScienceComputer Science (R0)