An Exploration of Automatic Speech Recognition Within a Nordic Context

Vase, Silja; Berget, Gerd

doi:10.1007/978-3-031-35681-0_19

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14020))

Included in the following conference series:

International Conference on Human-Computer Interaction

810 Accesses
2 Altmetric

Abstract

Automatic speech recognition (ASR) has become a familiar input component for voice user faces, such as texting, preparing structured documents, searching, and voice commands. Inclusive ASR is a premise for a sustainable HCI that supports all types of languages and users. ASR is quite robust for majority languages but still needs to be adequate for smaller languages. Moreover, specific user groups cannot use ASR due to, e.g., speech impediments or accents. This paper discusses the possibilities and limitations of ASR in its current form within a Nordic context raising the following questions: i) What is the state of ASR in terms of usability for smaller and minority languages, and ii) How is ASR discussed regarding future development and inclusion? The current research discusses such topics within the lens of ability-based participatory design established on exploratory data collection. Among other findings, the paper stresses that small and minority languages are neglected in ASR development from an international perspective. Consequently, these languages risk exclusion from the digital development of language technology.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

UN: Universal declaration of human rights. United Nations, New York (2015)
Google Scholar
Directorate-General of the UNESCO: Intersectional Mid-Term Strategy on Languages and Multilingualism. United Nations, New York (2007)
Google Scholar
Sutton, S.J., et al.: Voice as a design material: sociophonetic inspired design strategies in human-computer interaction. In Brewster, S., Fitzpatrick, G. (eds.) Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems. Paper 603, Association for Computing Machinery, Glasgow (2019)
Google Scholar
Dourish, P.: Where the Action is: The Foundations of Embodied Interaction. MIT Press, Cambridge (2004)
Google Scholar
De Smedt, K., et al.: The Norwegian Language in the Digital Age = Norsk i den digitale tidalderen. Springer, Berlin (2012)
Google Scholar
Jing, C.T., Chen, C.J.: A research review: How technology helps to improve the learning process of learners with dyslexia. J. Cognit. Sci. Hum. Dev. 2(2), 26–43 (2017)
Google Scholar
Alapetite, A., Boje Andersen, H., Hertzum, M.: Acceptance of speech recognition by physicians: a survey of expectations, experiences, and social influence. Int. J. Hum. Comput. Stud. 67(1), 36–49 (2009)
Article Google Scholar
Kirchmeier, S., et al.: World class language technology: developing a language technology strategy for Danish. In: Calzolari, N. (ed.) Proceedings of the 12th Language Resources and Evaluation Conference 2020, pp. 3297–3301. European Language Association, Marseille (2020)
Google Scholar
Nordic Council of Ministers: Declaration on a Nordic language policy. Nordisk ministerråd, København (2006)
Google Scholar
Christensen, L.H., Christensen, R.Z.: Dansk grammatik [Danish grammar]. Syddansk Universitetsforlag, Odense (2019)
Google Scholar
UN: Convention on the rights of the child. UN, New York (1990)
Google Scholar
Council of Europe: The European charter for regional and minority languages. Council of Europe, Strasbourg (1998)
Google Scholar
Hansen, E., Heltoft, L.: Grammatik over det danske sprog [Grammar of the Danish language]. Syddansk Universitetsforlag, Odense (2011)
Google Scholar
Language Council: Det danske ordforråd [The Danish vocabulary]. https://dsn.dk/sprogets-udvikling/det-danske-ordforrad/. Accessed 05 June 2022
Frostad, B.H., Schall, V., Holten, S.M.: Towards ASR that supports linguistic diversity in Norway. In: Choukri, K., Mariani, J., Sakti, S. (eds.) Proceedings of the Language Technologies for All (LT4All) 2019, pp. 328–331. Unesco, Paris (2019)
Google Scholar
Holmen, A.: Hvorfor styrke og revitalisere små sprog? [Why strengthen and revitalize minor languages?]. In: Niia, K.K. (ed.) Framgång för små språk [Progress for minor languages], pp. 10–15. Uppsala Arkivcentrum, Uppsala (2021)
Google Scholar
Antonsen, L., et al: Machine translation with North Saami as a pivot language. In: Tiedemann, J., Tahmasebi, N. (eds.) Proceedings of the 21st Nordic Conference of Computational Linguistics 2017, pp. 123–131. Linköping University Electronic Press, Gothenburg (2017)
Google Scholar
Laxström, N., Wilcock, G., Jokinen, K.: Internationalisation and localisation of spoken dialogue systems. In: Jokinen, K., Wilcock, G. (eds.) Dialogues with Social Robots. Lecture Notes in Electrical Engineering, vol. 427, pp. 207–219. Springer, Singapore (2017). https://doi.org/10.1007/978-981-10-2585-3_16
Fournier, H.: New trends in HCI research and development: a cautionary tale. In: Carliner, S. (ed.) E-Learn: World Conference on E-Learning in Corporate, Government, Healthcare, and Higher Education, 2018, pp. 1349–1355. Association for the Advancement of Computing in Education, Waynesville (AACE) (2018)
Google Scholar
Curry, A., Flett, P., Hollingsworth, I.: Managing Information & Systems: The Business Perspective. Routledge, London (2006)
Book Google Scholar
Lewis, J.R.: Practical Speech User Interface Design. CRC Press, Boca Rotan (2016)
Google Scholar
Rosen, K., Yampolsky, S.: Automatic speech recognition and a review of its functioning with dysarthric speech. Augment. Altern. Commun. 16(1), 48–60 (2000)
Article Google Scholar
Ogundokun, R.O., et al.: Speech recognition system: overview of the state-of-the-arts. Int. J. Eng. Res. Technol. 13(3), 384–392 (2020)
Article Google Scholar
Kirkedal, A.: Danish Stød and Automatic Speech Recognition. Copenhagen Business School, Copenhagen (2016)
Google Scholar
Hjortdal, A., Frid, J., Roll, M.: Phonetic and phonological cues to prediction: neurophysiology of Danish stød. J. Phon. 94, 1–15 (2022)
Article Google Scholar
Vase, S.: How workarounds occur in relation to automatic speech recognition at Danish hospitals. In: Kurosu, M. (ed.) HCII 2021. LNCS, vol. 12764, pp. 458–472. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-78468-3_31
Chapter Google Scholar
Nass, C.I., Brave, S.: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship. MIT Press, Cambridge (2005)
Google Scholar
Koenecke, A., et al.: Racial disparities in automated speech recognition. Proc. Natl. Acad. Sci. 117(14), 7684–7689 (2020)
Article Google Scholar
Singh, A.P., Nath, R., Kumar, S.: A survey: speech recognition approaches and techniques. In: Yadav, D. (ed.) 2018 5th IEEE Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON) 2018, pp. 563–567. IEEE, New Jersey (2018)
Google Scholar
Xiong, W., et al.: The Microsoft 2017 conversational speech recognition system. In: Hayes, M., Hanseok, K. (eds.) 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2018, pp. 5934–5938. IEEE, New Jersey (2018)
Google Scholar
Ministry of Local Government and Modernization: One digital public sector: Digital strategy for the public sector 2019–2025. Ministry of Local Government and Modernization, Oslo (2019)
Google Scholar
Teknologirådet: Kunstig intelligens: Muligheter, utfordringer og en plan for Norge [Artificial intelligence: Opportunities, challenges and a plan for Norway]. Teknologirådet, Oslo (2018)
Google Scholar
Lithari, E.: Fractured academic identities: dyslexia, secondary education, self-esteem and school experiences. Int. J. Incl. Educ. 23(3), 280–296 (2019)
Article Google Scholar
Adams, W.R.: High-accuracy detection of early Parkinson’s disease using multiple characteristics of finger movement while typing. PLoS ONE 12(11), e0188226 (2017)
Article Google Scholar
Alesi, M., Battaglia, G.: Chapter six: motor development and down syndrome. In: Lanfranchi, S. (ed.) International Review of Research in Developmental Disabilities, vol. 56, pp. 169–211. Academic Press, Cambridge, Massachusetts (2019)
Google Scholar
Rocha, T., et al.: Usability evaluation of navigation tasks by people with intellectual disabilities: a google and SAPO comparative study regarding different interaction modalities. Univ. Access Inf. Soc. 16(3), 581–592 (2017)
Article Google Scholar
Sanchez, M.M., et al.: Neurobiological elements of cognitive dysfunction in down syndrome: exploring the role of APP. Biol. Psychiatry 71(5), 403–409 (2012)
Article Google Scholar
Dashtipour, K., et al.: Speech disorders in Parkinson’s disease: pathophysiology, medical management and surgical approaches. Neurodegener. Dis. Manag. 8(5), 337–348 (2018)
Article Google Scholar
Dua, T., et al.: Speech recognition technology for hearing disabled community. Int. J. Adv. Comput. Res. 4(3), 882–887 (2014)
Google Scholar
Balaji, V., Sadashivappa, G.: Speech disabilities in adults and the suitable speech recognition software tools: a review. In: Westphall, C.B., Murugesan, S., Ramesh, T. (eds.) 2015 International Conference on Computing and Network Communications (CoCoNet) 2015, pp. 559–564. IEEE, New Jersey (2015)
Chapter Google Scholar
Cave, R., Bloch, S.: The use of speech recognition technology by people living with amyotrophic lateral sclerosis: a scoping review. Disabil. Rehabil. Assist. Technol. 2021, 1–13 (2021)
Article Google Scholar
Wobbrock, J.O., et al.: Ability-based design: Concept, principles and examples. ACM Trans. Access. Comput. 3(3), Article no. 9 (2011)
Google Scholar
Desmond, D., et al.: Assistive technology and people: a position paper from the first global research, innovation and education on assistive technology (GREAT) summit. Disabil. Rehabil. Assist. Technol. 13(5), 437–444 (2018)
Article Google Scholar
Stramondo, J.A.: The distinction between curative and assistive technology. Sci. Eng. Ethics 25(4), 1125–1145 (2019)
Article Google Scholar
Steinfeld, E., Maisel, J.L.: Universal Design: Creating Inclusive Environments. Wiley, New Jersey (2012)
Google Scholar
Derman, Y.D., Arenovich, T., Strauss, J.: Speech recognition software and electronic psychiatric progress notes: physicians’ ratings and preferences. BMC Med. Inform. Decis. Mak. 10(1), 44 (2010)
Article Google Scholar
Pedersen, B.S., et al.: The Danish language in the digital age. In: Rehm, G., Uszkoreitm H. (eds.) META-NET White Paper Series: Europe’s Languages in the Digital Age. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-30627-3
Arksey, H., O’Malley, L.: Scoping studies: towards a methodological framework. Int. J. Soc. Res. Methodol. 8(1), 19–32 (2005)
Article Google Scholar
Alvesson, M., Sköldberg, K.: Reflexive Methodology: New Vistas for Qualitative Research. Sage, Los Angeles (2017)
Google Scholar
Kommunal- og distriktsdepartementet: Samisk språk og IT: Språkteknologi [Sami language and IT: Language technology]. https://www.regjeringen.no/no/tema/urfolk-og-minoriteter/samepolitikk/samiske-sprak/samisk-sprak-og-it/id86947/. Accessed 14 Apr 2022
Government: Strategi for vækst gennem deleøkonomi [Strategy for growth through a sharing economy]. Ministry of Industry, Business and Financial Affairs, Copenhagen (2017)
Google Scholar
Kommunal- og moderniseringsdepartementet.: Nasjonal strategi for kunstig intelligens [National strategy for artificial intelligence]. Kommunal- og digitaliseringsdepartementet, Oslo (2020)
Google Scholar
Government, LGD, The Danish Regions: A coherent and trustworthy health network for all: Digital health strategy 2018–2022. Ministry of Health, Ministry of Finance, Local Government Denmark and the Regions, Copenhagen (2018)
Google Scholar
Agency for Digitalization and GovTech-Program.: Challenge statement. https://challenges.dk/sites/default/files/2019-09/DIGST%20GovTech-Program%20One%20Pager%20ENG.pdf. Accessed 16 Sept 2022
Kvarenes, M.: Klar, men aldri ferdig: En praktisk veileder i klarspråksarbeid [Ready, but never finished: A practical guide to clear language]. Språkrådet og Direktoratet for forvaltning og IKT, Oslo (2010)
Google Scholar
Abdul, A., et al.: Trends and trajectories for explainable, accountable and intelligible systems: an HCI research agenda. In: Mandryk, R. Hancock, M. (eds.) CHI 2018: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems 2018, Paper 582. Association for Computing Machinery, Montreal (2018)
Google Scholar
Berget, G.: Search and Find?: An Accessibility Study of Dyslexia and Information Retrieval. University of Oslo, Oslo (2016)
Google Scholar

Download references

Author information

Authors and Affiliations

University of Copenhagen, Karen Blixens Plads 8, 2300, Copenhagen S, Denmark
Silja Vase
Oslo Metropolitan University, Postboks 4 St. Olavs plass, N-0130, Oslo, Norway
Gerd Berget

Authors

Silja Vase
View author publications
You can also search for this author in PubMed Google Scholar
Gerd Berget
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Silja Vase .

Editor information

Editors and Affiliations

Foundation for Research and Technology, Heraklion, Crete, Greece
Margherita Antona
Foundation for Research and Technology, Heraklion, Crete, Greece
Constantine Stephanidis

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Vase, S., Berget, G. (2023). An Exploration of Automatic Speech Recognition Within a Nordic Context. In: Antona, M., Stephanidis, C. (eds) Universal Access in Human-Computer Interaction. HCII 2023. Lecture Notes in Computer Science, vol 14020. Springer, Cham. https://doi.org/10.1007/978-3-031-35681-0_19

Download citation

DOI: https://doi.org/10.1007/978-3-031-35681-0_19
Published: 09 July 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-35680-3
Online ISBN: 978-3-031-35681-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics