Cognitively Plausible Computational Models of Lexical Processing Can Explain Variance in Human Word Predictions and Reading Times

de Vries, Wietse

doi:10.1007/978-3-030-65154-1_4

Cognitively Plausible Computational Models of Lexical Processing Can Explain Variance in Human Word Predictions and Reading Times

Wietse de Vries¹²

Conference paper
First Online: 05 January 2021

458 Accesses

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1196))

Abstract

Lexical processing times can yield valuable insights about structure in language and the cognitive processes that enable the use of language. Being able to estimate lexical processing times enables us to estimate readability and reading times of any text. It has been claimed that lexical processing times of words are influenced by word occurrence frequencies as well as the context it appears in (McDonald and Shillcock 2001; Baayen 2010). The context might be important because of predictive processes that enable quicker lexical processing (Christiansen and Chater 2016). In the present paper, the effects of morphosyntactic predictions on lexical processing times are investigated using two computational models. These computational models are trained to predict upcoming part-of-speech tags based on preceding part-of-speech tags and their predictions are compared with human predictions and human reading times from the PROVO corpus (Luke and Christianson 2018). A recurrent neural network is able to explain variance in human prediction errors whereas the Rescorla-Wagner model performs less well. The Rescorla-Wagner prediction associations do however explain more variance in human reading times. Moreover, the Rescorla-Wagner model associations explain more variance in gaze durations than human prediction errors. The human prediction errors and the Recorla-Wagner model associations combined explain most variance (Adj. R\(^2 = 0.719\)) in reading times, which indicates that the part-of-speech tag-based Rescorla-Wagner model associations contain complementary information to explicit human predictions about lexical processing times.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

1.
https://universaldependencies.org/tagset-conversion/en-penn-uposf.html.

References

Baayen, R.H.: Demythologizing the word frequency effect: a discriminative learning perspective. Mental Lexicon 5(3), 436–461 (2010)
Article Google Scholar
Baayen, R.H., Milin, P., Durdjevic, D.F., Hendrix, P., Marelli, M.: An amorphous model for morphological processing in visual comprehension based on Naive discriminative learning. Psychol. Rev. 118(3), 438 (2011)
Article Google Scholar
Balota, D.A., Yap, M.J., Hutchison, K.A., Cortese, M.J., Kessler, B., Loftis, B., et al.: The English lexicon project. Behav. Res. Methods 39(3), 445–459 (2007)
Article Google Scholar
Christiansen, M.H., Chater, N.: The now-or-never bottleneck: a fundamental constraint on language. Behav. Brain Sci. 39 (2016)
Google Scholar
Elman, J.L.: Finding structure in time. Cogn. Sci. 14(2), 179–211 (1990)
Article Google Scholar
Garside, R.: A hybrid grammatical tagger: Claws 4, Corpus annotation. Linguistic information from computer text corpora (1997)
Google Scholar
Hinojosa, J.A., Moreno, E.M., Casado, P., Muñoz, F., Pozo, M.A.: Syntactic expectancy: an event-related potentials study. Neurosci. Lett. 378(1), 34–39 (2005)
Article Google Scholar
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization (2014). arXiv preprint arXiv:1412.6980
Loper, E., Bird, S.: NLTK: the natural language toolkit. arXiv preprint cs/0205028 (2002)
Google Scholar
Luke, S.G., Christianson, K.: Limits on lexical prediction during reading. Cogn. Psychol. 88, 22–60 (2016)
Article Google Scholar
Luke, S.G., Christianson, K.: The Provo corpus: a large eye-tracking corpus with predictability norms. Behav. Res. Methods 50, 826–833 (2018)
Article Google Scholar
Marcus, M., Santorini, B., Marcinkiewicz, M.A.: Building a large annotated corpus of English: The Penn Treebank (1993)
Google Scholar
McClelland, J.L., Rumelhart, D.E.: An interactive activation model of context effects in letter perception: I. An account of basic findings. Psychol. Rev. 88(5), 375 (1981)
Article Google Scholar
McDonald, S.A., Shillcock, R.C.: Rethinking the word frequency effect: the neglected role of distributional information in lexical processing. Lang. Speech 44(3), 295–322 (2001)
Article Google Scholar
Mirman, D., Graf Estes, K., Magnuson, J.S.: Computational modeling of statistical learning: effects of transitional probability versus frequency and links to word learning. Infancy 15(5), 471–486 (2010)
Article Google Scholar
Misyak, J.B., Christiansen, M.H., Bruce Tomblin, J.: Sequential expectations: the role of prediction-based learning in language. Topics Cogn. Sci. 2(1), 138–153 (2010)
Article Google Scholar
Nivre, J., Abrams, M., Agić, Ž., et al.: Universal dependencies 2.3. (LINDAT/CLARIN digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University) (2018)
Google Scholar
Rescorla, R.A., Wagner, A.R., et al.: A theory of Pavlovian conditioning: variations in the effectiveness of reinforcement and nonreinforcement. Class. Conditioning II Curr. Res. Theory 2, 64–99 (1972)
Google Scholar
Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., Salakhutdinov, R.: Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15(1), 1929–1958 (2014)
MathSciNet MATH Google Scholar
Taylor, W.L.: “Cloze procedure”: a new tool for measuring readability. J. Bull. 30(4), 415–433 (1953)
Google Scholar
Van Berkum, J.J., Brown, C.M., Zwitserlood, P., Kooijman, V., Hagoort, P.: Anticipating upcoming words in discourse: evidence from ERPs and reading times. J. Exp. Psychol. Learn. Mem. Cognit. 31(3), 443 (2005)
Article Google Scholar

Download references

Author information

Authors and Affiliations

University of Groningen, Groningen, The Netherlands
Wietse de Vries

Authors

Wietse de Vries
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Wietse de Vries .

Editor information

Editors and Affiliations

Vrije Universiteit Brussel, Brussels, Belgium
Bart Bogaerts
Université Libre de Bruxelles, Brussels, Belgium
Gianluca Bontempi
Université de Liège, Liège, Belgium
Pierre Geurts
Vrije Universiteit Brussel, Brussels, Belgium
Nick Harley
Université Libre de Bruxelles, Brussels, Belgium
Bertrand Lebichot
Université Libre de Bruxelles, Brussels, Belgium
Tom Lenaerts
Université de Liège, Liège, Belgium
Gilles Louppe

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

de Vries, W. (2020). Cognitively Plausible Computational Models of Lexical Processing Can Explain Variance in Human Word Predictions and Reading Times. In: Bogaerts, B., et al. Artificial Intelligence and Machine Learning. BNAIC BENELEARN 2019 2019. Communications in Computer and Information Science, vol 1196. Springer, Cham. https://doi.org/10.1007/978-3-030-65154-1_4

Download citation

DOI: https://doi.org/10.1007/978-3-030-65154-1_4
Published: 05 January 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-65153-4
Online ISBN: 978-3-030-65154-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics