Deep Learning in Automated Essay Scoring

Boulanger, David; Kumar, Vivekanandan

doi:10.1007/978-3-319-91464-0_30

David Boulanger¹⁶ &
Vivekanandan Kumar¹⁶

Part of the book series: Lecture Notes in Computer Science ((LNPSE,volume 10858))

Included in the following conference series:

International Conference on Intelligent Tutoring Systems

3350 Accesses

Abstract

This paper explores the application of deep learning in automated essay scoring (AES). It uses the essay dataset #8 from the Automated Student Assessment Prize competition, hosted by the Kaggle platform, and a state-of-the-art Suite of Automatic Linguistic Analysis Tools (SALAT) to extract 1,463 writing features. A non-linear regressor deep neural network is trained to predict holistic scores on a scale of 10–60. This study shows that deep learning holds the promise to improve significantly the accuracy of AES systems, but that the current dataset and most essay datasets fall short of providing them with enough expertise (hand-graded essays) to exploit that potential. After the tuning of different sets of hyperparameters, the results show that the levels of agreement, as measured by the quadratic weighted kappa metric, obtained on the training, validation, and testing sets are 0.84, 0.63, and 0.58, respectively, while an ensemble (bagging) produced a kappa value of 0.80 on the testing set. Finally, this paper upholds that more than 1,000 hand-graded essays per writing construct would be necessary to adequately train the predictive student models on automated essay scoring, provided that all score categories are equally or fairly represented in the sample dataset.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

A review of deep-neural automated essay scoring models

Article Open access 20 July 2021

Automatically Grading Brazilian Student Essays

Automated Essay Scoring and the Deep Learning Black Box: How Are Rubric Scores Determined?

Article 15 September 2020

Notes

1.
Dataset and code are available at: https://1drv.ms/u/s!ApcQo2VlCqiPmXGh2_zjgDLnPzKp

References

Crossley, S.A., Kyle, K., McNamara, D.S.: The tool for the automatic analysis of text cohesion (TAACO): automatic assessment of local, global, and text cohesion. Behav. Res. Methods 48(4), 1227–1237 (2016)
Article Google Scholar
Crossley, S.A., Kyle, K., McNamara, D.S.: Sentiment analysis and social cognition engine (SEANCE): an automatic tool for sentiment, social cognition, and social order analysis. Behav. Res. Methods 49(3), 803–821 (2017)
Article Google Scholar
Guestrin, C., Fox, E.: Machine Learning: Regression. Coursera (2017). https://www.coursera.org/learn/ml-regression. Accessed 22 Mar 2018
Kumar, V., Fraser, S.N., Boulanger, D.: Discovering the predictive power of five baseline writing competences. J. Writ. Anal. 1(1), 176–226 (2017)
Google Scholar
Kyle, K., Crossley, S.A.: Automatically assessing lexical sophistication: indices, tools, findings, and application. TESOL Q. 49(4), 757–786 (2015)
Article Google Scholar
Kyle, K.: Suite of Automatic Linguistic Analysis Tools (SALAT) (2016a). http://www.kristopherkyle.com/. Accessed 25 Apr 2018
Kyle, K.: Measuring syntactic development in L2 writing: fine grained indices of syntactic complexity and usage-based indices of syntactic sophistication. Doctoral Dissertation (2016b). http://scholarworks.gsu.edu/alesl_diss/35
Ng, A.: Improving Deep Neural Networks: Hyperparameter tuning, Regularization and Optimization. Coursera (2017). https://www.coursera.org/learn/deep-neural-network. Accessed 22 Mar 2018
Rosebrock, A.: Deep Learning for Computer Vision with Python, 1st edn. PyImageSearch (2017). https://www.pyimagesearch.com/deep-learning-computer-vision-python-book/. Accessed 22 Mar 2018
Shermis, M.D.: State-of-the-art automated essay scoring: competition, results, and future directions from a United States demonstration. Assess. Writ 20(1), 53–76 (2014)
Article Google Scholar
Zupanc, K., Bosnić, Z.: Automated essay evaluation with semantic analysis. Knowl.-Based Syst. 120, 118–132 (2017)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Athabasca University, Edmonton, AB, T5J 3S8, Canada
David Boulanger & Vivekanandan Kumar

Authors

David Boulanger
View author publications
You can also search for this author in PubMed Google Scholar
Vivekanandan Kumar
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to David Boulanger .

Editor information

Editors and Affiliations

Université du Québec, Montreal, Québec, Canada
Roger Nkambou
NCSU, Raleigh, North Carolina, USA
Roger Azevedo
University of Saskatchewan, Saskatoon, Saskatchewan, Canada
Julita Vassileva

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Boulanger, D., Kumar, V. (2018). Deep Learning in Automated Essay Scoring. In: Nkambou, R., Azevedo, R., Vassileva, J. (eds) Intelligent Tutoring Systems. ITS 2018. Lecture Notes in Computer Science(), vol 10858. Springer, Cham. https://doi.org/10.1007/978-3-319-91464-0_30

Download citation

DOI: https://doi.org/10.1007/978-3-319-91464-0_30
Published: 17 May 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-91463-3
Online ISBN: 978-3-319-91464-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics