Hybrid Deep Neural Networks for Industrial Text Scoring

Nagappan, Sidharrth; Goh, Hui-Ngo; Lim, Amy Hui-Lan

doi:10.1007/978-3-031-08530-7_58

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 13343)

Included in the following conference series:

International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems

Abstract

Academic scoring is mainly explored through the pedagogical fields of Automated Essay Scoring (AES) and Short Answer Scoring (SAS), but text scoring in other domains has received limited attention. This paper focuses on industrial text scoring, namely the processing and adherence checking of long annual reports based on regulatory requirements. To lay the foundations for non-academic scoring, a pioneering corpus of company annual reports is scraped and segmented into sections, and domain experts score the relevant sections for adherence. Subsequently, deep neural non-hierarchical attention-based LSTMs, hierarchical attention networks and Longformer-based models are refined and evaluated. Since the Longformer outperformed the LSTM-based models, we embed it into a hybrid scoring framework that employs lexicon and named entity features, with rubric injection via word-level attention, culminating in Kappa scores of 0.9670 and 0.820 on our two corpora, respectively. Though scoring is fundamentally subjective, our proposed models show significant results when navigating thin rubric boundaries and handling adversarial responses. As our work proposes a novel industrial text scoring engine, we hope to validate our framework using more official documentation based on a broader range of regulatory practices.
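The agreement metric reported in the abstract can be illustrated with a minimal sketch of Cohen's kappa between expert scores and model scores. The abstract does not state which kappa variant the authors used; the quadratic weighting below is an assumption, chosen only because it is common in essay scoring. The function name weighted_kappa and its parameters are hypothetical and do not come from the paper.

```python
from collections import Counter


def weighted_kappa(y_true, y_pred, num_labels, quadratic=True):
    """Cohen's kappa between two lists of integer scores in [0, num_labels).

    Hypothetical helper for illustration; quadratic weighting is an
    assumption, not something the abstract specifies.
    """
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("score lists must be non-empty and of equal length")
    if num_labels < 2:
        raise ValueError("need at least two score levels")

    n = len(y_true)
    observed = Counter(zip(y_true, y_pred))  # joint counts of (expert, model)
    hist_true = Counter(y_true)              # marginal counts per rater
    hist_pred = Counter(y_pred)

    def weight(i, j):
        # Penalty for the model giving score j when the expert gave score i.
        if quadratic:
            return ((i - j) ** 2) / ((num_labels - 1) ** 2)
        return 0.0 if i == j else 1.0

    observed_penalty = 0.0  # weighted disagreement actually seen
    expected_penalty = 0.0  # weighted disagreement expected by chance
    for i in range(num_labels):
        for j in range(num_labels):
            w = weight(i, j)
            observed_penalty += w * observed[(i, j)] / n
            expected_penalty += w * hist_true[i] * hist_pred[j] / (n * n)

    if expected_penalty == 0.0:
        # Both raters used one identical label throughout.
        return 1.0
    return 1.0 - observed_penalty / expected_penalty
```

A value of 1 means the model always matches the expert, 0 means agreement no better than chance, and negative values mean systematic disagreement. With quadratic weighting, a prediction that is off by one rubric level is penalised less than one that is off by several levels.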

Author information

Authors and Affiliations

Faculty of Computing and Informatics, Multimedia University, Cyberjaya, Malaysia
Sidharrth Nagappan, Hui-Ngo Goh & Amy Hui-Lan Lim

Corresponding author

Correspondence to Sidharrth Nagappan.

Editor information

Editors and Affiliations

i-SOMET, Inc., Morioka-shi, Iwate, Japan
Hamido Fujita
College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, Guangdong, China
Philippe Fournier-Viger
Texas State University, San Marcos, TX, USA
Moonis Ali
Shanghai University of Finance and Economics, Shanghai, China
Yinglin Wang

About this paper

Cite this paper

Nagappan, S., Goh, H.-N., Lim, A.H.-L. (2022). Hybrid Deep Neural Networks for Industrial Text Scoring. In: Fujita, H., Fournier-Viger, P., Ali, M., Wang, Y. (eds) Advances and Trends in Artificial Intelligence. Theory and Practices in Artificial Intelligence. IEA/AIE 2022. Lecture Notes in Computer Science (LNAI), vol 13343. Springer, Cham. https://doi.org/10.1007/978-3-031-08530-7_58

DOI: https://doi.org/10.1007/978-3-031-08530-7_58
Published: 30 August 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-08529-1
Online ISBN: 978-3-031-08530-7
eBook Packages: Computer Science; Computer Science (R0)
