Understandable Prediction Models of Student Performance Using an Attribute Dictionary

Sorour, Shaymaa E.; Abd El Rahman, Shaimaa; Kahouf, Samir A.; Mine, Tsunenori

doi:10.1007/978-3-319-47440-3_18

Shaymaa E. Sorour^18,19,
Shaimaa Abd El Rahman¹⁹,
Samir A. Kahouf¹⁹ &
…
Tsunenori Mine²⁰

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10013))

Included in the following conference series:

International Conference on Web-Based Learning

1307 Accesses
1 Citations

Abstract

This paper proposes a new approach for predicting final student grade with high accuracy. It builds an attribute dictionary (AD) automatically from students’ comments collected after every lesson. Furthermore, it combines white-box models: Decision Tree (DT) and Random Forest (RF), and a black-box model: Support Vector Machine (SVM) to construct an interpretable prediction model and carry out eclectic rule-extraction. First, the AD is built from students’ comments, which are converted to attribute vectors. Second, the output decision is generated by SVM using the attribute vectors in the training phase and then DT and RF are applied to the output decision to extract symbolic rules. Experimental results illustrate the validity of the AD constructed automatically and the superiority of the proposed approach compared to single machine learning techniques: DT, RF and SVM.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

nltk package. http://www.nltk.org/api/nltk.html#nltk.text.TextCollection
Barakat, N., Bradley, A.P.: Rule extraction from support vector machines: a review. Neurocomputing 74(1), 178–190 (2010)
Article Google Scholar
Barakat, N., Diederich, J.: Learning-based rule-extraction from support vector machines. In: The 14th International Conference on Computer Theory and Applications (ICCTA) (2004)
Google Scholar
Breiman, L.: Decision-tree forests. Mach. Learn. 45(1), 5–32 (2001)
Article MATH Google Scholar
Goda, K., Mine, T.: Analysis of students’ learning activities through quantifying time-series comments. In: König, A., Dengel, A., Hinkelmann, K., Kise, K., Howlett, R.J., Jain, L.C. (eds.) KES 2011, Part II. LNCS (LNAI), vol. 6882, pp. 154–164. Springer, Heidelberg (2011). doi:10.1007/978-3-642-23863-5_16
Chapter Google Scholar
Khajah, M., Wing, R.M., Lindsey, R.V., Mozer., M.C.: Integrating latent-factor and knowledge-tracing models to predict individual differences in learning. In: Educational Data Mining (EDM), pp. 99–106 (2014)
Google Scholar
Kijsirikul, B.: Artificial Intelligence. Department of Computer Engineer, Faculty of Engineering, Chulalongkorn University (2003)
Google Scholar
Kudo, T.: Mecab program. http://sourceforge.net/projects/mecab/
Nidhra, S., Dondeti, J.: Black box and white box testing techniques - a literature review. Int. J. Embed. Syst. Appl. (IJESA) 2(2), 29–50 (2012)
Google Scholar
Pardo, A., Mirriahi, N., Martinez-Maldonado, R., Jovanovic, J., Dawson, S., Gašević, D.: Generating actionable predictive models of academic performance. In: Proceedings of the Sixth International Conference on Learning Analytics & Knowledge, pp. 474–478. ACM (2016)
Google Scholar
Salzberg, S.L.: Book review: C4.5: programs for machine learning. Mach. Learn. 16, 235–240 (1994)
MathSciNet Google Scholar
Scheffel, M., Drachsler, H., Stoyanov, S., Specht, M.: Quality indicators for learning analytics. Educ. Technol. Soc. 17(4), 117–132 (2014)
Google Scholar
Shibata, Y.: Negative structure in Japanese. Univ. Pennsylvania Work. Pap. Linguist. 20(1), 290–299 (2014)
Google Scholar
Sorour, S.E., Goda, K., Mine, T.: Evaluation of effectiveness of time-series comments by using machine learning techniques. J. Inf. Process. (JIP) 23(6), 784–794 (2015)
Google Scholar
Sorour, S.E., Mine, T., Goda, K., Hirokawa, S.: Predictive model to evaluate student performance. J. Inf. Process. 23(2), 192–201 (2015)
Google Scholar
Tsytsarau, M., Palpanas, T.: Survey on mining subjective data on the web. Data Min. Knowl. Discov. 24(3), 478–514 (2012)
Article MATH Google Scholar
Wanli, X., Rui, G., Eva, P., Sean, G.: Participation-based student final performance prediction model through interpretable genetic programming: integrating learning analytics, educational data mining and theory. Comput. Hum. Behav. 47, 168–181 (2015)
Article Google Scholar
Wise, A.F., Shaffer, D.W.: Why theory matters more than ever in the age of big data. J. Learn. Anal. 2(2), 5–13 (2015)
Article Google Scholar

Download references

Acknowledgments

This work was supported in part by JSPS KAKENHI Grant Number 26350357, 26540183 and 16H02926.

Author information

Authors and Affiliations

Graduate School of ISEE, Kyushu University, Motooka, Fukuoka, 819-0395, Japan
Shaymaa E. Sorour
Faculty of Specific Education, Kafr Elsheik University, Kafr Elsheikh, 33516, Egypt
Shaymaa E. Sorour, Shaimaa Abd El Rahman & Samir A. Kahouf
Faculty of ISEE, Kyushu University, Motooka, Fukuoka, 819-0395, Japan
Tsunenori Mine

Authors

Shaymaa E. Sorour
View author publications
You can also search for this author in PubMed Google Scholar
Shaimaa Abd El Rahman
View author publications
You can also search for this author in PubMed Google Scholar
Samir A. Kahouf
View author publications
You can also search for this author in PubMed Google Scholar
Tsunenori Mine
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Shaymaa E. Sorour or Tsunenori Mine .

Editor information

Editors and Affiliations

University of Hong Kong, Hong Kong, Hong Kong
Dickson K.W. Chiu
Leibniz University of Hanover, Hannover, Germany
Ivana Marenzi
Sapienza University of Rome, Rome, Italy
Umberto Nanni
University of Caen Basse-Normandie, Caen, France
Marc Spaniol
Sapienza University of Rome, Rome, Italy
Marco Temperini

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sorour, S.E., Abd El Rahman, S., Kahouf, S.A., Mine, T. (2016). Understandable Prediction Models of Student Performance Using an Attribute Dictionary. In: Chiu, D., Marenzi, I., Nanni, U., Spaniol, M., Temperini, M. (eds) Advances in Web-Based Learning – ICWL 2016. ICWL 2016. Lecture Notes in Computer Science(), vol 10013. Springer, Cham. https://doi.org/10.1007/978-3-319-47440-3_18

Download citation

DOI: https://doi.org/10.1007/978-3-319-47440-3_18
Published: 01 October 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-47439-7
Online ISBN: 978-3-319-47440-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics