Skip to main content

Advertisement

Log in

Predicting students’ performance in English and Mathematics using data mining techniques

  • Published:
Education and Information Technologies Aims and scope Submit manuscript

Abstract

This study attempts to predict secondary school students’ performance in English and Mathematics subjects using data mining (DM) techniques. It aims to provide insights into predictors of students’ performance in English and Mathematics, characteristics of students with different levels of performance, the most effective DM technique for students’ performance prediction, and the relationship between these two subjects. The study employed the archival data of students who were 16 years old in 2019 and sat for the Malaysian Certificate of Examination (MCE) in 2021. The learning of English and Mathematics is a concern in many countries. Three main factors, namely students’ past academic performance, demographics, and psychological attributes were scrutinized to identify their impact on the prediction. This study utilized the Orange software for the DM process. It employed Decision Tree (DT) rules to determine the characteristics of students with low, moderate, and high performance in English and Mathematics subjects. DT and Naïve Bayes (NB) techniques show the best predictive performance for English and Mathematics subjects, respectively. Such characteristics and predictions may cue appropriate interventions to improve students’ performance in these subjects. This study revealed students’ past academic performance as the most critical predictor, as well as a few demographics and psychological attributes. By examining top predictors derived using four different classifier types, this study found that students’ past Mathematics performance predicts their MCE English performance and students’ past English performance predicts their MCE Mathematics performance. This finding shows students’ performances in both subjects are interrelated.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

Data availability

The datasets generated during and/or analyzed during the current study are available from the corresponding author upon reasonable request.

References

Download references

Acknowledgements

This work is supported by the Malaysian Ministry of Higher Education, Fundamental Research Grant Scheme, FRGS/1/2020/SS10/UNIMAS/01/1, and UNIMAS Zamalah Scholarship.

Funding

This work is funded by the Malaysian Ministry of Higher Education, Fundamental Research Grant Scheme, FRGS/1/2020/SS10/UNIMAS/01/1, and UNIMAS Zamalah Scholarship.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Chwen Jen Chen.

Ethics declarations

Ethics approval statement

The study obtained approval from the Education Policy Research and Development Division, Ministry of Education, Malaysia to use the archival data from the schools involved.

Conflict of interest

There is no potential conflict of interest in this study.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Roslan, M.H.B., Chen, C.J. Predicting students’ performance in English and Mathematics using data mining techniques. Educ Inf Technol 28, 1427–1453 (2023). https://doi.org/10.1007/s10639-022-11259-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10639-022-11259-2

Keywords