Skip to main content
Log in

Identifying the contextual factors related to the reading performance of Moroccan fourth-grade students from a Machine Learning-based Approach.

  • Published:
Education and Information Technologies Aims and scope Submit manuscript

Abstract

Based on the 2016 Progress in International Reading Literacy Research (PIRLS) data, this study determined the contextual characteristics that distinguished Moroccan fourth-grade kids with high- and low-achieving reading abilities. The PIRLS 2016 public dataset included 5446 Moroccan pupils, with 1545 scoring well and 3901 doing poorly on the reading portion of the exam. The objective of the present research was to determine the ideal combination of significant contextual elements that jointly affect fourth-grade children’s reading ability. Machine learning technique such as gradient boosting was used to find the most critical contextual variables that might distinguish between the two groups of students. The findings demonstrate that 12 significant contextual variables from GradientBoosting, selected from 486 contextual variables at the levels of the student, teacher, and school, Identifying high- and low-performing students collectively have consequences for teaching and learning in the future about the reading ability of primary school students.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18

Similar content being viewed by others

Availability of data and materials

The data used this study are available from https://timssandpirls.bc.edu/pirls2016/international-database/index.html

Code availability

The code that support the findings of this study are available from the corresponding author upon reasonable request.

References

  • Aler, R., Galván, I.M., Ruiz-Arias, J.A., Gueymard C.A. (2017). Improving the separation of direct and diffuse solar radiation components using machine learning by gradient boosting. Solar Energy, 150, 558–569.

    Article  ADS  Google Scholar 

  • Aram, D. , Levin, I. (2002). Mother–child joint writing and storybook reading: Relations with literacy among low ses kindergartners. Merrill-Palmer Quarterly (1982-), 202–224.

  • Areepattamannil, S., Freeman, J.G., Klinger, D.A. (2011). Influence of motivation, self-beliefs, and instructional practices on science achievement of adolescents in canada. Social Psychology of Education. 14(2), 233–259.

    Article  Google Scholar 

  • Berninger, V. W., Nielsen, K. H., Abbott, R. D., Wijsman, E., Raskind, W. (2008). Gender differences in severity of writing and reading disabilities. Journal of school psychology. 46(2), 151–172.

    Article  PubMed  Google Scholar 

  • Berrar, D. (2019). Cross-Validation.

  • Breiman, L. (2001). Random forests. Machine learning. 45(1), 5–32.

    Article  Google Scholar 

  • Burman, D. D., Bitan, T., Booth, J. R. (2008). Sex differences in neural processing of language among children. Neuropsychologia. 46(5), 1349–1362.

    Article  PubMed  PubMed Central  Google Scholar 

  • Carlisle, J. F., Correnti, R., Phelps, G., Zeng, J. (2009). Exploration of the contribution of teachers’ knowledge about reading to their students’ improvement in reading. Reading and Writing. 22(4), 457–486.

    Article  Google Scholar 

  • Chen, J., Zhang, Y., Hu, J. (2021). Synergistic effects of instruction and affect factors on high-and low-ability disparities in elementary students’ reading literacy. Reading and Writing. 34(1), 199–230.

    Article  Google Scholar 

  • Cichosz, P. (2015). Classification model evaluation.

  • Cordero, J. M., Santín, D., Simancas, R. (2017). Assessing european primary school performance through a conditional nonparametric model. Journal of the Operational Research Society. 68(4), 364–376.

    Article  Google Scholar 

  • Creemers, B., Kyriakides, L. (2010). School factors explaining achievement on cognitive and affective outcomes: Establishing a dynamic model of educational effectiveness. Scandinavian Journal of Educational Research. 54(3), 263–294.

    Article  Google Scholar 

  • Epstein, J. L. (2018). School, family, and community partnerships: Preparing educators and improving schools. Routledge.

  • Farver, J. A. M., Xu, Y., Eppe, S., Lonigan, C. J. (2006). Home environments and young latino children’s school readiness. Early Childhood Research Quarterly. 21(2), 196–212.

    Article  Google Scholar 

  • Giambona, F., Porcu, M. (2018). School size and students’ achievement. empirical evidences from pisa survey data. Socio-Economic Planning Sciences. 64, 66–77.

    Article  Google Scholar 

  • Gnaldi, M., Schagen, I., Twist, L., Morrison, J. (2005). Attitude items and low ability students: The need for a cautious approach to interpretation. Educational Studies. 31(2), 103–113.

    Article  Google Scholar 

  • Gregorutti, B., Michel, B., Saint-Pierre, P. (2017). Correlation and variable importance in random forests. Statistics and Computing. 27(3), 659–678.

    Article  MathSciNet  Google Scholar 

  • Gustafsson, J. E., Balke, G. (1993). General and specific abilities as predictors of school achievement. Multivariate behavioral research. 28(4), 407–434.

    Article  CAS  PubMed  Google Scholar 

  • Hearst, M. A., Dumais, S. T., Osuna, E., Platt, J., Scholkopf, B. (1998). Support vector machines. IEEE Intelligent Systems and their applications. 13(4), 18–28.

    Article  Google Scholar 

  • Hosmer Jr, D. W. , Lemeshow, S. ,Sturdivant, R. X. (2013). Applied logistic regression (Vol. 398). John Wiley & Sons.

  • Hu, J. (2014). An analysis of the design process of a language learning management system. Control and Intelligent Systems. 42(1), 80–86.

    Article  Google Scholar 

  • Huljanah, M. , Rustam, Z. , Utama, S. , Siswantining, T. (2019). Feature selection using random forest classifier for predicting prostate cancer. Iop conference series: Materials science and engineering (Vol. 546, p. 052031).

  • Jehangir, K., Glas, C. A., van den Berg, S. (2015). Exploring the relation between socio-economic status and reading achievement in pisa 2009 through an intercepts-and-slopes-as-outcomes paradigm. International Journal of Educational Research. 71, 1–15.

    Article  Google Scholar 

  • Jiang, S., Pang, G., Wu, M., Kuang, L. (2012). An improved k-nearest-neighbor algorithm for text categorization. Expert Systems with Applications. 39(1), 1503–1509.

    Article  Google Scholar 

  • Jing, Y., Li, B., Chen, N., Li, X., Hu, J., Zhu, F. et al. (2015). The discrimination of learning styles by bayes-based statistics: An extended study on ils system. Control and Intelligent Systems. 43(2), 68–75.

    Article  Google Scholar 

  • John, G. , Langley, P. (1995). Estimating continuous distributions in bayesian classifiers. in proceedings of the eleventh conference on uncertainty in artificial intelligence. Morgan Kaufmann Publishers Inc. 338–345.

  • Law, Y. K. (2009). The role of attribution beliefs, motivation and strategy use in chinese fifth-graders’ reading comprehension. Educational research. 51(1), 77–95.

    Article  Google Scholar 

  • Lee, V. E., Burkam, D. T. (2003). Dropping out of high school: The role of school organization and structure. American educational research journal. 40(2), 353–393.

    Article  Google Scholar 

  • Li, B. , Yu, S. , Lu, Q. (2003). An improved k-nearest neighbor algorithm for text categorization. arXiv preprint cs/0306099.

  • Lyaqini, S., Nachaoui, M., Hadri, A. (2022). An efficient primal-dual method for solving non-smooth machine learning problem. Chaos, Solitons & Fractals. 155, 111754.

  • Machin, S., McNally, S., Wyness, G. (2013). Educational attainment across the uk nations: performance, inequality and evidence. Educational Research. 55(2), 139–164.

    Article  Google Scholar 

  • Marzban, C. (2004). The roc curve and the area under it as performance measures. Weather and Forecasting. 19(6), 1106–1114.

    Article  ADS  Google Scholar 

  • Mullis, I. V. , Kennedy, A. M. , Martin, M. O. , Sainsbury, M. (2004). Pirls 2006 assessment framework and specifications: Progress in international reading literacy study. ERIC.

  • Mullis, I. V. , Martin, M. O. , Foy, P. , Arora, A. (2020). Timss 2019 international results in mathematics. ERIC.

  • Mullis, I. V. , Martin, M. O. , Foy, P. , Drucker, K. T. (2012). Pirls 2011 international results in reading. ERIC.

  • Mullis, I. V. , Martin, M. O. , Foy, P. , Hooper, M. (2017). epirls 2016: International results in online informational reading. International Association for the Evaluation of Educational Achievement.

  • Mullis, I. V. , Martin, M. O. , Gonzalez, E. J. , Chrostowski, S. J. (2004). Timss 2003 international mathematics report: Findings from iea’s trends in international mathematics and science study at the fourth and eighth grades. ERIC.

  • Mursalin, M., Zhang, Y., Chen, Y., Chawla, N. V. (2017). Automated epileptic seizure detection using improved correlation-based feature selection with random forest classifier. Neurocomputing. 241, 204–214.

    Article  Google Scholar 

  • Nachaoui, M., Lyaqini, S., Chaouch, M. (2023). Indicating if water is safe for human consumption using an enhanced machine learning approach. Statistics, Optimization & Information Computing. 11(1), 70–81.

    Article  Google Scholar 

  • Ngorosho, D. (2011). Reading and writing ability in relation to home environment: A study in primary education in rural tanzania. Child Indicators Research. 4(3), 369–388.

    Article  Google Scholar 

  • Park, Y. (2011). How motivational constructs interact to predict elementary students’ reading performance: Examples from attitudes and self-concept in reading. Learning and Individual Differences. 21(4), 347–358.

    Article  Google Scholar 

  • Rokach, L. (2005). Data mining and knowledge discovery handbook. Springer Science+ Business Media, Incorporated.

    Google Scholar 

  • Rutter, M., Maughan, B. (2002). School effectiveness findings 1979–2002. Journal of school psychology. 40(6), 451–475.

    Article  Google Scholar 

  • Schneider, S. L. (2013). The international standard classification of education 2011. Class and stratification analysis. Emerald Group Publishing Limited.

    Book  Google Scholar 

  • Shultz, T. R., Fahlman, S. E., Craw, S., Andritsos, P., Tsaparas, P., Silva, R., ... others (2011). Confusion matrix. Encyclopedia of machine learning. 61(8), 209–209.

  • Tang, J., Alelyani, S., Liu, H. (2014). Feature selection for classification: A review. Data classification: Algorithms and applications. 37.

    Google Scholar 

  • Tramonte, L., Willms, J. D. (2010). Cultural capital and its effects on education outcomes. Economics of Education Review. 29(2), 200–213.

    Article  Google Scholar 

  • Tse, S. K., Xiao, X. Y. (2014). Differential influences of affective factors and contextual factors on high-proficiency readers and low-proficiency readers: A multilevel analysis of pirls data from hong kong. Large-scale Assessments in Education. 2(1), 1–24.

    Article  Google Scholar 

  • Wei, Y., Hu, J. (2020). Differentiating critical thinking dispositions of newly enrolled efl undergraduates and postgraduates through svm approach. Mechatronic Systems and Control. 48, 159–164.

    Article  CAS  Google Scholar 

  • Zhang, H. (2004). The optimality of naive bayes. Aa. 1(2), 3.

Download references

Funding

Not applicable.

Author information

Authors and Affiliations

Authors

Contributions

Not applicable.

Corresponding author

Correspondence to Soufiane Lyaqini.

Ethics declarations

Ethics approval

Not applicable.

Consent to participate

Not applicable.

Consent for publication

Not applicable.

Conflict of interest

The authors declare that they have no confict of interest.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Khoudi, Z., Nachaoui, M. & Lyaqini, S. Identifying the contextual factors related to the reading performance of Moroccan fourth-grade students from a Machine Learning-based Approach.. Educ Inf Technol 29, 3047–3073 (2024). https://doi.org/10.1007/s10639-023-11881-8

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10639-023-11881-8

Keyword

Navigation