Skip to main content

Advertisement

Log in

A data mining approach to guide students through the enrollment process based on academic performance

  • Original Paper
  • Published:
User Modeling and User-Adapted Interaction Aims and scope Submit manuscript

Abstract

Student academic performance at universities is crucial for education management systems. Many actions and decisions are made based on it, specifically the enrollment process. During enrollment, students have to decide which courses to sign up for. This research presents the rationale behind the design of a recommender system to support the enrollment process using the students’ academic performance record. To build this system, the CRISP-DM methodology was applied to data from students of the Computer Science Department at University of Lima, Perú. One of the main contributions of this work is the use of two synthetic attributes to improve the relevance of the recommendations made. The first attribute estimates the inherent difficulty of a given course. The second attribute, named potential, is a measure of the competence of a student for a given course based on the grades obtained in related courses. Data was mined using C4.5, KNN (K-nearest neighbor), Naïve Bayes, Bagging and Boosting, and a set of experiments was developed in order to determine the best algorithm for this application domain. Results indicate that Bagging is the best method regarding predictive accuracy. Based on these results, the “Student Performance Recommender System” (SPRS) was developed, including a learning engine. SPRS was tested with a sample group of 39 students during the enrollment process. Results showed that the system had a very good performance under real-life conditions.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Al-Radaideh, Q., AI-Shawakfa, M., Al-Najjar, M.: Mining student data using decision trees. In: The 2006 International Arab Conference on Information Technology, Yarmouk University, Jordan (2006)

  • Breiman L.: Bagging predictors. Mach. Learn. 24(2), 123–140 (1996)

    MATH  MathSciNet  Google Scholar 

  • Castellano, E., Martínez, L.: ORIEB, A CRS for academic orientation using qualitative assessments. In: Proceedings of the IADIS International Conference E-Learning, pp. 38–42 (2008)

  • Cestnik, B., Bratko, I.: On estimating probabilities in tree pruning. In: Machine Learning (EWSL’91) Lecture Notes in Computer Science, vol. 482, no. 3, pp. 138–150. Springer-Verlag, Berlin (1991)

  • Cortez, P., Silva, A.: Using data mining to predict secondary school student performance. In: Proceedings of 5th Future Business Technology Conference, Oporto, Portugal, pp. 5–12 (2008)

  • Dekker, G., Pechenizkiy, M., Vleeshouwers, J.: Predicting students drop out: a case study. In: Proceedings of the 2nd International Conference on Educational Data Mining (EDM’09), Cordoba, Spain, pp. 41–50 (2009)

  • Edelstein, H.: Building profitable customer relationships with data mining. In: SPSS White Paper-Executive Briefing, pp. 1–13. Two Crows Corporation (2000)

  • Enas G., Choi S.: Choice of the smoothing parameter and efficiency of K-nearest neighbor classification. Comput. Math. Appl. 12, 235–244 (1986)

    Article  MATH  Google Scholar 

  • Esposito F., Malerba D., Semeraro G.: A comparative analysis of methods for pruning decisión trees. IEEE Trans. Pattern Anal. Mach. Intell. 19(5), 476–491 (1997)

    Article  Google Scholar 

  • Feldman R.: Mining the biomedical literature using semantic analysis. Biosilico 1(2), 69–80 (2003)

    Article  Google Scholar 

  • Freund, Y., Schapire, R.: Experiments with a new boosting algorithm. In: Machine Learning, Proceedings of the Thirteenth International Conference (ICML’96), pp. 148–156 (1996)

  • Han, J.: How can data mining help bio-data analysis? In: Proceedings of the 2nd ACM SIGKDD Workshop on Data Mining in Bioinformatics (BIOKDD’2002), Edmonton, Canada, pp. 1–2 (2002)

  • Han J., Kamber M.: Data Mining: Concepts and Techniques. 2nd edn. Morgan Kaufmann, San Francisco (2006)

    Google Scholar 

  • Larose D.: Discovering Knowledge in Data. 1st edn. Willey, New Jersey (2005)

    MATH  Google Scholar 

  • Lehmann E., Casella G.: Theory of Point Estimation. 2nd edn. Springer-Verlag, New York (1998)

    MATH  Google Scholar 

  • Luan, J.: Data mining and knowledge management: a system analysis for establishing a Tiered Knowledge Management Model (TKMM). In: Proceedings of AIR Forum, Toronto, Canada (2001)

  • Luan, J.: Data mining and knowledge management in higher education-potential applications. In: Proceedings of AIR Forum, Toronto, Canada, pp. 1–18 (2002a)

  • Luan, J.: Data Mining Application in Higher Education. SPSS Executive Report, pp. 1–8 (2002b)

  • Mingers J.: Expert Systems-Rule Induction with Statistical Data. J. Oper. Res. Soc. 38, 39–47 (1987)

    Google Scholar 

  • Mingers J.: An empirical comparison of pruning methods for decision tree induction. Mach. Learn. 4(2), 227–243 (1989)

    Article  Google Scholar 

  • Mitchell T.: Machine Learning. 1st edn. McGraw-Hill, Boston (1997)

    MATH  Google Scholar 

  • Mobasher, B., Jain, N., Han, E., Srivastava, J.: Web Mining: Pattern Discovery from World Wide Web Transactions. Technical Report TR96-OS0. Department of Computer Science, University of Minnesota (1996)

  • Opitz D., Maclin R.: Popular ensemble methods: an empirical study. J. Artif. Intell. Res. 11, 169–198 (1999)

    MATH  Google Scholar 

  • Quinlan R.: Simplifying decision trees. Int. J. Man–Mach. Stud. 27, 221–234 (1987)

    Article  Google Scholar 

  • Quinlan R.: C4.5: Programs for Machine Learning. Morgan Kaufmann, San Mateo (1993)

    Google Scholar 

  • Ramaswami M., Bhaskaran R.: A CHAID based performance prediction model in educational data mining. Int. J. Comput. Sci. Issues (IJCSI) 7(1), 10–18 (2010)

    Google Scholar 

  • Rokach L., Maimon O.: Data Mining with Decision Trees: Theory and Applications. World Scientific Publishing, Danvers (2008)

    MATH  Google Scholar 

  • Romero C., Ventura S.: Educational data mining: a review of the state-of-the-art. IEEE Trans. Syst. Man Cybern. C Appl. Rev. 40(6), 601–618 (2010)

    Article  Google Scholar 

  • Schafer, J.B.: The application of data-mining to recommender systems. In: Encyclopedia of Data Warehousing and Mining, vol. 1, pp. 44–48. Idea Group Reference, Hershey, PA (2005)

  • Vialardi, C., Bravo, J., Shafti, L. Ortigosa, A.: Recommendation in higher education using data mining techniques. In: Proceedings of Second Educational Data Mining Conference, Córdoba, Spain, pp. 190–199 (2009)

  • Vialardi, C., Chue, J., Barrientos, A., Victoria, D., Estrella, J., Ortigosa, A., Peche, J.: A case study: data mining applied to student enrollment. In: Proceedings of Third Educational Data Mining Conference, Pennsylvania, USA, pp. 333–335 (2010)

  • Waiyamai, K.: Improving Quality of Graduate Students by Data Mining. Department of Computer Engineering, Faculty of Engineering, Kasetsart University, Bangkok (2003)

  • Wu X. et al.: Top ten algorithms in data mining. Knowl. Inform. Syst. 14(1), 1–37 (2008)

    Article  Google Scholar 

  • Zaïane, O.: Building a recommender agent for E-learning systems. In: International Conference on Computers in Education, New Zealand, pp. 55–59 (2002)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to César Vialardi.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Vialardi, C., Chue, J., Peche, J.P. et al. A data mining approach to guide students through the enrollment process based on academic performance. User Model User-Adap Inter 21, 217–248 (2011). https://doi.org/10.1007/s11257-011-9098-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11257-011-9098-4

Keywords

Navigation