Abstract
Information is everywhere in a hidden and scattered way. It becomes useful when we apply Data mining to extracts the hidden, meaningful, and potentially useful patterns from these vast data resources. Educational data mining ensures a quality education by analyzing educational data based on various aspects. In this paper, we have analyzed the academic results and behavior of some engineering students. For this study, we collect data from 80 students from the CSE department. We gather data from mark sheets and other relevant factors that accelerate the results, collected through a survey. Our main goal is to predict the students’ performance. According to this prediction, the counseling department will guide them in advance so that those who are likely to have bad results can do better. The classification can be based on various aspects, as many factors improve the educational system. We have created two datasets focusing on two different angles. Our first dataset classifies and predicts the category of a student (good, bad, medium) on a specific course based on their prerequisite course performance. We have implemented this in the artificial intelligence course. Our second dataset also classifies and predicts the final grade (A, B, C) of any random subject, here we organize our data such a way where it will only focus on how their performance was till the midterm exam. We analyze and compare six classification algorithms. We have focused on all aspects of an algorithm, not only the accuracy level but also the complexity and cost. We have built two final models for two of our datasets based on a decision tree and the naive Bayes algorithms accordingly.









Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Ahmed, S., Paul, R., & Hoque, A.S.Md.L. (2014). Knowledge discovery from academic data using Association Rule Mining. In 2014 17th International Conference on Computer and Information Technology (ICCIT: IEEE.
Alasadi, S.A., & Bhaya, W.S. (2017). Review of data preprocessing techniques in data mining. Journal of Engineering and Applied Sciences, 12.16, 4102–4107.
Bhardwaj, B.K., & Pal, S. (2012). Data Mining: A prediction for performance improvement using classification. arXiv preprint arXiv:1201.3418.
Bhargavi, P., & Jyothi, S. (2009). Applying naive bayes data mining technique for classification of agricultural land soils. International Journal of Computer Science and Network Security, 9.8, 117–122.
Cao, Y., & Wu, J. (2004). Dynamics of projective adaptive resonance theory model: the founda-tion of part algorithm. IEEE Transactions on Neural Networks, 15(2), 245–260.
Francis, B.K., & Babu, S.S. (2019). Predicting academic performance of students using a hybrid data mining approach. Journal of Medical Systems, 43.6, 162.
García, S., & Luengo, J. (2015). And Francisco Herrera. Data preprocessing in data mining. Cham: Springer International Publishing.
Hussain, S., & et al. (2018). Educational data mining and analysis of students’ academic performance using WEKA. Indonesian Journal of Electrical Engineering and Computer Science, 9.2, 447–459.
Kabra, R.R., & Bichkar, R.S. (2011). Performance prediction of engineering students using decision trees. International Journal of Computer Applications, 36.11, 8–12.
Pathan, A.A., & et al. (2014). Educational data mining: A mining model fordeveloping students’ programming skills. In The 8th International Conference onSoftware, Knowledge, Information Management and Applications (SKIMA 2014): IEEE.
Priyam, A., & et al. (2013). Comparative analysis of decision tree classification algorithms. International Journal of Current Engineering and Technology, 3.2, 334–337.
Rahman, Md.H., & Islam, Md.R. (2017). Predict Student’s Academic Performance and Evaluate the Impact of Different Attributes on the Performance Using Data Mining Techniques. In 2017 2nd International Conference on Electrical and Electronic Engineering (ICEEE): IEEE.
Rashu, R.I., Haq, N., & Rahman, R.M. (2014). Data mining approaches to predict final grade by overcoming class imbalance problem. In 2014 17th International Conference on Computer and Information Technology (ICCIT): IEEE.
Tan, P.-N., Steinbach, M., & Kumar, V. (2016). Introduction to data mining. Pearson Education India.
Veeramuthu, P., & Periasamy, R. (2014). Application of higher education system for predicting student using data mining techniques. International Journal of Innovative Research in Advanced Engineering (IJIRAE), 1.5, 36–38.
Zhou, Z.-H. (2009). Ensemble learning. Encyclopedia of Biometrics, 1, 270–273.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix
Appendix
-
ANN: ANN or Artificial Neural Network is a computational algorithm. This algorithm has been designed by using the concept of human neuron. It process information as like human brain analyze and processes information. It has the capability of self-learning that enables it to produce better results.
-
KNN: KNN stands for K-Nearest Neighbour. It is a simple Machine Learning algorithm based on Supervised Learning technique. This algorithm find outs the similarity between new and existing data and puts the new data into the most category which is most similar with the available categories.
-
Weka: An open source machine learning software with a collection of machine learning algorithms and data preprocessing tools. By using this software users can try out existing machine learning methods on their datasets in a flexible way.
Rights and permissions
About this article
Cite this article
Nahar, K., Shova, B.I., Ria, T. et al. Mining educational data to predict students performance. Educ Inf Technol 26, 6051–6067 (2021). https://doi.org/10.1007/s10639-021-10575-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10639-021-10575-3