Skip to main content
Log in

Predicting the impact of internet usage on students’ academic performance using machine learning techniques in Bangladesh perspective

  • Original Article
  • Published:
Social Network Analysis and Mining Aims and scope Submit manuscript

Abstract

Education systems have significantly changed with the emergence of the internet. It has a significant impact on how students learn things. Nevertheless, its impact can also be contradicting. Internet addiction can slowly poison the minds of our youths and stand in the way of pursuing their goals. Although Bangladesh has internet connectivity across the country, its potential could be more utilized, particularly in the educational sector. Therefore, proper analysis of the effects of the internet on students, as well as determining the prominent factors relevant to the internet, is a necessary task. In addition, predicting students' academic performance can help determine the changes that must be incorporated to improve the educational system. Hence, this research analyzes the effects of internet usage on students' academic progress and then predicts the students' performance using distinct machine learning (ML) algorithms. The data were collected through an offline survey from Noakhali, Bangladesh. The collected data is preprocessed to select the most relevant features. The preprocessed data were fed into ML algorithms to investigate their behaviors. We have employed logistic regression, decision tree, random forest, and Naïve Bayes algorithms to see their classification performance on our dataset. To minimize the overfitting issue, k-fold cross-validation and hyperparameter optimization have been applied. The results were presented in two parts—exploratory data analysis and classification. Exploratory data analysis shows that the main purpose of internet usage is education and entertainment for school students, social media and entertainment for college students, and education and social media for university students. School and university students browse the internet mainly for academic purposes, whereas college students browse mainly for non-academic purposes. Students prefer to browse the internet at night. For all schools, colleges, and universities, students with better results generally visited websites like Google and YouTube. Students with moderate or bad results generally spent time on social media platforms (mainly Facebook and WhatsApp). Then, the results of the numerical analysis performed with classification algorithms are presented. Results indicate that random forest gives the maximum score in our dataset in all sectors, like accuracy, precision, recall, and f1 score. It gives a maximum of 85% accuracy on the test set. Logistic regression gives the second-best score of 69%. The practical applications and policy recommendations for Bangladesh's education sector are also discussed. The output of this work can contribute to building a policy on internet usage. In this way, it is possible to make the students more concentrative on their education and learning.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18
Fig. 19

Similar content being viewed by others

References

Download references

Author information

Authors and Affiliations

Authors

Contributions

SHH Conceptualization, investigation, data collection, data curation, writing—original draft, model training, analysis and interpretation of results. MdARK conceptualization, investigation, writing—review and editing, model training, analysis and interpretation of results, supervision and investigation on challenges. IA draft manuscript preparation, writing—review and editing, analysis and interpretation of results. MR, MdASK, and SE study conception, design, data collection.

Corresponding author

Correspondence to Md. Ashikur Rahman Khan.

Ethics declarations

Conflict of interest

The authors declare no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 79 kb)

Supplementary file2 (CSV 37 kb)

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Hemal, S.H., Khan, M.A.R., Ahammad, I. et al. Predicting the impact of internet usage on students’ academic performance using machine learning techniques in Bangladesh perspective. Soc. Netw. Anal. Min. 14, 66 (2024). https://doi.org/10.1007/s13278-024-01234-9

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s13278-024-01234-9

Keywords