Skip to main content

Abstract

This research focuses on enhancing water potability classification through the integration of three machine learning techniques. A comparative analysis of diverse classification methods is conducted, incorporating multiple thresholds by employing a variable extraction approach. The primary objective is to streamline the input set, aiming for a significant reduction in computational costs and model complexity. This streamlined approach not only facilitates a more efficient training process but also implies a shorter duration for model training. To rigorously evaluate the model’s performance, a K-fold cross-validation is implemented within this framework. This comprehensive approach contributes to the advancement of water quality assessment methodologies, with potential implications for improving the efficiency and reliability of potability water classification models.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Dataset, October 2023. https://www.kaggle.com/datasets/xiaoxiaoliangzi/water-potability-prediction

  2. Efficient KNN classification algorithm for big data. Neurocomputing 195, 143–148 (2016). https://doi.org/10.1016/j.neucom.2015.08.112, learning for Medical Imaging

  3. Arkaan Susila, M.T., Nadiyansyah Firdaus, I., Chuzairi, M.F., Wibawa, I.P.D., Kallista, M.: Denoising autoencoder - extreme learning machine for improving the classification performance integrated with IoT water quality systems. In: 2023 10th International Conference on Electrical Engineering, Computer Science and Informatics (EECSI), pp. 405–411 (2023). https://doi.org/10.1109/EECSI59885.2023.10295703

  4. Bejani, M.M., Ghatee, M.: A systematic review on overfitting control in shallow and deep neural networks. Artif. Intell. Rev. 54, 6391–6438 (2021)

    Article  Google Scholar 

  5. Cortes, C., Vapnik, V.: Support-vector networks. Mach. Learn. 20, 273–297 (1995)

    Article  Google Scholar 

  6. Falkenmark, M., Widstrand, C.: Population and water resources: a delicate balance. Popul. Bull. 47(3), 1–36 (1992)

    Google Scholar 

  7. F.R.S., K.P.: LIII. on lines and planes of closest fit to systems of points in space. Philos. Mag. Ser. 1 2, 559–572 (1901)

    Google Scholar 

  8. Guo, G., Wang, H., Bell, D., Bi, Y., Greer, K.: KNN model-based approach in classification. In: Meersman, R., Tari, Z., Schmidt, D.C. (eds.) OTM 2003. LNCS, vol. 2888, pp. 986–996. Springer, Heidelberg (2003). https://doi.org/10.1007/978-3-540-39964-3_62

    Chapter  Google Scholar 

  9. Janani, S., Vydehi, S.: Chaos-enhanced bat algorithm for optimizing hyper parameters in water quality classification. In: 2023 International Conference on Self Sustainable Artificial Intelligence Systems (ICSSAS), pp. 743–750 (2023). https://doi.org/10.1109/ICSSAS57918.2023.10331752

  10. Jinan, A., Siregar, M., Rolanda, V., Suryani, D.F., Muis, A.: Comparing neural networks, support vector machines, and Naïve Bayes Algorhythms for classifying banana types. J. Comput. Netw. Archit. High Perform. Comput. 6(1), 98–107 (2024)

    Google Scholar 

  11. Khoirul Haq, M.I., Dwi Ramadhan, F., Az-Zahra, F., Kurniawati, L., Helen, A.: Classification of water portability using machine learning algorithms. In: 2021 International Conference on Artificial Intelligence and Big Data Analytics, pp. 1–5 (2021). https://doi.org/10.1109/ICAIBDA53487.2021.9689727

  12. Preetham, M., Krishnan, M.: Improving water quality prediction and monitoring using machine learning algorithms. In: 2023 12th International Conference on System Modeling and Advancement in Research Trends (SMART), pp. 367–373 (2023). https://doi.org/10.1109/SMART59791.2023.10428490

  13. Priskilla Angel Rani, J., Nivasini, R., Yesubai Rubavathi, C., Jona, P.: Machine learning based real time water quality monitoring system. In: 2023 Third International Conference on Artificial Intelligence and Smart Energy (ICAIS), pp. 1366–1370 (2023). https://doi.org/10.1109/ICAIS56108.2023.10073761

  14. Samek, W., Montavon, G., Lapuschkin, S., Anders, C.J., Müller, K.R.: Explaining deep neural networks and beyond: a review of methods and applications. Proc. IEEE 109(3), 247–278 (2021). https://doi.org/10.1109/JPROC.2021.3060483

    Article  Google Scholar 

  15. Sulaiman, K., Ismail, L.H., Razi, M.A.M., Adnan, M.S., Ghazali, R.: Water quality classification using an artificial neural network (ANN). In: IOP Conference Series: Materials Science and Engineering, vol. 601, p. 012005. IOP Publishing (2019)

    Google Scholar 

  16. Uddin, M.G., Nash, S., Olbert, A.I.: A review of water quality index models and their use for assessing surface water quality. Ecol. Ind. 122, 107218 (2021)

    Article  Google Scholar 

  17. Xie, X.: Principal component analysis. Wiley interdisciplinary reviews (2019)

    Google Scholar 

  18. Yang, T., Ying, Y.: AUC maximization in the era of big data and AI: a survey. ACM Comput. Surv. 55(8) (2022). https://doi.org/10.1145/3554729,

Download references

Acknowledgement

Míriam Timiraos’s research was supported by the “Xunta de Galicia” through grants to industrial PhD (http://gain.xunta.gal/), under the “Doutoramento Industrial 2022” grant with reference: 04_IN606D_2022_ 2692965.

Antonio Díaz-Longueira’s research was supported by the Xunta de Galicia (Regional Government of Galicia) through grants to Ph.D. (http://gain.xunta.gal), under the “Axudas á etapa predoutoral” grant with reference: ED481A2023072.

 Grant PID2022-137152NB-I00 funded by MICIU/AEI/10.13039/501100011033 and by ERDF/EU.

Xunta de Galicia. Grants for the consolidation and structuring of competitive research units, GPC (ED431B 2023/49)

CITIC, as a center accredited for excellence within the Galician University System and a member of the CIGUS Network, receives subsidies from the Department of Education, Science, Universities, and Vocational Training of the Xunta de Galicia. Additionally, it is co-financed by the EU through the FEDER Galicia 2021-27 operational program (Ref. ED431G 2023/01).

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Míriam Timiraos or Antonio Díaz-Longueira .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Timiraos, M., Díaz-Longueira, A., Zayas-Gato, F., Casteleiro-Roca, JL., Fontenla-Romero, Ó., Calvo-Rolle, J.L. (2024). A Machine Learning - Based System for Determining Water Potability. In: Zayas-Gato, F., Díaz-Longueira, A., Casteleiro-Roca, JL., Jove, E. (eds) Distributed Computing and Artificial Intelligence, Special Sessions III - Intelligent Systems Applications, 21st International Conference. DCAI 2024. Lecture Notes in Networks and Systems, vol 1173. Springer, Cham. https://doi.org/10.1007/978-3-031-73910-1_1

Download citation

Publish with us

Policies and ethics