Abstract
This research focuses on enhancing water potability classification through the integration of three machine learning techniques. A comparative analysis of diverse classification methods is conducted, applying a variable extraction approach at multiple thresholds. The primary objective is to streamline the input set and thereby significantly reduce computational cost and model complexity. A smaller input set also makes training more efficient and shortens training time. To rigorously evaluate the model’s performance, K-fold cross-validation is implemented within this framework. This approach contributes to the advancement of water quality assessment methodologies, with potential implications for improving the efficiency and reliability of water potability classification models.
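The workflow outlined in the abstract (variable extraction at several thresholds, a comparison of three classifiers, and K-fold cross-validation) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation. The abstract names neither the three techniques nor the extraction method, so k-nearest neighbours, a support vector machine, a small neural network, PCA with an explained-variance threshold, the threshold values, the number of folds, and the synthetic data are all placeholders chosen for illustration.

```python
from sklearn.datasets import make_classification
from sklearn.decomposition import PCA
from sklearn.model_selection import StratifiedKFold, cross_val_score
from sklearn.neighbors import KNeighborsClassifier
from sklearn.neural_network import MLPClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Synthetic stand-in for a potability table; this is NOT the paper's dataset.
X, y = make_classification(
    n_samples=300, n_features=9, n_informative=5, random_state=0
)

# Assumed classifiers: the abstract does not say which three were used.
classifiers = {
    "knn": lambda: KNeighborsClassifier(n_neighbors=5),
    "svm": lambda: SVC(kernel="rbf"),
    "mlp": lambda: MLPClassifier(
        hidden_layer_sizes=(16,), max_iter=1000, random_state=0
    ),
}

# Assumed thresholds: fraction of variance the retained components must explain.
variance_thresholds = [0.80, 0.90, 0.95]

# Assumed K = 5; stratification keeps the class ratio similar in every fold.
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)

results = {}
for threshold in variance_thresholds:
    for name, make_clf in classifiers.items():
        # Scaling and PCA sit inside the pipeline, so they are refit on the
        # training part of each fold and never see the held-out part.
        model = make_pipeline(
            StandardScaler(),
            PCA(n_components=threshold, svd_solver="full"),
            make_clf(),
        )
        scores = cross_val_score(model, X, y, cv=cv, scoring="accuracy")
        results[(name, threshold)] = scores.mean()

for (name, threshold), accuracy in sorted(results.items()):
    print(f"{name:>3}  variance>={threshold:.2f}  mean accuracy={accuracy:.3f}")
```

Placing the extraction step inside the pipeline is a deliberate choice: fitting it on the full dataset before splitting would leak information from the validation folds into training and inflate the cross-validated scores.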
Acknowledgement
Míriam Timiraos’s research was supported by the “Xunta de Galicia” through grants to industrial PhD (http://gain.xunta.gal/), under the “Doutoramento Industrial 2022” grant with reference: 04_IN606D_2022_2692965.
Antonio Díaz-Longueira’s research was supported by the Xunta de Galicia (Regional Government of Galicia) through grants to Ph.D. (http://gain.xunta.gal), under the “Axudas á etapa predoutoral” grant with reference: ED481A2023072.
Grant PID2022-137152NB-I00 funded by MICIU/AEI/10.13039/501100011033 and by ERDF/EU.
Xunta de Galicia. Grants for the consolidation and structuring of competitive research units, GPC (ED431B 2023/49)
CITIC, as a center accredited for excellence within the Galician University System and a member of the CIGUS Network, receives subsidies from the Department of Education, Science, Universities, and Vocational Training of the Xunta de Galicia. Additionally, it is co-financed by the EU through the FEDER Galicia 2021-27 operational program (Ref. ED431G 2023/01).
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Timiraos, M., Díaz-Longueira, A., Zayas-Gato, F., Casteleiro-Roca, JL., Fontenla-Romero, Ó., Calvo-Rolle, J.L. (2024). A Machine Learning - Based System for Determining Water Potability. In: Zayas-Gato, F., Díaz-Longueira, A., Casteleiro-Roca, JL., Jove, E. (eds) Distributed Computing and Artificial Intelligence, Special Sessions III - Intelligent Systems Applications, 21st International Conference. DCAI 2024. Lecture Notes in Networks and Systems, vol 1173. Springer, Cham. https://doi.org/10.1007/978-3-031-73910-1_1
DOI: https://doi.org/10.1007/978-3-031-73910-1_1
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-73909-5
Online ISBN: 978-3-031-73910-1
eBook Packages: Intelligent Technologies and Robotics, Intelligent Technologies and Robotics (R0)