skip to main content
10.1145/3319619.3326745acmconferencesArticle/Chapter ViewAbstractPublication PagesgeccoConference Proceedingsconference-collections
abstract

Monitoring of drinking-water quality by means of a multi-objective ensemble learning approach

Published: 13 July 2019 Publication History

Abstract

This paper proposes the use of multi-objective ensemble learning to monitor drinking-water quality. Such problem consists of a data set with an extreme imbalance ratio where the events, the minority class, must be correctly detected given a time series denoting water quality and operative data on a minutely basis. First, the given data set is preprocessed for imputing missing data, adjusting concept drift and adding new statistical features, such as moving average, moving standard deviation, moving maximum and moving minimum. Next, two ensemble learning techniques are used, namely SMOTEBoost and RUSBoost. Such techniques have been developed specifically for dealing with imbalanced data, where the base learners are trained by adjusting the ratio between the classes. The first algorithm focuses on oversampling the minority class, while the second focuses on under-sampling the majority class. Finally, multi-objective optimisation is used for pruning the base models of such ensembles in order to maximise the prediction score without reducing generalisation performance. In the training phase, the model is trained, optimised and evaluated using hold-out validation on a given training data set. At the end, the trained model is inserted into a framework, which is used for online event detection and assessing the model's performance.

References

[1]
Nitesh V Chawla, Aleksandar Lazarevic, Lawrence O Hall, and Kevin W Bowyer. 2003. SMOTEBoost: Improving prediction of the minority class in boosting. In European conference on principles of data mining and knowledge discovery. Springer, 107--119.
[2]
Achille Messac. 1996. Physical programming: effective optimization for computational design. AIAA journal 34, 1 (1996), 149--158.
[3]
Frederik Rehbach, Steffen Moritz, and Thomas Bartz-Beielstein. 2019. GECCO 2019 Industrial Challenge: Monitoring of drinking-water quality. (2019). https://www.th-koeln.de/mam/downloads/deutsch/hochschule/fakultaeten/informatik_und_ingenieurwissenschaften/rulesgeccoic2019.pdf
[4]
Gilberto Reynoso-Meza, Javier Sanchis, Xavier Blasco, and Roberto Z Freire. 2016. Evolutionary multi-objective optimisation with preferences for multivariable PI controller tuning. Expert Systems with Applications 51 (2016), 120--133.
[5]
Gilberto Reynoso-Meza, Javier Sanchis, Xavier Blasco, and Miguel Martínez. 2016. Preference driven multi-objective optimization design procedure for industrial controller tuning. Information Sciences 339 (2016), 108--131.
[6]
Victor Henrique Alves Ribeiro and Gilberto Reynoso-Meza. 2018. Online anomaly detection for drinking water quality using a multi-objective machine learning approach. In Proceedings of the Genetic and Evolutionary Computation Conference Companion. ACM, 1--2.
[7]
Chris Seiffert, Taghi M Khoshgoftaar, Jason Van Hulse, and Amri Napolitano. 2010. RUSBoost: A hybrid approach to alleviating class imbalance. IEEE Transactions on Systems, Man, and Cybernetics-Part A: Systems and Humans 40, 1 (2010), 185--197.
[8]
Qingfu Zhang and Hui Li. 2007. MOEA/D: A multiobjective evolutionary algorithm based on decomposition. IEEE Transactions on evolutionary computation 11, 6 (2007), 712--731.

Cited By

View all
  • (2024)Industrial Environmental Impact Assessment Method Based on Detection of Complex Anomalies in Time SeriesApplied System Innovation10.3390/asi70500897:5(89)Online publication date: 24-Sep-2024
  • (2024)Applications of Machine Learning in Drinking Water Quality Management: A Critical Review on Water Distribution SystemJournal of Cleaner Production10.1016/j.jclepro.2024.144171(144171)Online publication date: Nov-2024
  • (2023)Multi-criteria Decision-Making Techniques for the Selection of Pareto-optimal Machine Learning Models in a Drinking-Water Quality Monitoring ProblemInternational Journal of Information Technology & Decision Making10.1142/S021962202350010423:01(447-474)Online publication date: 13-Feb-2023
  • Show More Cited By

Index Terms

  1. Monitoring of drinking-water quality by means of a multi-objective ensemble learning approach

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      GECCO '19: Proceedings of the Genetic and Evolutionary Computation Conference Companion
      July 2019
      2161 pages
      ISBN:9781450367486
      DOI:10.1145/3319619
      Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 13 July 2019

      Check for updates

      Author Tags

      1. anomaly detection
      2. evolutionary computation
      3. machine learning
      4. time series

      Qualifiers

      • Abstract

      Funding Sources

      • Conselho Nacional de Desenvolvimento Científico e Tecnológico
      • Coordenação de Aperfeiçoamento de Pessoal de Nível Superior

      Conference

      GECCO '19
      Sponsor:
      GECCO '19: Genetic and Evolutionary Computation Conference
      July 13 - 17, 2019
      Prague, Czech Republic

      Acceptance Rates

      Overall Acceptance Rate 1,669 of 4,410 submissions, 38%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)8
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 07 Mar 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Industrial Environmental Impact Assessment Method Based on Detection of Complex Anomalies in Time SeriesApplied System Innovation10.3390/asi70500897:5(89)Online publication date: 24-Sep-2024
      • (2024)Applications of Machine Learning in Drinking Water Quality Management: A Critical Review on Water Distribution SystemJournal of Cleaner Production10.1016/j.jclepro.2024.144171(144171)Online publication date: Nov-2024
      • (2023)Multi-criteria Decision-Making Techniques for the Selection of Pareto-optimal Machine Learning Models in a Drinking-Water Quality Monitoring ProblemInternational Journal of Information Technology & Decision Making10.1142/S021962202350010423:01(447-474)Online publication date: 13-Feb-2023
      • (2023)Evaluation of Machine Learning Algorithms on Finding Drinking Water Quality Based on Feature Selection Methodologies2023 9th International Conference on Advanced Computing and Communication Systems (ICACCS)10.1109/ICACCS57279.2023.10112799(1883-1888)Online publication date: 17-Mar-2023
      • (2023)Detecting Anomalies in Multidimensional Time Series Using Binary ClassificationCreativity in Intelligent Technologies and Data Science10.1007/978-3-031-44615-3_22(323-336)Online publication date: 14-Oct-2023
      • (2021)Accessing Imbalance Learning Using Dynamic Selection Approach in Water Quality Anomaly DetectionSymmetry10.3390/sym1305081813:5(818)Online publication date: 7-May-2021
      • (2020)Empirical Comparison of Approaches for Mitigating Effects of Class Imbalances in Water Quality Anomaly DetectionIEEE Access10.1109/ACCESS.2020.30386588(218015-218036)Online publication date: 2020

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media