
Wastewater Treatment Plant Performance Prediction with Support Vector Machines

  • Conference paper
Advances in Data Mining. Applications and Theoretical Aspects (ICDM 2013)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 7987)


Abstract

Wastewater treatment plants are essential infrastructure for maintaining the environmental balance of the regions where they are installed. The wastewater treatment process is dynamic and complex, and it must be handled efficiently to ensure good-quality effluent. This paper presents research and development work carried out to predict the performance of a wastewater treatment plant located in northern Portugal, serving a population of about 45,000 inhabitants. The data consist of daily averaged values of the measured parameters, recorded over a period of one year. The predictive models were developed with two implementations of Support Vector Machine methods for regression, due to the presence of two treatment lines in the selected case study, using two of the most relevant output parameters of a wastewater treatment plant: the biochemical oxygen demand and the total suspended solids. We describe the wastewater treatment plant studied as well as the data sets used in the mining processes, and we analyze and compare the regression models for both selected predictive parameters.

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ribeiro, D., Sanfins, A., Belo, O. (2013). Wastewater Treatment Plant Performance Prediction with Support Vector Machines. In: Perner, P. (ed.) Advances in Data Mining. Applications and Theoretical Aspects. ICDM 2013. Lecture Notes in Computer Science, vol 7987. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-39736-3_8

  • DOI: https://doi.org/10.1007/978-3-642-39736-3_8

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-39735-6

  • Online ISBN: 978-3-642-39736-3

  • eBook Packages: Computer Science, Computer Science (R0)
