Abstract
Wastewater treatment plants are essential infrastructures to maintain the environmental balance of the regions where they were installed. The dynamic and complex wastewater treatment procedure must be handled efficiently to ensure good quality effluents. This paper presents a research and development work implemented to predict the performance of a wastewater treatment plant located in the northern Portugal, serving a population of about 45,000 inhabitants. The data we used were recorded based on the daily averaged values of the measured parameters during the period of one year. The predictive models were developed supported by two implementations of Support Vector Machines methods for regression, due to the presence of two lines of treatment in the selected case of study, using two of the most relevant output parameters of a wastewater treatment plant: the biochemical oxygen demand and the total suspended solids. We describe here the wastewater treatment plant we studied as well the data sets used in the mining processes, analyzing and comparing the regression models for both predictive parameters that were selected.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Ali, S., Smith-Miles, K.A.: Improved Support Vector Machine Generalization Using Normalized Input Space. In: Sattar, A., Kang, B.-H. (eds.) AI 2006. LNCS (LNAI), vol. 4304, pp. 362–371. Springer, Heidelberg (2006)
Atanasova, N., Kompare, B.: Modelling of Wastewater Treatment Plant with Decision and Regression Trees. In: 3rd Workshop on Binding Environmental Sciences and Artificial Intelligence (2002a)
Atanasova, N., Kompare, B.: Modelling of waste water treatment plant with regression trees. In: Proc. of the Third International Conference on Data Mining. WIT Press, Bologna (2002b)
Belanche, L.A., et al.: Towards a Model of Input-Output Behaviour of Wastewater Treatment Plants using Soft Computing Techniques. Environmental Modeling and Software 14(5), 409–419 (1999)
Boser, B.E., Guyon, I.M., Vapnik, V.N.: A training algorithm for optimal margin classifiers. In: Haussler, D. (ed.) Proceedings of the Fifth Annual Workshop on Computational Learning Theory, COLT 1992. ACM Press (1992)
Burges, C.J.C.: A Tutorial on Support Vector Machines for Pattern Recognition. Data Mining and Knowledge Discovery 2(2), 121–167 (1998)
Cărbureanu, M.: Pollution Level Analysis of a Wastewater Treatment Plant Emissary using Data Mining. Petroleum-Gas University of Ploieşti Bulletin Mathematics-Informatics-Physics Series LXII(1), 69–78 (2010)
Chang, C.-C., Lin, C.-J.: LIBSVM: A library for support vector machines. ACM Transactions on Intelligent Systems and Technology 2(3), 27:1–27:27 (2011)
Cortes, C., Vapnik, V.: Support-vector networks. Machine Learning 20(3), 273–297 (1995)
Cortez, P.: Data Mining with Multilayer Perceptrons and Support Vector Machines. In: Holmes, D.E., Jain, L.C. (eds.) Data Mining: Found. & Intell. Paradigms. ISRL, vol. 24, pp. 9–25. Springer, Heidelberg (2012)
Cortez, P., et al.: Lamb Meat Quality Assessment by Support Vector Machines. Processing Letters 24(1), 41–51 (2006)
Dürrenmatt, D.J.: Data Mining and Data-Driven Modelling Approaches to Support Wastewater Treatment Plant Operation. PhD Thesis. ETH, Zurique (2011)
Dixon, M., et al.: Data mining to support anaerobic WWTP monitoring. Control Engineering Practice 15(8), 987–999 (2007)
Drucker, H., et al.: Support vector regression machines. Electronic Engineering 1, 155–161 (1997)
Flexer, A.: Statistical Evaluation of Neural Network Experiments: Minimum Requirements and Current Practice. In: 3th European Meeting on Cybernetics and Systems Research (1996)
Gallop, J.R., et al.: The use of data mining for the monitoring and control of anaerobic wastewater plants. In: 4th International Workshop on Environmental Applications of Machine Learning (2004)
Guyon, I., Elisseeff, A.: An Introduction to Variable and Feature Selection. Journal of Machine Learning Research 3(7-8), 1157–1182 (2003)
Hamed, M.M., Khalafallah, M.G., Hassanien, E.A.: Prediction of wastewater treatment plant performance using artificial neural networks. Environmental Modelling Software 19(10), 919–928 (2004)
Hong, Y., Fei, L., Yuge, X., Jin, L.: GA Based LS-SVM Classifier for Waste Water Treatment Process. In: 27th Chinese Control Conference (2008)
Hsu, C.-W., Chang, C.-C., Lin, C.-J.: A Practical Guide to Support Vector Classification. Bioinformatics 1(1), 1–16 (2010)
Huang, Z., Luo, J., Li, X., Zhou, Y.: Prediction of Effluent Parameters of Wastewater Treatment Plant Based on Improved Least Square Support Vector Machine with PSO. In: 1st International Conference on Information Science and Engineering, ICISE (2009)
Kim, J.-H.: Estimating classification error rate: Repeated cross-validation, repeated hold-out and bootstrap. Computational Statistics and Data Analysis 53(11), 3735–3745 (2009)
Kohavi, R.: A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection. In: International Joint Conference on Artificial Intelligence, Montreal, Canada (1995)
Kohavi, R., John, G.H.: Wrappers for feature subset selection. Artificial Intelligence 97(1-2), 273–324 (1997)
Luengo, J., GarcÃa, S., Herrera, F.: On the choice of the best imputation methods for missing values considering three groups of classification methods. Knowledge and Information Systems, 1–32 (2011)
Luo, F., Yu, R.-H., Xu, Y.-G., Li, Y.: Effluent Quality Prediction of Wastewater Treatment Plant Based on Fuzzy-Rough Sets and Artificial Neural Networks. In: Sixth International Conference on Fuzzy Systems and Knowledge Discovery - FSKD 2009 (2009)
Metcafl, Eddy: Wastewater Engineering: Treatment and Reuse, 4th edn. McGraw-Hill (2003)
Molinaro, A.M., Simon, R., Pfeiffer, R.M.: Prediction error estimation: a comparison of resampling methods. Bioinformatics 21(15), 3301–3307 (2005)
Platt, J.C.: Using Analytic QP and Sparseness to Speed Training of Support Vector Machines. Optimization 11, 1–8 (1999)
Pyle, D.: Data Preparation for Data Mining. Morgan Kaufmann Publishers, Inc., San Francisco (1999)
Schölkopf, B., Smola, A., Williamson, R., Bartlett, P.L.: New support vector algorithms. Neural Computation 12(5), 1207–1245 (2000)
Shevade, S.K., Keerthi, S.S., Bhattacharyya, C., Murthy: Improvements to the SMO Algorithm for SVM Regression. IEEE Transactions on Neural Networks 11(5), 1188–1193 (2000)
Smola, A.J., Schölkopf, B.: A tutorial on support vector regression. Statistics and Computing 14(3), 199–222 (2004)
Vapnik, V.N.: The Nature of Statistical Learning Theory, 2nd edn., New York (1995)
Vapnik, V.N.: An overview of statistical learning theory. IEEE Transactions on Neural Networks 10(5), 988–999 (1999)
Vapnik, V.N., Chervonenkis, A.Y.: Theory of pattern recognition. Nauka, Moscow (1974) (in Russian)
Wang, L.-J., Chen, C.-B.: Support Vector Machine Applying in the Prediction of Effluent Quality of Sewage Treatment Plant with Cyclic Activated Sludge System Process. In: IEEE International Symposium on Knowledge Acquisition and Modeling Workshop. KAM Workshop 2008 (2008)
WEKA, n.d. Class SMOReg, http://weka.sourceforge.net/doc/weka/classifiers/functions/SMOreg.html (accessed July 2, 2012)
Witten, I.H., Frank, E., Hall, M.A.: Data Minig: Pratical Machine Learnign Tools and Tecniques, 3rd edn. Morgan Kaufmann (2011)
Yang, B.-l., Zhao, D.-A., Zhang, J.: Prediction system of sewage outflow COD based on LS-SVM. In: 2nd International Conference on Intelligent Control and Information Processing, ICICIP (2011)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ribeiro, D., Sanfins, A., Belo, O. (2013). Wastewater Treatment Plant Performance Prediction with Support Vector Machines. In: Perner, P. (eds) Advances in Data Mining. Applications and Theoretical Aspects. ICDM 2013. Lecture Notes in Computer Science(), vol 7987. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-39736-3_8
Download citation
DOI: https://doi.org/10.1007/978-3-642-39736-3_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-39735-6
Online ISBN: 978-3-642-39736-3
eBook Packages: Computer ScienceComputer Science (R0)