Abstract
Most data streams systems that use online Multi-target regression yield vast amounts of data which is not targeted. Targeting this data is usually impossible, time consuming and expensive. Semi-supervised algorithms have been proposed to use this untargeted data (input information only) for model improvement. However, most algorithms are adapted to work on batch mode for classification and require huge computational and memory resources.
Therefore, this paper proposes an semi-supervised algorithm for online processing systems based on AMRules algorithm that handle both targeted and untargeted data and improves the regression model. The proposed method was evaluated through a comparison between a scenario where the untargeted examples are not used on the training and a scenario where some untargeted examples are used. Evaluation results indicate that the use of the untargeted examples improved the target predictions by improving the model.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Borchani, H., Varando, G., Bielza, C., Larrañaga, P.: A survey on multi-output regression. Wiley Int. Rev. Data Min. Knowl. Disc. 5(5), 216–233 (2015)
Levatic, J., Ceci, M., Kocev, D., Dzeroski, S.: Semi-supervised learning for multi-target regression. In: Third International Workshop, NFMCP, Held in Conjunction with ECML-PKDD, pp. 3–18 (2014)
Zhou, Z.H., Li, M.: Semi-supervised regression with co-training style algorithms. IEEE Trans. Knowl. Data Eng. 19(11), 1479–1493 (2007)
Duarte J., Gama, J.: Multi-target regression from high-speed data streams with adaptive model rules. In: IEEE Conference on Data Science and Advanced Analytics (2015)
Goldberg, A.B., Zhu, X., Furger, A., Jun-Ming, X.: OASIS: online active semi-supervised learning. In: Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, AAAI, San Francisco, California, USA, 7–11 August 2011
Kang, P., Kim, D., Cho, S.: Semi-supervised support vector regression based on self-training with label uncertainty: an application to virtual metrology in semiconductor manufacturing. Expert Syst. Appl. 51, 85–106 (2016)
Ozoh, P., Abd-rahman, S., Labadin, J., Apperley, M.: Article: a comparative analysis of techniques for forecasting electricity consumption. Int. J. Comput. Appl. 88(15), 8–12 (2014)
Chalabi, Z., Mangtani, P., Hashizume, M., Imai, C., Armstrong, B.: Article: time series regression model for infectious disease and weather. Int. J. Environ. Res. 142, 319–327 (2015)
Uslana, H.S.V.: Article: quantitative prediction of peptide binding afnity by using hybrid fuzzy support vector regression. Appl. Soft Comput. 43, 210–221 (2016)
Ariyo, A.A., Adewumi, A.O., Ayo, C.K.: Stock price prediction using the arima model. In: Proceedings of the UKSim-AMSS 16th International Conference on Computer Modelling and Simulation, UKSIM 2014, pp. 106–112, Washington, DC, USA. IEEE Computer Society (2014)
Chapelle, O., Schlkopf, B., Zien, A.: Semi-Supervised Learning, 1st edn. The MIT Press, Cambridge (2010)
Albalate, A., Minker, W.: Semi-supervised and Unsupervised Machine Learning. ISTE/Wiley, London (2011)
Verbeek, J.J., Vlassis, N.: Gaussian fields for semi-supervised regression and correspondence learning. Pattern Recogn. 39(10), 1864–1875 (2006)
Radosavljevic, V., Vucetic, S., Obradovic, Z.: Continuous conditionalrandom fields for regression in remote sensing. In: 19th European Conference on Artificial Intelligence, Proceedings of the 2010 Conference on ECAI 2010, pp. 809–814, Amsterdam, The Netherlands. IOS Press (2010)
Stojanovic, J., Jovanovic, M., Gligorijevic, D., Obradovic, Z.: Semi-supervised learning for structured regression on partially observed attributed graphs. In: SIAM International Conference on Data Mining (SDM) (2015)
Bhattacharyya, B.B.: One sided Chebyshev inequality when the first four moments are known. Commun. Stat. Theor. Methods 16(9), 2789–2791 (1987)
Gama, J., Sebastião, R., Rodrigues, P.P.: On evaluating stream learning algorithms. Mach. Learn. 90(3), 317–346 (2013)
Chen, W.: Passive, Active, and Digital Filters, 3rd edn. CRC Press, Baco Raton (2009)
Friedman, J.H.: Multivariate adaptive regression splines. Ann. Stat. 19(1), 1–67 (1991)
Acknowledgments
This work was partly supported by the European Commission through MAESTRA (ICT-2013-612944) and the Project TEC4Growth - Pervasive Intelligence, Enhancers and Proofs of Concept with Industrial Impact/NORTE-01-0145-FEDER-000020 is financed by the North Portugal Regional Operational Programme (NORTE 2020), under the PORTUGAL 2020 Partnership Agreement, and through the European Regional Development Fund (ERDF).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing AG
About this paper
Cite this paper
Sousa, R., Gama, J. (2016). Online Semi-supervised Learning for Multi-target Regression in Data Streams Using AMRules. In: Boström, H., Knobbe, A., Soares, C., Papapetrou, P. (eds) Advances in Intelligent Data Analysis XV. IDA 2016. Lecture Notes in Computer Science(), vol 9897. Springer, Cham. https://doi.org/10.1007/978-3-319-46349-0_11
Download citation
DOI: https://doi.org/10.1007/978-3-319-46349-0_11
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-46348-3
Online ISBN: 978-3-319-46349-0
eBook Packages: Computer ScienceComputer Science (R0)