
Highly efficient nonlinear regression for big data with lexicographical splitting

  • Original Paper
  • Signal, Image and Video Processing

Abstract

This paper considers the problem of online piecewise linear regression for big data applications. We introduce an algorithm that sequentially achieves the performance of the best piecewise linear (affine) model with the optimal partition of the regressor space, in an individual-sequence manner. To this end, our algorithm constructs a class of \(2^D\) sequential piecewise linear models over a set of partitions of the regressor space and efficiently combines them in a mixture-of-experts setting. We show that the algorithm is highly efficient, with a computational complexity of only \(O(mD^2)\), where m is the dimension of the regressor vectors. This efficiency is achieved by compactly representing all \(2^D\) models with a “lexicographical splitting graph.” We analyze the performance of our algorithm without any statistical assumptions, i.e., our results are guaranteed to hold. Furthermore, we demonstrate the effectiveness of our algorithm on well-known data sets from the machine learning literature, at a fraction of the computational complexity of the state of the art.
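To make the mixture-of-experts idea in the abstract concrete, the following sketch combines \(2^D\) online piecewise-affine predictors on a scalar regressor with exponential weighting. It is a minimal illustration under stated assumptions: the names (D, splits, eta, lr), the LMS per-region updates, and the brute-force enumeration of all \(2^D\) experts are illustrative choices, and the sketch does not reproduce the paper's lexicographical splitting graph or its \(O(mD^2)\) complexity.

```python
import numpy as np

# Minimal sketch (illustrative assumptions, not the paper's algorithm):
# enumerate all 2^D piecewise-affine experts on a scalar regressor and
# combine them with exponential weighting of their squared-error losses.

rng = np.random.default_rng(0)

D = 3                                          # number of candidate split points
splits = np.linspace(-1.0, 1.0, D + 2)[1:-1]   # D interior thresholds in (-1, 1)

# Each expert activates a subset of the thresholds (one of 2^D choices) and
# keeps an affine model (slope, intercept) per region, trained online by LMS.
experts = []
for mask in range(2 ** D):
    active = splits[[bool((mask >> i) & 1) for i in range(D)]]
    edges = np.concatenate(([-np.inf], np.sort(active), [np.inf]))
    experts.append({"edges": edges,
                    "coef": np.zeros((len(edges) - 1, 2))})

weights = np.full(2 ** D, 1.0 / 2 ** D)        # mixture weights over the experts
eta, lr = 0.5, 0.1                             # mixture rate and LMS step size


def expert_predict(exp, x):
    """Affine prediction of one expert in the region containing x."""
    r = int(np.searchsorted(exp["edges"], x)) - 1
    a, b = exp["coef"][r]
    return a * x + b, r


# Online regression of a noisy nonlinear target
for _ in range(2000):
    x = rng.uniform(-1.0, 1.0)
    y = np.sin(np.pi * x) + 0.1 * rng.standard_normal()

    preds = np.empty(2 ** D)
    regions = np.empty(2 ** D, dtype=int)
    for k, exp in enumerate(experts):
        preds[k], regions[k] = expert_predict(exp, x)

    y_hat = weights @ preds                    # combined (mixture) prediction

    # Exponentially weighted update of the mixture weights
    weights = weights * np.exp(-eta * (preds - y) ** 2)
    weights /= weights.sum()

    # LMS update of the active region of every expert
    for k, exp in enumerate(experts):
        exp["coef"][regions[k]] += lr * (y - preds[k]) * np.array([x, 1.0])

print("weight of the finest-partition expert:", weights[-1])
```

The expert using all D thresholds fits the finest partition, so its weight typically grows as it tracks the nonlinear target. The point of the sketch is only the weighting scheme: explicit enumeration scales exponentially in D, which is exactly the cost the paper's graph representation is designed to avoid.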



Acknowledgments

This work is supported in part by the Turkish Academy of Sciences Outstanding Researcher Programme, TUBITAK Contract No. 113E517, and Turk Telekom Inc.

Author information


Corresponding author

Correspondence to Mohammadreza Mohaghegh Neyshabouri.


About this article


Cite this article

Mohaghegh Neyshabouri, M., Demir, O., Delibalta, I. et al. Highly efficient nonlinear regression for big data with lexicographical splitting. SIViP 11, 391–398 (2017). https://doi.org/10.1007/s11760-016-0972-8

