On-line learning of linear functions

Littlestone, Nicholas; Warmuth, Manfred K.; Long, Philip M.

doi:10.1007/BF01277953

Published: March 1995

Volume 5, pages 1–23, (1995)

Journal: computational complexity

Abstract

We present an algorithm for the on-line learning of linear functions which is optimal to within a constant factor with respect to bounds on the sum of squared errors for a worst-case sequence of trials. The bounds are logarithmic in the number of variables. Furthermore, the algorithm is shown to be optimally robust with respect to noise in the data (again to within a constant factor).
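
To make the setting in the abstract concrete, here is a minimal sketch of the on-line protocol: in each trial the learner sees an input vector, predicts a real value, then observes the true value and is charged the squared error. The update used here is the classical Widrow-Hoff (least mean squares) gradient step, chosen only as a familiar stand-in. It is not claimed to be the algorithm of this paper, and the function name `run_online_trials`, the default `learning_rate`, and the trial format are all illustrative assumptions.

```python
def run_online_trials(trials, learning_rate=0.1):
    """Run a sequence of on-line trials and return (weights, total squared error).

    `trials` is a list of (x, y) pairs, where x is a list of floats and y is
    the real value revealed after the learner has made its prediction.
    The update is the Widrow-Hoff (LMS) rule, used as an illustrative
    stand-in; it is NOT presented as the algorithm from the paper.
    """
    n = len(trials[0][0])
    weights = [0.0] * n          # start from the zero hypothesis
    total_sq_error = 0.0

    for x, y in trials:
        # 1. Predict with the current linear hypothesis.
        y_hat = sum(w * xi for w, xi in zip(weights, x))
        # 2. Observe the true value and accumulate the squared error.
        error = y_hat - y
        total_sq_error += error ** 2
        # 3. Take a gradient step on the squared error of this trial.
        weights = [w - learning_rate * error * xi for w, xi in zip(weights, x)]

    return weights, total_sq_error
```

Feeding this function noise-free trials generated by a fixed hidden weight vector lets one watch the cumulative squared error, the quantity the abstract's bounds refer to, level off as the weights approach the target.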

Author information

Authors and Affiliations

NEC Research Institute, 4 Independence Way, 08540, Princeton, NJ
Nicholas Littlestone
CIS Board, UC Santa Cruz, 95064, Santa Cruz, CA
Manfred K. Warmuth
Computer Science Department, Duke University, P. O. Box 90129, 27708, Durham, NC
Philip M. Long

About this article

Cite this article

Littlestone, N., Warmuth, M.K. & Long, P.M. On-line learning of linear functions. Comput Complexity 5, 1–23 (1995). https://doi.org/10.1007/BF01277953

Received: 30 May 1992
Issue Date: March 1995
DOI: https://doi.org/10.1007/BF01277953

Subject classifications

68T05
