Skip to main content

On-line Learning with Delayed Label Feedback

  • Conference paper
Algorithmic Learning Theory (ALT 2005)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3734))

Included in the following conference series:

Abstract

We generalize on-line learning to handle delays in receiving labels for instances. After receiving an instance x, the algorithm may need to make predictions on several new instances before the label for x is returned by the environment. We give two simple techniques for converting a traditional on-line algorithm into an algorithm for solving a delayed on-line problem. One technique is for instances generated by an adversary; the other is for instances generated by a distribution. We show how these techniques effect the original on-line mistake bounds by giving upper-bounds and restricted lower-bounds on the number of mistakes.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Littlestone, N.: Learning quickly when irrelevant attributes abound: A new linear-threshold algorithm. Machine Learning 2, 285–318 (1988)

    Google Scholar 

  2. Androutsopoulos, I., Koutsias, J., Chandrinos, K., Paliouras, G., Spyropoulos, C.: An evaluation of naive bayesian anti-spam filtering (2000)

    Google Scholar 

  3. Padmanabhan, V.N., Mogul, J.C.: Using predictive prefetching to improve world wide web latency. ACM SIGCOMM Computer Communication Review 26, 22–36 (1996)

    Article  Google Scholar 

  4. Littlestone, N.: Redundant noisy attributes, attribute errors, and linear-threshold learning using winnow. In: Proceedings of the Third Annual Conference on Computational Learning Theory, pp. 147–156 (1991)

    Google Scholar 

  5. Helmbold, D.P., Long, P.M.: Tracking drifting concepts using random examples. In: Proceedings of the Third Annual Conference on Computational Learning Theory, pp. 13–23 (1991)

    Google Scholar 

  6. Kuh, A., Petsche, T., Rivest, R.L.: Learning time-varying concepts. In: Neural Information Processing Systems Three, pp. 183–189. Morgan Kaufmann Publishers, Inc., San Francisco (1991)

    Google Scholar 

  7. Auer, P., Warmuth, M.K.: Tracking the best disjunction. In: Proceedings of the 36th annual symposium on foundations of computer science, pp. 312–321. IEEE Computer Society Press, Los Alamitos (1995)

    Google Scholar 

  8. Devroye, L., Györfi, L., Lugosi, G.: A Probabilistic Theory of Pattern Recognition. Springer, New York (1991)

    Google Scholar 

  9. Littlestone, N.: Comparing several linear-threshold learning algorithms on tasks involving superfluous attributes. In: Proceeding of the Twelve International Conference on Machine Learning, pp. 353–361 (1995)

    Google Scholar 

  10. Littlestone, N., Mesterharm, C.: An apobayesian relative of winnow. In: Neural Information Processing Systems Nine, pp. 204–210. MIT Press, Cambridge (1997)

    Google Scholar 

  11. Minsky, M.L., Papert, S.A.: Perceptrons. MIT Press, Cambridge (1969)

    MATH  Google Scholar 

  12. Li, Y., Long, P.: The relaxed online maximum margin algorithm. In: Neural Information Processing Systems Twelve, pp. 498–504. MIT Press, Cambridge (2000)

    Google Scholar 

  13. Mesterharm, C.: Tracking linear-threshold concepts with winnow. Journal of Machine Learning Research 4, 819–838 (2003)

    Article  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Mesterharm, C. (2005). On-line Learning with Delayed Label Feedback. In: Jain, S., Simon, H.U., Tomita, E. (eds) Algorithmic Learning Theory. ALT 2005. Lecture Notes in Computer Science(), vol 3734. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11564089_31

Download citation

  • DOI: https://doi.org/10.1007/11564089_31

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-29242-5

  • Online ISBN: 978-3-540-31696-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics