Online Prediction Problems with Variation

Lee, Chia-Jung; Tsai, Shi-Chun; Yang, Ming-Chuan

doi:10.1007/978-3-319-08783-2_5

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 8591)

Included in the following conference series:

International Computing and Combinatorics Conference

Abstract

We study the prediction with expert advice problem, in which, in each round, the player selects one of N actions and incurs the corresponding loss according to an N-dimensional linear loss vector, and aims to minimize the regret. In this paper, we consider a new measure of the loss functions, which we call L_∞-variation. For loss functions with small L_∞-variation, if the player is allowed to have some information related to the variation in each round, we obtain an online bandit algorithm for the problem without using the self-concordance methodology, which conditionally answers an open problem in [8]. Another related problem is the combinatorial prediction game, in which the set of actions is a subset of {0,1}^d and the loss function is in [–1,1]^d. We provide an online algorithm in the semi-bandit setting when the loss functions have small L_∞-variation.
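
To make the setting concrete, the following is a minimal sketch of the full-information prediction with expert advice game, played with the classical Hedge (exponential weights) strategy in the spirit of [6] and [9]. It is not the bandit or semi-bandit algorithm of this paper, which is not reproduced here. The function name `hedge`, the learning-rate parameter `eta`, and the synthetic losses are assumptions made only for illustration.

```python
import math
import random


def hedge(loss_vectors, eta):
    """Play Hedge against a sequence of N-dimensional loss vectors.

    loss_vectors: list of T lists, each of length N, with entries in [-1, 1].
    eta: learning rate (hypothetical parameter name, not from the paper).
    Returns (expected_loss, best_fixed_loss, regret).
    """
    n = len(loss_vectors[0])
    cumulative = [0.0] * n  # cumulative loss of each action so far
    expected_loss = 0.0
    for loss in loss_vectors:
        # Probabilities proportional to exp(-eta * cumulative loss).
        # Subtracting the minimum keeps the exponentials numerically stable.
        base = min(cumulative)
        weights = [math.exp(-eta * (c - base)) for c in cumulative]
        total = sum(weights)
        probs = [w / total for w in weights]
        # Expected loss of the randomized player in this round.
        expected_loss += sum(p * l for p, l in zip(probs, loss))
        # Full-information feedback: the entire loss vector is revealed.
        cumulative = [c + l for c, l in zip(cumulative, loss)]
    best_fixed_loss = min(cumulative)
    return expected_loss, best_fixed_loss, expected_loss - best_fixed_loss


if __name__ == "__main__":
    T, N = 1000, 5
    rng = random.Random(0)  # fixed seed so the run is repeatable
    losses = [[rng.uniform(-1.0, 1.0) for _ in range(N)] for _ in range(T)]
    eta = math.sqrt(2.0 * math.log(N) / T)
    total, best, regret = hedge(losses, eta)
    print("expected loss:", total)
    print("best fixed action loss:", best)
    print("regret:", regret)
```

The regret returned here is the player's expected cumulative loss minus the cumulative loss of the best single action in hindsight. For losses in [-1, 1] and this choice of `eta`, the standard Hedge analysis bounds it by sqrt(2 T ln N). In the bandit and semi-bandit settings discussed in the abstract, the player would observe only part of each loss vector rather than all of it, as this sketch assumes.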

References

1. Abernethy, J., Hazan, E., Rakhlin, A.: Competing in the dark: An efficient algorithm for bandit linear optimization. In: COLT, pp. 263–274 (2008)
2. Audibert, J.-Y., Bubeck, S.: Regret Bounds and Minimax Policies under Partial Monitoring. Journal of Machine Learning Research 11, 2635–2686 (2010)
3. Audibert, J.-Y., Bubeck, S., Lugosi, G.: Minimax Policies for Combinatorial Prediction Games. In: COLT, pp. 107–132 (2011)
4. Chiang, C.-K., Yang, T., Lee, C.-J., Mahdavi, M., Lu, C.-J., Jin, R., Zhu, S.: Online optimization with gradual variations. In: COLT, pp. 6.1–6.20 (2012)
5. Dani, V., Hayes, T., Kakade, S.M.: The Price of Bandit Information for Online Optimization. In: NIPS, pp. 345–352 (2008)
6. Freund, Y., Schapire, R.E.: A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting. J. Comput. Syst. Sci. 55(1), 119–139 (1997)
7. Hazan, E., Kale, S.: Extracting certainty from uncertainty: Regret bounded by variation in costs. Machine Learning 80(2-3), 165–188 (2010)
8. Hazan, E., Kale, S.: Better Algorithms for Benign Bandits. Journal of Machine Learning Research 12, 1287–1311 (2011)
9. Littlestone, N., Warmuth, M.K.: The Weighted Majority Algorithm. Inf. Comput. 108(2), 212–261 (1994)
10. Vitter, J.S.: Random sampling with a reservoir. ACM Trans. Math. Softw. 11(1), 37–57 (1985)

Author information

Authors and Affiliations

Department of Computer Science, National Chiao-Tung University, Hsinchu, Taiwan
Chia-Jung Lee, Shi-Chun Tsai & Ming-Chuan Yang

Editor information

Editors and Affiliations

Department of Computer Science, Georgia State University, 34 Peachtree Street, Suite 1410, 30303, Atlanta, GA, USA
Zhipeng Cai
Department of Computer Science, Georgia State University, 34 Peachtree Street, Suite 1443, 30303, Atlanta, GA, USA
Alex Zelikovsky
Department of Computer Science, Georgia State University, 34 Peachtree Street, Suite 1449, 30303, Atlanta, GA, USA
Anu Bourgeois

About this paper

Cite this paper

Lee, CJ., Tsai, SC., Yang, MC. (2014). Online Prediction Problems with Variation. In: Cai, Z., Zelikovsky, A., Bourgeois, A. (eds) Computing and Combinatorics. COCOON 2014. Lecture Notes in Computer Science, vol 8591. Springer, Cham. https://doi.org/10.1007/978-3-319-08783-2_5

DOI: https://doi.org/10.1007/978-3-319-08783-2_5
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-08782-5
Online ISBN: 978-3-319-08783-2
eBook Packages: Computer Science, Computer Science (R0)
