Online State Exploration: Competitive Worst Case and Learning-Augmented Algorithms

Im, Sungjin; Moseley, Benjamin; Xu, Chenyang; Zhang, Ruilong

doi:10.1007/978-3-031-43421-1_20

Sungjin Im¹²,
Benjamin Moseley¹³,
Chenyang Xu¹⁴ &
…
Ruilong Zhang¹⁵

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 14172))

Included in the following conference series:

Joint European Conference on Machine Learning and Knowledge Discovery in Databases

915 Accesses

Abstract

This paper introduces the online state exploration problem. In the problem, there is a hidden d-dimensional target state. We are given a distance function between different states in the space and a penalty function depending on the current state for each incorrect guess. The goal is to move to a vector that dominates the target state starting from the origin in the d-dimensional space while minimizing the total distance and penalty cost. This problem generalizes several natural online discrete optimization problems such as multi-dimensional knapsack cover, cow path, online bidding, and online search. For online state exploration, the paper gives results in the worst-case competitive analysis model and in the online algorithms augmented with the prediction model. The results extend and generalize many known results in the online setting.

All authors (ordered alphabetically) have equal contributions.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
The lower bound 4 of worst case algorithms is the best possible robustness ratio.
2.
The code is available at https://github.com/Chenyang-1995/Online-State-Exploration.

References

Ai, L., Wu, X., Huang, L., Huang, L., Tang, P., Li, J.: The multi-shop ski rental problem. In: SIGMETRICS. pp. 463–475. ACM (2014)
Google Scholar
Anand, K., Ge, R., Kumar, A., Panigrahi, D.: A regression approach to learning-augmented online algorithms. In: NeurIPS, vol. 34 (2021)
Google Scholar
Anand, K., Ge, R., Panigrahi, D.: Customizing ML predictions for online algorithms. In: ICML. Proceedings of Machine Learning Research, vol. 119, pp. 303–313. PMLR (2020)
Google Scholar
Angelopoulos, S.: Online search with a hint. In: ITCS. LIPIcs, vol. 185, pp. 51:1–51:16 (2021)
Google Scholar
Antoniadis, A., Gouleakis, T., Kleer, P., Kolev, P.: Secretary and online matching problems with machine learned advice. In: NeurIPS (2020)
Google Scholar
Azar, Y.: On-line load balancing. In: Fiat, A., Woeginger, G.J. (eds.) Online Algorithms. LNCS, vol. 1442, pp. 178–195. Springer, Heidelberg (1998). https://doi.org/10.1007/BFb0029569
Chapter Google Scholar
Baeza-Yates, R.A., Culberson, J.C., Rawlins, G.J.E.: Searching in the plane. Inf. Comput. 106(2), 234–252 (1993)
Article MathSciNet MATH Google Scholar
Bamas, É., Maggiori, A., Svensson, O.: The primal-dual method for learning augmented algorithms. In: NeurIPS (2020)
Google Scholar
Charikar, M., Chekuri, C., Feder, T., Motwani, R.: Incremental clustering and dynamic information retrieval. SIAM J. Comput. 33(6), 1417–1440 (2004)
Article MathSciNet MATH Google Scholar
Chrobak, M., Kenyon, C., Noga, J., Young, N.E.: Incremental medians via online bidding. Algorithmica 50(4), 455–478 (2008)
Article MathSciNet MATH Google Scholar
Demaine, E.D., Fekete, S.P., Gal, S.: Online searching with turn cost. Theor. Comput. Sci. 361(2–3), 342–355 (2006)
Article MathSciNet MATH Google Scholar
Dütting, P., Lattanzi, S., Leme, R.P., Vassilvitskii, S.: Secretaries with advice. In: EC, pp. 409–429. ACM (2021)
Google Scholar
Epstein, L., Levin, A.: Randomized algorithms for online bounded bidding. Inf. Process. Lett. 110(12–13), 503–506 (2010)
Article MathSciNet MATH Google Scholar
Im, S., Kumar, R., Qaem, M.M., Purohit, M.: Non-clairvoyant scheduling with predictions. In: SPAA, pp. 285–294. ACM (2021)
Google Scholar
Jiang, Z., Panigrahi, D., Sun, K.: Online algorithms for weighted paging with predictions. In: ICALP, pp. 69:1–69:18 (2020)
Google Scholar
Kao, M., Ma, Y., Sipser, M., Yin, Y.L.: Optimal constructions of hybrid algorithms. J. Algorithms 29(1), 142–164 (1998)
Article MathSciNet MATH Google Scholar
Kao, M., Reif, J.H., Tate, S.R.: Searching in an unknown environment: an optimal randomized algorithm for the cow-path problem. Inf. Comput. 131(1), 63–79 (1996)
Article MathSciNet MATH Google Scholar
Lattanzi, S., Lavastida, T., Moseley, B., Vassilvitskii, S.: Online scheduling via learned weights. In: SODA, pp. 1859–1877 (2020)
Google Scholar
Lotker, Z., Patt-Shamir, B., Rawitz, D.: Rent, lease, or buy: randomized algorithms for multislope ski rental. SIAM J. Discret. Math. 26(2), 718–736 (2012)
Article MathSciNet MATH Google Scholar
Lykouris, T., Vassilvitskii, S.: Competitive caching with machine learned advice. In: ICML, Proceedings of Machine Learning Research, vol. 80, pp. 3302–3311. PMLR (2018)
Google Scholar
Meyerson, A.: The parking permit problem. In: FOCS, pp. 274–284. IEEE Computer Society (2005)
Google Scholar
Mitzenmacher, M., Vassilvitskii, S.: Algorithms with predictions. In: Beyond the Worst-Case Analysis of Algorithms, pp. 646–662. Cambridge University Press (2020)
Google Scholar
Purohit, M., Svitkina, Z., Kumar, R.: Improving online algorithms via ML predictions. In: NeurIPS, pp. 9684–9693 (2018)
Google Scholar
Rohatgi, D.: Near-optimal bounds for online caching with machine learned advice. In: SODA, pp. 1834–1845 (2020)
Google Scholar
Wang, S., Li, J., Wang, S.: Online algorithms for multi-shop ski rental with machine learned advice. In: NeurIPS (2020)
Google Scholar

Download references

Acknowledgements

Chenyang Xu was supported in part by Science and Technology Innovation 2030 -“The Next Generation of Artificial Intelligence” Major Project No.2018AAA0100900, and the Dean’s Fund of Shanghai Key Laboratory of Trustworthy Computing, East China Normal University. Sungjin Im was supported in part by NSF grants CCF-1844939 and CCF-2121745. Benjamin Moseley was supported in part by a Google Research Award, an Infor Research Award, a Carnegie Bosch Junior Faculty Chair, and NSF grants CCF-2121744 and CCF-1845146. Ruilong Zhang was supported by NSF grant CCF-1844890.

Author information

Authors and Affiliations

Electrical Engineering and Computer Science, University of California at Merced, Merced, CA, USA
Sungjin Im
Tepper School of Business, Carnegie Mellon University, Pittsburgh, PA, USA
Benjamin Moseley
Shanghai Key Laboratory of Trustworthy Computing, East China Normal University, Shanghai, China
Chenyang Xu
Department of Computer Science and Engineering, University at Buffalo, Buffalo, NY, USA
Ruilong Zhang

Authors

Sungjin Im
View author publications
You can also search for this author in PubMed Google Scholar
Benjamin Moseley
View author publications
You can also search for this author in PubMed Google Scholar
Chenyang Xu
View author publications
You can also search for this author in PubMed Google Scholar
Ruilong Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Sungjin Im , Benjamin Moseley , Chenyang Xu or Ruilong Zhang .

Editor information

Editors and Affiliations

University of Michigan, Ann Arbor, MI, USA
Danai Koutra
University of Vienna, Vienna, Austria
Claudia Plant
Max Planck Institute for Software Systems, Kaiserslautern, Germany
Manuel Gomez Rodriguez
Politecnico di Torino, Turin, Italy
Elena Baralis
CENTAI, Turin, Italy
Francesco Bonchi

Ethics declarations

The current paper is a theoretical work that explores various ideas and concepts related to the topic which aims to strengthen the traditional worst-case algorithm via machine learning advice. As such, there are no ethical issues associated with the research presented here. The paper includes some experiments which aims to verify the efficiency of the proposed algorithms. But this paper does not involve any experiments or studies that involve human and no personal information or data is used in the analysis. Instead, the focus is on developing theoretical models and frameworks that can help to advance our understanding of the subject matter.

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Im, S., Moseley, B., Xu, C., Zhang, R. (2023). Online State Exploration: Competitive Worst Case and Learning-Augmented Algorithms. In: Koutra, D., Plant, C., Gomez Rodriguez, M., Baralis, E., Bonchi, F. (eds) Machine Learning and Knowledge Discovery in Databases: Research Track. ECML PKDD 2023. Lecture Notes in Computer Science(), vol 14172. Springer, Cham. https://doi.org/10.1007/978-3-031-43421-1_20

Download citation

DOI: https://doi.org/10.1007/978-3-031-43421-1_20
Published: 18 September 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-43420-4
Online ISBN: 978-3-031-43421-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

the ECML PKDD community (opens in a new tab)

Online State Exploration: Competitive Worst Case and Learning-Augmented Algorithms