skip to main content
research-article
Public Access

Approximation Algorithms for Stochastic Submodular Set Cover with Applications to Boolean Function Evaluation and Min-Knapsack

Published: 25 April 2016 Publication History

Abstract

We present a new approximation algorithm for the stochastic submodular set cover (SSSC) problem called adaptive dual greedy. We use this algorithm to obtain a 3-approximation algorithm solving the stochastic Boolean function evaluation (SBFE) problem for linear threshold formulas (LTFs). We also obtain a 3-approximation algorithm for the closely related stochastic min-knapsack problem and a 2-approximation for a variant of that problem.
We prove a new approximation bound for a previous algorithm for the SSSC problem, the adaptive greedy algorithm of Golovin and Krause.
We also consider an approach to approximating SBFE problems using the adaptive greedy algorithm, which we call the Q-value approach. This approach easily yields a new result for evaluation of CDNF (conjunctive / disjunctive normal form) formulas, and we apply variants of it to simultaneous evaluation problems and a ranking problem. However, we show that the Q-value approach provably cannot be used to obtain a sublinear approximation factor for the SBFE problem for LTFs or read-once disjunctive normal form formulas.

References

[1]
M. Adler and B. Heeringa. 2012. Approximating optimal binary decision trees. Algorithmica 62, 1112--1121.
[2]
S. R. Allen, L. Hellerstein, D. Kletenik, and T. Ünlüyurt. 2013. Evaluation of DNF formulas. arXiv:1310.3673.
[3]
A. Bar-Noy, M. Bellare, M. M. Halldórsson, H. Shachnai, and T. Tamir. 1998. On chromatic sums and distributed resource allocation. Information and Computation 140, 183--202.
[4]
G. Bellala, S. Bhavnani, and C. Scott. 2012. Group-based active query selection for rapid diagnosis in time-critical situations. IEEE Transactions on Information Theory 58, 1, 459--478.
[5]
Y. Ben-Dov. 1981. Optimal testing procedures for special structures of coherent systems. Management Science 27, 12, 1410--1420.
[6]
P. Beraldi and A. Ruszczynski. 2002. The probabilistic set-covering problem. Operations Research 50, 956--967.
[7]
A. Bhalgat. 2011. A (2 + ε)-approximation algorithm for the stochastic knapsack problem. Unpublished Manuscript.
[8]
A. Bhalgat, A. Goel, and S. Khanna. 2011. Improved approximation results for stochastic knapsack problems. In Proceedings of the 22nd Annual ACM-SIAM Symposium on Discrete Algorithms. 1647--1665.
[9]
E. Boros and T. Ünlüyurt. 1999. Diagnosing double regular systems. Annals of Mathematics and Artificial Intelligence 26, 171--191.
[10]
E. Boros and T. Ünlüyurt. 2000. Sequential testing of series-parallel systems of small depth. In Computing Tools for Modeling, Optimization and Simulation. Springer, 39--74.
[11]
T. Carnes and D. Shmoys. 2008. Primal-dual schema for capacitated covering problems. In Proceedings of the 13th International Conference on Integer Programming and Combinatorial Optimization (IPCO’08). 288--302.
[12]
R. Carr, L. Fleischer, V. Leung, and C. Phillips. 2000. Strengthening integrality gaps for capacitated network design and covering problems. In Proceedings of the 11th Annual ACM-SIAM Symposium on Discrete Algorithms. 106--115.
[13]
M.-F. Chang, W. Shi, and W. K. Fuchs. 1990. Optimal diagnosis procedures for k-out-of-n structures. IEEE Transactions on Computers 39, 559--564.
[14]
M. Charikar, R. Fagin, V. Guruswami, J. M. Kleinberg, P. Raghavan, and A. Sahai. 2002. Query strategies for priced information. Journal of Computer and Systems Sciences 64, 785--819.
[15]
F. Cicalese, E. Laber, and A. M. Saettler. 2013. Decision trees for the efficient evaluation of discrete functions: Worst case and expected case analysis. arXiv 1309.2796.
[16]
L. Cox, Y. Qiu, and W. Kuehner. 1989. Heuristic least-cost computation of discrete classification functions with uncertain argument values. Annals of Operations Research 21, 1--29.
[17]
B. Dean, M. Goemans, and J. Vondrák. 2004. Approximating the stochastic knapsack problem: The benefit of adaptivity. In Proceedings of the 45th Symposium on Foundations of Computer Science (FOCS’04). 208--217.
[18]
C. Derman, C. Lieberman, and S. Ross. 1978. A renewal decision problem. Management Science 24, 5, 554--561.
[19]
A. Deshpande and L. Hellerstein. 2008. Flow algorithms for parallel query optimization. In Proceedings of the 24th International Conference on Data Engineering (ICDE’08). 754--763.
[20]
U. Feige, L. Lovász, and P. Tetali. 2002. Approximating min-sum set cover. In Proceedings of the 5th International Workshop on Approximation Algorithms for Combinatorial Optimization (APPROX’02). 94--107.
[21]
A. Fiat and D. Pechyony. 2004. Decision trees: More theoretical justification for practical algorithms. In Proceedings of the 15th International Conference on Algorithmic Learning Theory (ALT’04).
[22]
T. Fujito. 1999. On approximation of the submodular set cover problem. Operations Research Letters 25, 4, 169--174.
[23]
T. Fujito. 2000. Approximation algorithms for submodular set cover with applications. IEICE Transactions on Information and Systems 83, 480--487.
[24]
M. Garey. 1973. Optimal task scheduling with precedence constraints. Discrete Mathematics 4, 37--56.
[25]
M. X. Goemans and J. Vondrák. 2006. Stochastic covering and adaptivity. In Proceedings of the 7th Latin American Symposium on Theoretical Informatics (LATIN’06). 532--543.
[26]
D. Golovin and A. Krause. 2011. Adaptive submodularity: Theory and applications in active learning and stochastic optimization. Journal of Artificial Intelligence Research 42, 427--486.
[27]
D. Golovin, A. Krause, and D. Ray. 2010. Near-optimal Bayesian active learning with noisy observations. In Proceedings of the 24th Annual Conference on Neural Information Processing Systems (NIPS’10). 766--774.
[28]
R. Greiner, R. Hayward, M. Jankowska, and M. Molloy. 2006. Finding optimal satisficing strategies for and-or trees. Artificial Intelligence 170, 19--58.
[29]
R. Greiner, R. Hayward, and M. Molloy. 2002. Optimal depth-first strategies for and-or trees. In Proceedings of the 18th National Conference on Artificial Intelligence and the 14th Conference on Innovative Applications of Artificial Intelligence. 725--730.
[30]
D. Guijarro, V. Lavín, and V. Raghavan. 2006. Exact learning when irrelevant variables abound. In Proceedings of the 4th European Conference on Computational Learning Theory (EuroCOLT’99). 91--100.
[31]
A. Guillory and J. Bilmes. 2011. Simultaneous learning and covering with adversarial noise. In Proceedings of the 28th International Conference on Machine Learning (ICML’11). 369--376.
[32]
X. Han and K. Makino. 2010. Online minimization knapsack problem. In Approximation and Online Algorithms. Lecture Notes in Computer Science, Vol. 5893. Springer, 182--193.
[33]
J. Hastad. 1994. On the size of weights for threshold gates. SIAM Journal on Discrete Mathematics 7, 3, 484--492.
[34]
T. Ibaraki and T. Kameda. 1984. On the optimal nesting order for computing n-relational joins. ACM Transactions on Database Systems 9, 482--502.
[35]
S. Im and V. Nagarajan. 2011. Minimum latency submodular cover in metrics. arXiv: 1110.2207.
[36]
S. Im, V. Nagarajan, and R. van der Zwaan. 2012. Minimum latency submodular cover. In Proceedings of the 39th International Colloquium on Automata, Languages, and Programming (ICALP’12). 485--497.
[37]
S. Iwata and K. Nagano. 2009. Submodular function minimization under covering constraints. In Proceedings of the 50th IEEE Symposium on Foundations of Computer Science (FOCS’09). 671--680.
[38]
H. Kaplan, E. Kushilevitz, and Y. Mansour. 2005. Learning with attribute costs. In Proceedings of the Symposium on the Theory of Computing. 356--365.
[39]
R. Krishnamurthy, H. Boral, and C. Zaniolo. 1986. Optimization of nonrecursive queries. In Proceedings of the 12th International Conference on Very Large Data Bases (VLDB’86). 128--137.
[40]
Z. Liu, S. Parthasarathy, A. Ranganathan, and H. Yang. 2008. Near-optimal algorithms for shared filter evaluation in data stream systems. In Proceedings of the 28th ACM SIGMOD International Conference on Management of Data.
[41]
M. Moshkov. 2003. Approximate algorithm for minimization of decision tree depth. In Rough Sets, Fuzzy Sets, Data Mining, and Granular Computing. Lecture Notes in Computer Science, Vol. 2639. Springer, 611--614.
[42]
M. Moshkov and I. Chikalov. 1997. Bounds on average weighted depth of decision trees. Fundamenta Informaticae 31, 145--156.
[43]
K. Munagala, S. Babu, R. Motwani, and J. Widom. 2005. The pipelined set cover problem. In Proceedings of the 10th International Conference on Database Theory (ICDT’05). 83--98.
[44]
S. Nijssen and E. Fromont. 2007. Mining optimal decision trees from itemset lattices. In Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’07). 530--539.
[45]
S. Salloum. 1979. Optimal Testing Algorithms for Symmetric Coherent Systems. Ph.D. Dissertation. University of Southern California.
[46]
S. Salloum and M. Breuer. 1984. An optimum testing algorithm for some symmetric coherent systems. Journal of Mathematical Analysis and Applications 101, 170--194.
[47]
U. Srivastava, K. Munagala, J. Widom, and R. Motwani. 2006. Query optimization over Web services. In Proceedings of the 32nd International Conference on Very Large Data Bases (VLDB’06). 355--366.
[48]
T. Ünlüyurt. 2004. Sequential testing of complex systems: A review. Discrete Applied Mathematics 142, 189--205.
[49]
L. Wolsey. 1982. An analysis of the greedy algorithm for the submodular set covering problem. Combinatorica 2, 385--393.

Cited By

View all
  • (2024)Nonadaptive Stochastic Score Classification and Explainable Half-Space EvaluationOperations Research10.1287/opre.2023.0431Online publication date: 18-Jul-2024
  • (2024)The Power of Adaptivity for Stochastic Submodular CoverOperations Research10.1287/opre.2022.238872:3(1156-1176)Online publication date: 1-May-2024
  • (2024)Semi-Bandit Learning for Monotone Stochastic Optimization*2024 IEEE 65th Annual Symposium on Foundations of Computer Science (FOCS)10.1109/FOCS61266.2024.00083(1260-1274)Online publication date: 27-Oct-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Algorithms
ACM Transactions on Algorithms  Volume 12, Issue 3
June 2016
408 pages
ISSN:1549-6325
EISSN:1549-6333
DOI:10.1145/2930058
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 25 April 2016
Accepted: 01 January 2016
Revised: 01 January 2016
Received: 01 December 2014
Published in TALG Volume 12, Issue 3

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Boolean function evaluation
  2. Sequential testing

Qualifiers

  • Research-article
  • Research
  • Refereed

Funding Sources

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)131
  • Downloads (Last 6 weeks)22
Reflects downloads up to 27 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Nonadaptive Stochastic Score Classification and Explainable Half-Space EvaluationOperations Research10.1287/opre.2023.0431Online publication date: 18-Jul-2024
  • (2024)The Power of Adaptivity for Stochastic Submodular CoverOperations Research10.1287/opre.2022.238872:3(1156-1176)Online publication date: 1-May-2024
  • (2024)Semi-Bandit Learning for Monotone Stochastic Optimization*2024 IEEE 65th Annual Symposium on Foundations of Computer Science (FOCS)10.1109/FOCS61266.2024.00083(1260-1274)Online publication date: 27-Oct-2024
  • (2023)Online Learning for Adaptive Probing and Scheduling in Dense WLANsIEEE INFOCOM 2023 - IEEE Conference on Computer Communications10.1109/INFOCOM53939.2023.10228988(1-10)Online publication date: 17-May-2023
  • (2022)A General Framework for Approximating Min Sum Ordering ProblemsINFORMS Journal on Computing10.1287/ijoc.2021.112434:3(1437-1452)Online publication date: 1-May-2022
  • (2022)The Stochastic Boolean Function Evaluation problem for symmetric Boolean functionsDiscrete Applied Mathematics10.1016/j.dam.2021.12.001309:C(269-277)Online publication date: 15-Mar-2022
  • (2022)Regularized impurity reduction: accurate decision trees with complexity guaranteesData Mining and Knowledge Discovery10.1007/s10618-022-00884-737:1(434-475)Online publication date: 28-Nov-2022
  • (2022)Algorithms for the Unit-Cost Stochastic Score Classification ProblemAlgorithmica10.1007/s00453-022-00982-484:10(3054-3074)Online publication date: 1-Oct-2022
  • (2022)Adaptivity Gaps for the Stochastic Boolean Function Evaluation ProblemApproximation and Online Algorithms10.1007/978-3-031-18367-6_10(190-210)Online publication date: 21-Oct-2022
  • (2022)Non-adaptive Stochastic Score Classification and Explainable Halfspace EvaluationInteger Programming and Combinatorial Optimization10.1007/978-3-031-06901-7_21(277-290)Online publication date: 27-Jun-2022
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Full Access

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media