Skip to main content
Log in

Non-submodular maximization on massive data streams

  • Published:
Journal of Global Optimization Aims and scope Submit manuscript

Abstract

The problem of maximizing a normalized monotone non-submodular set function subject to a cardinality constraint arises in the context of extracting information from massive streaming data. In this paper, we present four streaming algorithms for this problem by utilizing the concept of diminishing-return ratio. We analyze these algorithms to obtain the corresponding approximation ratios, which generalize the previous results for the submodular case. The numerical experiments show that our algorithms have better solution quality and competitive running time when compared to an existing algorithm.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1

Similar content being viewed by others

References

  1. Agrawal, R., Gollapudi, S., Halverson, A., Ieong, S.: Diversifying search results. In: Proceedings of WSDM, pp. 5–14 (2009)

  2. Ajtai, M., Jayram, T.S., Kumar, R., Sivakumar, D.: Approximate counting of inversions in a data stream. In: Proceedings of STOC, pp. 370–379 (2002)

  3. Badanidiyuru, A., Mirzasoleiman, B., Karbasi, A., Krause, A.: Streaming submodular maximization: massive data summarization on the fly. In: Proceedings of SIGKDD, pp. 671–680 (2014)

  4. Barbosa, R., Ene, A., Nguyen, H.L., Ward, J.: The power of randomization: distributed submodular maximization on massive datasets. In: Proceedings of ICML, pp. 1236–1244 (2015)

  5. Barbosa, R., Ene, A., Nguyen, H.L., Ward, J.: A new framework for distributed submodular maximization. In: Proceedings of FOCS, pp. 645–654 (2016)

  6. Bian, A.A., Buhmann, J.M., Krause, A., Tschiatschek, S.: Guarantees for greedy maximization of non-submodular functions with applications. In: Proceedings of ICML, pp. 498–507 (2017)

  7. Buchbinder, N., Feldman, M., Schwartz, R.: Online submodular maximization with preemption. In: Proceedings of SODA, pp. 1202–1216 (2015)

  8. Chakrabarti, A., Kale, S.: Submodular maximization meets streaming: matchings, matroids, and more. Math. Program. 154, 225–247 (2015)

    Article  MathSciNet  Google Scholar 

  9. Das, A., Kempe, D.: Submodular meets spectral: greedy algorithms for subset selection, sparse approximation and dictionary selection. In: Proceedings of ICML, pp. 1057–1064 (2011)

  10. Du, D.L., Li, Y., Xiu, N.H., Xu, D.C.: Simultaneous approximation of multi-criteria submodular function maximization. J. Oper. Res. Soc. China 2, 271–290 (2014)

    Article  MathSciNet  Google Scholar 

  11. Dueck, D., Frey, B.J.: Non-metric affinity propagation for unsupervised image categorization. In: Proceedings of ICCV, pp. 1–8 (2007)

  12. El-Arini, K., Guestrin, C.: Beyond keyword search: discovering relevant scientific literature. In: Proceedings of SIGKDD, pp. 439–447 (2011)

  13. El-Arini, K., Veda, G., Shahaf, D., Guestrin, C.: Turning down the noise in the blogosphere. In: Proceedings of SIGKDD, pp. 289–298 (2009)

  14. Elenberg, E., Dimakis, A.G., Feldman, M., Karbasi, A.: Streaming weak submodularity: interpreting neural networks on the fly. In: Proceedings of NIPS, pp. 4044–4054 (2017)

  15. Feige, U.: A threshold of \(\ln n\) for approximating set cover. J. ACM 45, 634–652 (1998)

    Article  MathSciNet  Google Scholar 

  16. Goldengorin, B., Ghosh, D.: A multilevel search algorithm for the maximization of submodular functions applied to the quadratic cost partition problem. J. Glob. Optim. 32, 65–82 (2005)

    Article  MathSciNet  Google Scholar 

  17. Golovin, D., Krause, A.: Adaptive submodularity: theory and applications in active learning and stochastic optimization. J. Artif. Intell. Res. 42, 427–486 (2011)

    MathSciNet  MATH  Google Scholar 

  18. Gomes, R., Krause, A.: Budgeted nonparametric learning from data streams. In: Proceedings of ICML, pp. 391–398 (2010)

  19. Gomez Rodriguez, M., Leskovec, J., Krause, A.: Inferring networks of diffusion and influence. ACM Trans. Knowl. Discov. D 8, 36–39 (2018)

    Google Scholar 

  20. Guha, S., Mishra, N., Motwani, R., O’Callaghan, L.: Clustering data streams. In: Proceedings of FOCS, pp. 359–366 (2000)

  21. Kampa, K., Mehta, S., Chou, C.A., Chaovalitwongse, W.A., Grabowski, T.J.: Sparse optimization in feature selection: application in neuroimaging. J. Glob. Optim. 59, 439–457 (2014)

    Article  MathSciNet  Google Scholar 

  22. Kempe, D., Kleinberg, J., Tardos, E.: Maximizing the spread of influence through a social network. In: Proceedings of SIGKDD, pp. 137–146 (2003)

  23. Khanna, R., Elenberg, E., Dimakis, A.G., Negahban, S., Ghosh, J.: Scalable greedy feature selection via weak submodularity. In: Proceedings of AISTATS, pp. 1560–1568 (2017)

  24. Krause, A., Golovin, D.: Submodular function maximization. In tractability: practical approaches to hard problems, pp. 71–104. Cambridge University Press, Cambridge (2014)

    Google Scholar 

  25. Krause, A., Singh, A., Guestrin, C.: Nearoptimal sensor placements in gaussian processes: theory, efficient algorithms and empirical studies. J. Mach. Learn. Res. 9, 235–284 (2008)

    MATH  Google Scholar 

  26. Kuhnle, A., Smith, J.D., Crawford, V.G., Thai, M.T.: Fast maximization of non-submodular, monotonic functions on the integer lattice. In: Proceedings of ICML, pp. 2791–2800 (2018)

  27. Lawrence, N., Seeger, M., Herbrich, R.: Fast sparse gaussian process methods: the informative vector machine. In: Proceedings of NIPS, pp. 625–632 (2003)

  28. Lehmann, B., Lehmann, D., Nisan, N.: Combinatorial auctions with decreasing marginal utilities. Games Econ. Behav. 55, 270–296 (2006)

    Article  MathSciNet  Google Scholar 

  29. Lin, Y., Chen, W., Lui, J.C.S.: Boosting information spread: an algorithmic approach. In: Proceedings of ICDE, pp. 883–894 (2017)

  30. Mirzasoleiman, B., Badanidiyuru, A., Karbasi, A., Vondrák, J., Krause, A.: Lazier than lazy greedy. In: Proceedings of AAAI, pp. 1812–1818 (2015)

  31. Mirzasoleiman, B., Karbasi, A., Sarkar, R., Krause, A.: Distributed submodular maximization: identifying representative elements in massive data. In: Proceedings of NIPS, pp. 2049–2057 (2013)

  32. Muthukrishnan, S.: Data streams: algorithms and applications. Found. Trends Theor. Comput. Sci. 1, 117–236 (2005)

    Article  MathSciNet  Google Scholar 

  33. Nemhauser, G.L., Wolsey, L.A., Fisher, M.L.: An analysis of approximations for maximizing submodular set functions-I. Math. Program. 14, 265–294 (1978)

    Article  MathSciNet  Google Scholar 

  34. Norouzi-Fard, A., Tarnawski, J., Mitrovic, S., Zandieh, A., Mousavifar, A., Svensson, O.: Beyond \(1/2\)-approximation for submodular maximization on massive data streams. In: Proceedings of ICML, pp. 3826–3835 (2018)

  35. Yu, Q., Li, H., Liao, Y., Cui, S.: Fast budgeted influence maximization over multi-action event logs. IEEE Access 6, 14367–14378 (2018)

    Article  Google Scholar 

  36. Zhu, X., Yu, J., Lee, W., Kim, D., Shan, S., Du, D.Z.: New dominating sets in social networks. J. Glob. Optim. 48, 633–642 (2010)

    Article  MathSciNet  Google Scholar 

  37. Zoubin, G.: Scaling the indian buffet process via submodular maximization. In: Proceedings of ICML, pp. 1013–1021 (2013)

Download references

Acknowledgements

We would like to appreciate the anonymous reviewers and editors whose valuable comments and suggestions have greatly improved the quality of this paper. The second author is supported by Natural Science Foundation of China (No. 11531014). The third author is supported by National Natural Science Foundation of China (Nos. 61433012 and U1435215). The fourth author is supported by Natural Science Foundation of China (No. 11871081).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Dongmei Zhang.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wang, Y., Xu, D., Wang, Y. et al. Non-submodular maximization on massive data streams. J Glob Optim 76, 729–743 (2020). https://doi.org/10.1007/s10898-019-00840-8

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10898-019-00840-8

Keywords

Navigation