Abstract
The problem of maximizing a normalized monotone non-submodular set function subject to a cardinality constraint arises in the context of extracting information from massive streaming data. In this paper, we present four streaming algorithms for this problem by utilizing the concept of diminishing-return ratio. We analyze these algorithms to obtain the corresponding approximation ratios, which generalize the previous results for the submodular case. The numerical experiments show that our algorithms have better solution quality and competitive running time when compared to an existing algorithm.
Similar content being viewed by others
References
Agrawal, R., Gollapudi, S., Halverson, A., Ieong, S.: Diversifying search results. In: Proceedings of WSDM, pp. 5–14 (2009)
Ajtai, M., Jayram, T.S., Kumar, R., Sivakumar, D.: Approximate counting of inversions in a data stream. In: Proceedings of STOC, pp. 370–379 (2002)
Badanidiyuru, A., Mirzasoleiman, B., Karbasi, A., Krause, A.: Streaming submodular maximization: massive data summarization on the fly. In: Proceedings of SIGKDD, pp. 671–680 (2014)
Barbosa, R., Ene, A., Nguyen, H.L., Ward, J.: The power of randomization: distributed submodular maximization on massive datasets. In: Proceedings of ICML, pp. 1236–1244 (2015)
Barbosa, R., Ene, A., Nguyen, H.L., Ward, J.: A new framework for distributed submodular maximization. In: Proceedings of FOCS, pp. 645–654 (2016)
Bian, A.A., Buhmann, J.M., Krause, A., Tschiatschek, S.: Guarantees for greedy maximization of non-submodular functions with applications. In: Proceedings of ICML, pp. 498–507 (2017)
Buchbinder, N., Feldman, M., Schwartz, R.: Online submodular maximization with preemption. In: Proceedings of SODA, pp. 1202–1216 (2015)
Chakrabarti, A., Kale, S.: Submodular maximization meets streaming: matchings, matroids, and more. Math. Program. 154, 225–247 (2015)
Das, A., Kempe, D.: Submodular meets spectral: greedy algorithms for subset selection, sparse approximation and dictionary selection. In: Proceedings of ICML, pp. 1057–1064 (2011)
Du, D.L., Li, Y., Xiu, N.H., Xu, D.C.: Simultaneous approximation of multi-criteria submodular function maximization. J. Oper. Res. Soc. China 2, 271–290 (2014)
Dueck, D., Frey, B.J.: Non-metric affinity propagation for unsupervised image categorization. In: Proceedings of ICCV, pp. 1–8 (2007)
El-Arini, K., Guestrin, C.: Beyond keyword search: discovering relevant scientific literature. In: Proceedings of SIGKDD, pp. 439–447 (2011)
El-Arini, K., Veda, G., Shahaf, D., Guestrin, C.: Turning down the noise in the blogosphere. In: Proceedings of SIGKDD, pp. 289–298 (2009)
Elenberg, E., Dimakis, A.G., Feldman, M., Karbasi, A.: Streaming weak submodularity: interpreting neural networks on the fly. In: Proceedings of NIPS, pp. 4044–4054 (2017)
Feige, U.: A threshold of \(\ln n\) for approximating set cover. J. ACM 45, 634–652 (1998)
Goldengorin, B., Ghosh, D.: A multilevel search algorithm for the maximization of submodular functions applied to the quadratic cost partition problem. J. Glob. Optim. 32, 65–82 (2005)
Golovin, D., Krause, A.: Adaptive submodularity: theory and applications in active learning and stochastic optimization. J. Artif. Intell. Res. 42, 427–486 (2011)
Gomes, R., Krause, A.: Budgeted nonparametric learning from data streams. In: Proceedings of ICML, pp. 391–398 (2010)
Gomez Rodriguez, M., Leskovec, J., Krause, A.: Inferring networks of diffusion and influence. ACM Trans. Knowl. Discov. D 8, 36–39 (2018)
Guha, S., Mishra, N., Motwani, R., O’Callaghan, L.: Clustering data streams. In: Proceedings of FOCS, pp. 359–366 (2000)
Kampa, K., Mehta, S., Chou, C.A., Chaovalitwongse, W.A., Grabowski, T.J.: Sparse optimization in feature selection: application in neuroimaging. J. Glob. Optim. 59, 439–457 (2014)
Kempe, D., Kleinberg, J., Tardos, E.: Maximizing the spread of influence through a social network. In: Proceedings of SIGKDD, pp. 137–146 (2003)
Khanna, R., Elenberg, E., Dimakis, A.G., Negahban, S., Ghosh, J.: Scalable greedy feature selection via weak submodularity. In: Proceedings of AISTATS, pp. 1560–1568 (2017)
Krause, A., Golovin, D.: Submodular function maximization. In tractability: practical approaches to hard problems, pp. 71–104. Cambridge University Press, Cambridge (2014)
Krause, A., Singh, A., Guestrin, C.: Nearoptimal sensor placements in gaussian processes: theory, efficient algorithms and empirical studies. J. Mach. Learn. Res. 9, 235–284 (2008)
Kuhnle, A., Smith, J.D., Crawford, V.G., Thai, M.T.: Fast maximization of non-submodular, monotonic functions on the integer lattice. In: Proceedings of ICML, pp. 2791–2800 (2018)
Lawrence, N., Seeger, M., Herbrich, R.: Fast sparse gaussian process methods: the informative vector machine. In: Proceedings of NIPS, pp. 625–632 (2003)
Lehmann, B., Lehmann, D., Nisan, N.: Combinatorial auctions with decreasing marginal utilities. Games Econ. Behav. 55, 270–296 (2006)
Lin, Y., Chen, W., Lui, J.C.S.: Boosting information spread: an algorithmic approach. In: Proceedings of ICDE, pp. 883–894 (2017)
Mirzasoleiman, B., Badanidiyuru, A., Karbasi, A., Vondrák, J., Krause, A.: Lazier than lazy greedy. In: Proceedings of AAAI, pp. 1812–1818 (2015)
Mirzasoleiman, B., Karbasi, A., Sarkar, R., Krause, A.: Distributed submodular maximization: identifying representative elements in massive data. In: Proceedings of NIPS, pp. 2049–2057 (2013)
Muthukrishnan, S.: Data streams: algorithms and applications. Found. Trends Theor. Comput. Sci. 1, 117–236 (2005)
Nemhauser, G.L., Wolsey, L.A., Fisher, M.L.: An analysis of approximations for maximizing submodular set functions-I. Math. Program. 14, 265–294 (1978)
Norouzi-Fard, A., Tarnawski, J., Mitrovic, S., Zandieh, A., Mousavifar, A., Svensson, O.: Beyond \(1/2\)-approximation for submodular maximization on massive data streams. In: Proceedings of ICML, pp. 3826–3835 (2018)
Yu, Q., Li, H., Liao, Y., Cui, S.: Fast budgeted influence maximization over multi-action event logs. IEEE Access 6, 14367–14378 (2018)
Zhu, X., Yu, J., Lee, W., Kim, D., Shan, S., Du, D.Z.: New dominating sets in social networks. J. Glob. Optim. 48, 633–642 (2010)
Zoubin, G.: Scaling the indian buffet process via submodular maximization. In: Proceedings of ICML, pp. 1013–1021 (2013)
Acknowledgements
We would like to appreciate the anonymous reviewers and editors whose valuable comments and suggestions have greatly improved the quality of this paper. The second author is supported by Natural Science Foundation of China (No. 11531014). The third author is supported by National Natural Science Foundation of China (Nos. 61433012 and U1435215). The fourth author is supported by Natural Science Foundation of China (No. 11871081).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Wang, Y., Xu, D., Wang, Y. et al. Non-submodular maximization on massive data streams. J Glob Optim 76, 729–743 (2020). https://doi.org/10.1007/s10898-019-00840-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10898-019-00840-8