skip to main content
10.1145/3274005.3274036acmotherconferencesArticle/Chapter ViewAbstractPublication PagescompsystechConference Proceedingsconference-collections
research-article

Neighborhood-based Strategies for Widening of the Greedy Algorithm of the Set Cover Problem

Published: 13 September 2018 Publication History

Abstract

We live in the age of ever-increasing parallel computing resources. For a decade there has been intense research into parallel data mining algorithms. Most of this research is focused on improving the running time of existing algorithms. In contrast our focus is the improvement of the solution quality, or model accuracy. We are looking for "smart" strategies to invest parallel compute resources in order to achieve a better exploration of the search space by exploring several solutions in parallel, referred to as Widening. In this paper, we demonstrate the effect of neighborhood-based Widening with different types of neighborhoods for the greedy algorithm of the set cover problem.

References

[1]
Zaenal Akbar, Violeta N. Ivanova, and Michael R. Berthold. 2012. Parallel Data Mining Revisited. Better, Not Faster. In IDA. 23--34.
[2]
Selim G. Akl. 2002. Parallel Real-Time Computation: Sometimes Quantity Means Quality. In Computing and Informatics. Vol. 21. 455--487.
[3]
K. Bache and M. Lichman. 2013. UCI Machine Learning Repository. http://archive.ics.uci.edu/ml
[4]
Michael R. Berthold, Nicolas Cebron, Fabian Dill, Thomas R. Gabriel, Tobias Kötter, Thorsten Meinl, Peter Ohl, Christoph Sieb, Kilian Thiel, and Bernd Wiswedel. KNIME: The Konstanz Information Miner. In Data Analysis, Machine Learning and Applications, Christine Preisach, Hans Burkhardt, Lars Schmidt-Thieme, and Reinhold Decker (Eds.). Springer Berlin Heidelberg, Berlin, Heidelberg, 319--326.
[5]
John Darlington, Yike Guo, Janjao Sutiwaraphun, and Hing Wing To. 1997. Parallel Induction Algorithms for Data Mining. In IDA, Vol. 1280. 437--445.
[6]
Alberto Fernández, Salvador García, Julián Luengo, Ester Bernadó-Mansilla, and Francisco Herrera. 2010. Genetics-based Machine Learning for Rule Induction: State of the Art, Taxonomy, and Comparative Study. Trans. Evol. Comp 14, 6 (Dec. 2010), 913--941.
[7]
Jerome H Friedman and Bogdan E Popescu. 2008. Predictive learning via rule ensembles. The Annals of Applied Statistics (2008), 916--954.
[8]
William D. Harvey and Matthew L. Ginsberg. 1995. Limited Discrepancy Search. In IJCAI. 607--615.
[9]
Violeta N. Ivanova and Michael R. Berthold. 2013. Diversity-Driven Widening. In IDA. 223--236.
[10]
David S. Johnson. 1973. Approximation algorithms for combinatorial problems. In STOC.
[11]
Hillol Kargupta and Philip Chan. 2000. Advances in Distributed and Parallel Knowledge Discovery. AAAI/MIT Press.
[12]
Richard M. Karp. 1972. Reducibility among combinatorial problems. In Complexity of Computer Computations. 85--103.
[13]
Richard Kufrin. 1995. Decision Trees on Parallel Processors. In PPAI. 279--306.
[14]
Vipin Kumar. 2001. Special Issue on High-performance Data Mining. Academic Press.
[15]
GiseleL. Pappa and AlexA. Freitas. 2010. Creating Rule Ensembles from Automatically-Evolved Rule Induction Algorithms. In Advances in Machine Learning I, Jacek Koronacki, ZbigniewW. Ras, Slawomir T. Wierzchon, and Janusz Kacprzyk (Eds.). Studies in Computational Intelligence, Vol. 262. Springer Berlin Heidelberg, 257--273.
[16]
John Shafer, Rakeeh Agrawal, and Manish Mehta. 1996. SPRINT: A Scalable Parallel Classifier for Data Mining. In VLDB. 544--555.
[17]
Peter Shell, Juan Antonio Hernandez Rubio, and Gonzalo Quiroga Barro. 1994. Improving search through diversity. In AAAI. 1323--1328.
[18]
Anurag Srivastava, Eui-Hong Han, Vipin Kumar, and Vineet Singh. 1999. Parallel Formulations of Decision-Tree Classification Algorithms. DMKD 3, 3 (1999), 31 CompSysTech'18, September 13--14, 2018, Ruse, Bulgaria Violeta N. Ivanova-Rohling 237--261.
[19]
Domenico Talia. 2002. Parallelism in Knowledge Discovery Techniques. In PARA, Vol. 2367. 127--136.
[20]
Mohammed J. Zaki. 1999. Parallel and Distributed Association Mining: a Survey. Concurrency, IEEE 7, 4 (1999), 14--25.
[21]
Mohammed J. Zaki and Ching-Tien Ho. 2000. Large-Scale Parallel Data Mining. Springer.
[22]
Mohammed J. Zaki, Ching-Tien Ho, and Rakesh Agrawal. 1999. Parallel Classification for Data Mining on Shared-Memory Multiprocessors. In ICDE. 198--205.
[23]
Mohammed J. Zaki and Yi Pan. 2002. Introduction: Recent Developments in Parallel and Distributed Data Mining. DPD 11, 2 (2002), 123--127.

Cited By

View all
  • (2024)SkySCALE: A Radio Tomographic Approach Towards Scaling UAV Network DeploymentsProceedings of the Twenty-fifth International Symposium on Theory, Algorithmic Foundations, and Protocol Design for Mobile Networks and Mobile Computing10.1145/3641512.3686384(341-350)Online publication date: 14-Oct-2024
  • (2020)Evaluating Machine Learning Approaches for Discovering Optimal Sets of Projection Operators for Quantum State Tomography of Qubit SystemsCybernetics and Information Technologies10.2478/cait-2020-006120:6(61-73)Online publication date: 31-Dec-2020

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
CompSysTech '18: Proceedings of the 19th International Conference on Computer Systems and Technologies
September 2018
206 pages
ISBN:9781450364256
DOI:10.1145/3274005
© 2018 Association for Computing Machinery. ACM acknowledges that this contribution was authored or co-authored by an employee, contractor or affiliate of a national government. As such, the Government retains a nonexclusive, royalty-free right to publish or reproduce this article, or to allow others to do so, for Government purposes only.

In-Cooperation

  • ERSVB: EURORISC SYSTEMS - Varna, Bulgaria
  • FOSEUB: FEDERATION OF THE SCIENTIFIC ENGINEERING UNIONS - Bulgaria
  • UORB: University of Ruse, Bulgaria
  • TECHUVB: Technical University of Varna, Bulgaria

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 September 2018

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

CompSysTech'18

Acceptance Rates

Overall Acceptance Rate 241 of 492 submissions, 49%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 07 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2024)SkySCALE: A Radio Tomographic Approach Towards Scaling UAV Network DeploymentsProceedings of the Twenty-fifth International Symposium on Theory, Algorithmic Foundations, and Protocol Design for Mobile Networks and Mobile Computing10.1145/3641512.3686384(341-350)Online publication date: 14-Oct-2024
  • (2020)Evaluating Machine Learning Approaches for Discovering Optimal Sets of Projection Operators for Quantum State Tomography of Qubit SystemsCybernetics and Information Technologies10.2478/cait-2020-006120:6(61-73)Online publication date: 31-Dec-2020

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media