Multiple Seeds Based Evolutionary Algorithm for Mining Boolean Association Rules

Kabir, Mir Md. Jahangir; Xu, Shuxiang; Kang, Byeong Ho; Zhao, Zongyuan

doi:10.1007/978-3-319-42996-0_6

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 9794)

Included in the following conference series:

Pacific-Asia Conference on Knowledge Discovery and Data Mining

Abstract

Most association rule mining algorithms use a single seed to initialize the population, without considering how the initial population affects evolutionary learning. Recent research shows that the initial population significantly affects the quality of the solutions a genetic algorithm produces over several generations. Single-seed genetic algorithms raise two significant challenges for real-world applications: (1) the solutions of a genetic algorithm vary, since different seeds generate different initial populations; (2) it is hard to define an effective seed for a specific application. To avoid these problems, we propose a new multiple-seeds-based genetic algorithm (MSGA), which generates multiple seeds from different domains of the solution space to discover high-quality rules from a large data set. The approach introduces an m-domain model and an m-seeds selection process: the whole solution space is subdivided into m domains of equal size, and one seed is selected from each domain. Using these seeds, the method generates an effective initial population for evolutionary learning of the fitness value of each rule. As a result, the method achieves strong search efficiency at the beginning of the evolution and fast convergence as the evolution proceeds. MSGA is tested with different mutation and crossover operators for mining interesting Boolean association rules from different real-world data sets, and the results are compared with those of different single-seed genetic algorithms.
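
The m-domain idea can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not say how rules are encoded, how a seed is chosen inside a domain, or how the population is grown from the seeds. The sketch assumes that a candidate rule is a bit string of length `n_bits` read as an integer, that domains are contiguous integer ranges, that each seed is drawn uniformly at random from its domain, and that individuals are produced by flipping one random bit of a seed. All function names and parameters are hypothetical.

```python
import random


def split_into_domains(n_bits, m):
    """Split the integer range [0, 2**n_bits) into m contiguous, equal-size domains."""
    size = 2 ** n_bits
    if size % m != 0:
        raise ValueError("m must divide 2**n_bits to get equal-size domains")
    width = size // m
    return [(k * width, (k + 1) * width) for k in range(m)]


def select_seeds(domains, rng):
    """Pick one seed per domain, uniformly at random (an assumed selection rule)."""
    return [rng.randrange(lo, hi) for lo, hi in domains]


def initial_population(seeds, n_bits, per_seed, rng):
    """Create per_seed individuals from each seed by flipping one random bit (assumed)."""
    population = []
    for seed in seeds:
        for _ in range(per_seed):
            population.append(seed ^ (1 << rng.randrange(n_bits)))
    return population


rng = random.Random(0)  # fixed RNG state so the sketch is reproducible
domains = split_into_domains(n_bits=8, m=4)
seeds = select_seeds(domains, rng)
population = initial_population(seeds, n_bits=8, per_seed=5, rng=rng)
```

Because every domain contributes one seed, the starting individuals are anchored in all m regions of the search space rather than clustered around a single seed. A bit flip can still move an individual out of its seed's domain, so the population is spread across the space without being strictly partitioned.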

Author information

Authors and Affiliations

School of Engineering and ICT, University of Tasmania, Hobart, Australia
Mir Md. Jahangir Kabir, Shuxiang Xu, Byeong Ho Kang & Zongyuan Zhao

Corresponding author

Correspondence to Mir Md. Jahangir Kabir.

Editor information

Editors and Affiliations

New Mexico State University, Las Cruces, New Mexico, USA
Huiping Cao
University of Technology Sydney, Sydney, New South Wales, Australia
Jinyan Li
Massey University, Auckland, New Zealand
Ruili Wang

About this paper

Cite this paper

Kabir, M.M.J., Xu, S., Kang, B.H., Zhao, Z. (2016). Multiple Seeds Based Evolutionary Algorithm for Mining Boolean Association Rules. In: Cao, H., Li, J., Wang, R. (eds) Trends and Applications in Knowledge Discovery and Data Mining. PAKDD 2016. Lecture Notes in Computer Science, vol 9794. Springer, Cham. https://doi.org/10.1007/978-3-319-42996-0_6

DOI: https://doi.org/10.1007/978-3-319-42996-0_6
Published: 15 July 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-42995-3
Online ISBN: 978-3-319-42996-0
eBook Packages: Computer Science, Computer Science (R0)
