Multi-objective approach based on grammar-guided genetic programming for solving multiple instance problems

Zafra, Amelia; Ventura, Sebastián

doi:10.1007/s00500-011-0794-0

Multi-objective approach based on grammar-guided genetic programming for solving multiple instance problems

Original Paper
Published: 08 December 2011

Volume 16, pages 955–977, (2012)
Cite this article

Soft Computing Aims and scope Submit manuscript

Amelia Zafra¹ &
Sebastián Ventura¹

304 Accesses
3 Citations
Explore all metrics

Abstract

Multiple instance learning (MIL) is considered a generalization of traditional supervised learning which deals with uncertainty in the information. Together with the fact that, as in any other learning framework, the classifier performance evaluation maintains a trade-off relationship between different conflicting objectives, this makes the classification task less straightforward. This paper introduces a multi-objective proposal that works in a MIL scenario to obtain well-distributed Pareto solutions to multi-instance problems. The algorithm developed, Multi-Objective Grammar Guided Genetic Programming for Multiple Instances (MOG3P-MI), is based on grammar-guided genetic programming, which is a robust tool for classification. Thus, this proposal combines the advantages of the grammar-guided genetic programming with benefits provided by multi-objective approaches. First, a study of multi-objective optimization for MIL is carried out. To do this, three different extensions of MOG3P-MI are designed and implemented and their performance is compared. This study allows us on the one hand, to check the performance of multi-objective techniques in this learning paradigm and on the other hand, to determine the most appropriate evolutionary process for MOG3P-MI. Then, MOG3P-MI is compared with some of the most significant proposals developed throughout the years in MIL. Computational experiments show that MOG3P-MI often obtains consistently better results than the other algorithms, achieving the most accurate models. Moreover, the classifiers obtained are very comprehensible.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Instance selection by genetic-based biological algorithm

Article 21 June 2014

Zong-Yao Chen, Chih-Fong Tsai, … Shih-Wen Ke

A Grammar-Guided Genetic Programming Algorithm for Multi-Label Classification

A Logarithmic Distance-Based Multi-Objective Genetic Programming Approach for Classification of Imbalanced Data

References

Andrews S, Tsochantaridis I, Hofmann T (2002) Support vector machines for multiple-instance learning. In: NIPS’02: proceedings of neural information processing system, Vancouver, Canada, pp 561–568
Auer P, Ortner R (2004) A boosting approach to multiple instance learning. In: ECML’04: proceedings of the 5th European conference on machine learning. Lecture Notes in Computer Science, vol 3201. Springer, Pisa, pp 63–74
Bojarczuk CC, Lopes HS, Freitas AA (2000) Genetic programming for knowledge discovery in chest-pain diagnosis. IEEE Eng Med Biol Mag 19(4):38–44
Article Google Scholar
Chai YM, Yang ZW (2007) A multi-instance learning algorithm based on normalized radial basis function network. In: ISSN’07: proceedings of the 4th International Symposium on Neural Networks. Lecture Notes in Computer Science, vol 4491. Springer, Nanjing, pp 1162–1172
Chen X, Zhang C, Chen S, Rubin S (2009) A human-centered multiple instance learning framework for semantic video retrieval. IEEE Trans Syst Man Cybern Part C Appl Rev 39(2):228–233
Article Google Scholar
Chen Y, Bi J, Wang J (2006) MILES: Multiple-instance learning via embedded instance selection. IEEE Trans Pattern Anal Mach Intell 28(12):1931–1947
Article Google Scholar
Chen Y, Wang JZ (2004) Image categorization by learning and reasoning with regions. J Mach Learn Res 5:913–939
Google Scholar
Chevaleyre YZ, Zucker JD (2001) Solving multiple-instance and multiple-part learning problems with decision trees and decision rules. Application to the mutagenesis problem. In: AI’01: proceedings of the 14th of the Canadian society for computational studies of intelligence. Lecture Notes in Computer Science, vol 2056. Springer, Ottawa, pp 204–214
Chien BC, Lin JY, Hong TP (2002) Learning discriminant functions with fuzzy attributes for classification using genetic programming. Expert Syst Appl 23(1):31–37
Article Google Scholar
Coello CA, Lamont GB, Veldhuizen DAV (2007) Evolutionary algorithms for solving multi-objective problems. Genetic and evolutionary computation. 2nd edn. Springer, Berlin
Couchet J, Manrique D, Ros J, Rodrguez-Patn A (2006) Crossover operators for grammar-guided genetic programming. Soft Comput A Fusion of Found Methodol Appl 11(10):943–955
Google Scholar
Deb K, Agrawal S, Pratap A, Meyarivan T (2000) A fast elitist non-dominated sorting genetic algorithm for multi-objective optimisation: NSGA-II. In: PPSN VI: proceedings of the 6th international conference on parallel problem solving from nature. Springer, London, pp 849–858
Dehuri S, Cho SB (2008) Multi-objective classification rule mining using gene expression programming. In: ICCIT ’08: proceedings of the 3rd international conference on convergence and hybrid information technology. IEEE Computer Society, Washington, DC, pp 754–760
Demšar J (2006) Statistical comparisons of classifiers over multiple data sets. J Mach Learn Res 7:1–30
MathSciNet MATH Google Scholar
Dietterich TG, Lathrop RH, Lozano-Perez T (1997) Solving the multiple instance problem with axis-parallel rectangles. Artif Intell 89(1–2):31–71
Article MATH Google Scholar
Gao S, Suna Q (2008) Exploiting generalized discriminative multiple instance learning for multimedia semantic concept detection. Pattern Recogn 41(10):3214–3223
Article MATH Google Scholar
Garcia S, Fernandez A, Luengo J, Herrera F (2009) A study of statistical techniques and performance measures for genetics-based machine learning: accuracy and interpretability. Soft Comput A Fusion of Found Methodol Appl 13:959–977
Google Scholar
Garcia S, Fernandez A, Luengo J, Herrera F (2010) Advanced nonparametric tests for multiple comparisons in the design of experiments in computational intelligence and data mining: experimental analysis of power. Inf Sci 180(10):2044–2064
Article Google Scholar
Gartner T, Flach PA, Kowalczyk A, Smola AJ (2002) Multi-instance k0ernels. In: ICML’02: proceedings of the 19th international conference on machine learning. Morgan Kaufmann, Sydney, pp 179–186
Gu Z, Mei T, Hua X, Tang J, Wu X (2008) Multi-layer multi-instance learning for video concept detection. IEEE Trans Multimedia 10(8):1605–1616
Article Google Scholar
Jaszkiewicz A, Kominek P (2003) Genetic local search with distance preserving recombination operator for a vehicle routing problem. Eur J Oper Res 151(2):352–364
Article MathSciNet MATH Google Scholar
Keerthi S, Shevade S, Bhattacharyya C, Murthy K (2001) Improvements to platt’s smo algorithm for svm classifier design. Neural Comput 13(3):637–649
Article MATH Google Scholar
Kishore JK, Patnaik LM, Mani V, Agrawal VK (2000) Application of genetic programming for multicategory pattern classification. IEEE Trans Evol Comput 4(3):242–258
Article Google Scholar
Knuth DE (1964) Backus normal form vs. backus naur form. Commun ACM 7(12):735–736
Article Google Scholar
Liao S, Hsieh C, Lai P (2011) An evolutionary approach for multi-objective optimization of the integrated location-inventory distribution network problem in vendor-managed inventory. Expert Syst Appl 38(6):6768–6776
Article Google Scholar
Lu J, Ma S, Zhang M (2008) Multi-instance clustering approach for web image using one-class support vector machine. J Comput Inf Syst 4(3):1231–1240
Google Scholar
Mangasarian OL, Wild EW (2008) Multiple instance classification via successive linear programming. J Optim Theory Appl 137(3):555–568
Article MathSciNet MATH Google Scholar
Maron O, Lozano-Pérez T (1997) A framework for multiple-instance learning. In: NIPS’97: proceedings of neural information processing system 10, Denver, CO, USA, pp 570–576
Mugambi EM, Hunter A (2003) Multi-objective genetic programming optimization of decision trees for classifying medical data. In: KES’03: knowledge-based intelligent information and engineering systems, pp 293–299.
Panait L, Luke S (2004) Alternative bloat control methods. In: GECCO’04: proceedings of the 2004 conference on genetic and evolutionary computation, Seattle, Washington, USA, pp 630–641
Pang J, Huang Q, Jiang S (2008) Multiple instance boost using graph embedding based decision stump for pedestrian detection. In: ECCV’08: proceedings of the 10th European conference on computer vision. Lectures Note in Computer Science, vol 5305. Springer, Berlin, pp 541–552
Pao H, Chuang S, Xu Y, Fu H (2008) An EM based multiple instance learning method for image classification. Expert Syst Appl 35(3):1468–1472
Article Google Scholar
Parrott D, Xiaodong L, Ciesielski V (2005) Multi-objective techniques in genetic programming for evolving classifiers. In: IEEE congress on evolutionary computation, vol 2, pp 1141–1148
Qian C, Yu Y, Zhou ZH (2011) An analysis on recombination in multi-objective evolutionary optimization. In: GECCO’11: proceedings of the 13th ACM conference on genetic and evolutionary computation, Dublin, Ireland, pp 2051–2058
Ramon J, De Raedt L (2000) Multi-instance neural networks. In: ICML’00: a workshop on attribute-value and relational learning at the 17th conference on machine learning
Ray S, Page D (2001) Multiple instance regression. In: Proceedings of the eighteenth international conference on machine learning, ICML’01, San Francisco, CA, USA, pp 425–432
Ron K (1995) A study of cross-validation and bootstrap for accuracy estimation and model selection. In: IJCAI’95: international joint conference on artificial intelligence, Montreal, Canada, pp 1137–1145
Ruffo G (2000) Learning single and multiple instance decision tree for computer security applications. PhD thesis, Department of Computer Science. University of Turin, Torino, Italy
Shukla PK, Deb K (2007) On finding multiple pareto-optimal solutions using classical and evolutionary generating methods. Eur J Oper Res 181(3):1630–1652
Article MATH Google Scholar
Tan KC, Tay Lee A, Heng T, CM (2002) Mining multiple comprehensible classification rules using genetic programming. In: CEC’02: proceedings of the congress on evolutionary computation, Honolulu, HI, USA, vol 2, pp 1302–1307
Tan K, Chiam S, Mamun A, Goh C (2009) Balancing exploration and exploitation with adaptive variation for evolutionary multi-objective optimization. Eur J Oper Res 197(2):701–713
Article MATH Google Scholar
Tsang CH, Kwong S, Wang H (2007) Genetic-fuzzy rule mining approach and evaluation of feature selection techniques for anomaly intrusion detection. Pattern Recogn 40(9):2373–2391
Article MATH Google Scholar
Veldhuizen DV, Lamont G (2000) Multiobjective evolutionary algorithms: analyzing the state-of-the-art. Evol Comput 8(2):125–147
Article Google Scholar
Ventura S, Romero C, Zafra A, Delgado JA, Hervás C (2007) JCLEC: AZ java framework for evolutionary computation soft computing. Soft Comput 12(4):381–392
Article Google Scholar
Wang J, Zucker JD (2000) Solving the multiple-instance problem: a lazy learning approach. In: ICML’00: Proceedings of the 17th international conference on machine learning, Stanford, CA, USA, pp 1119–1126
Wee H, Lo C, Hsu P (2009) A multi-objective joint replenishment inventory model of deteriorated items in a fuzzy environment. Eur J Oper Res 197(2):620–631
Article MathSciNet MATH Google Scholar
Whigham PA (1995) Grammatically-based genetic programming. In: Proceedings of the workshop on genetic programming: from theory to real-world applications, Tahoe City, CA, USA, pp 33–41
Whigham PA (1996) Grammatical bias for evolutionary learning. PhD thesis, School of Computer Science, University College, University of New South Wales, Australian Defence Force Academy, Canberra, Australia
Wiens TS, Dale BC, Boyce MS, Kershaw PG (2008) Three way k-fold cross-validation of resource selection functions. Ecol Model 212(3–4):244–255
Article Google Scholar
Witten I, Frank E (2005) Data mining: practical machine learning tools and techniques. 2nd edn. Morgan Kaufmann, San Francisco
Witten IH, Frank E (1999) Data mining: practical machine learning tools and techniques with java implementations. Morgan Kaufmann, San Francisco
Wu F, Zhou H, Zhao J, Cen K (2011) A comparative study of the multi-objective optimization algorithms for coal-fired boilers. Expert Syst Appl 38(6):7179–7185
Article Google Scholar
Xu L, Guo MZ, Zou Q, Liu Y, Li HF (2008) An improved diverse density algorithm for multiple overlapped instances. In: ICNC’08: Proceedings of the 4th international conference on natural computation. IEEE Computer Society, Washington, DC, pp 88–91
Xu X (2003) Statistical learning in multiple instance problems. PhD thesis, Department of Computer Science. University of Waikato
Xu X, Frank E (2004) Logistic regression and boosting for labeled bags of instances. In: PAKDD’04: Proceedings of the 8th Conference of Pacific–Asia. Lecture Notes in Computer Science, vol 3056. Springer, Sydney, pp 272–281
Yang C, Dong M, Fotouhi F (2005) Region based image annotation through multiple-instance learning. In: Multimedia’05: proceedings of the 13th annual ACM international conference on multimedia, New York, USA, pp 435–438
Yang E, Erdogan AT, Arslan T, Barton NH (2011) Multi-objective evolutionary optimizations of a space-based reconfigurable sensor network under hard constraints. Soft Computing 15(1):25–36
Article Google Scholar
Zhou Z-H, Wu J, Tang W (2002) Ensembling neural networks: Many could be better than all. Artif Intell 137(1–2): 239–263
Google Scholar
Zafra A, Romero C, Ventura S (2011) Multiple instance learning for classifying students in learning management systems. Expert Syst Appl 38(12):15020–15031
Google Scholar
Zafra A, Ventura S (2010) G3P-MI: A genetic programming algorithm for multiple instance learning. Information Sciences 180(23):4496–4513
Article Google Scholar
Zafra A, Ventura S, Romero C, Herrera-Viedma E (2009) Multi-instance genetic programming for web index recommendation. Expert Syst Appl 36(9):11470–11479
Article Google Scholar
Zhang D, Wanga F, Shib Z, Zhanga C (2010) Interactive localized content based image retrieval with multiple-instance active learning. Pattern Recognition 43(2):478–484
Article MATH Google Scholar
Zhang ML, Zhou ZH (2004) Improve multi-instance neural networks through feature selection. Neural Processing Letters 19(1):1–10
Article MATH Google Scholar
Zhang ML, Zhou ZH (2005) Ensembles of multi-instance Neural Networks. In: IIP’04: International Conference on intelligent information processing II. IFIP international federation for information processing, Beijing, China. vol 163, pp 471–474
Zhang ML, Zhou ZH (2006) Adapting RBF Neural Networks to multi-instance learning. Neural Processing Letters 23(1):1–26
Article Google Scholar
Zhang ML, Zhou ZH (2009) Multi-instance clustering with applications to multi-instance prediction. Applied Intelligences 31(1):47–68
Article Google Scholar
Zhang Q, Goldman S (2001) EM-DD: An improved multiple-instance learning technique. In: NIPS’01: Proceedings of Neural Information Processing System 14, Vancouver, Canada, pp 1073–1080
Zhou ZH, Jiang K, Li M (2005) Multi-instance learning based web mining. Applied Intelligence 22(2):135–147
Article Google Scholar
Zhou ZH, Zhang ML (2007) Solving multi-instance problems with classifier ensemble based on constructive clustering. Knowledge and Information Systems 11(2):155–170
Article Google Scholar
Zitzler E, Laumanns M, Thiele L (2001) SPEA2: Improving the Strength Pareto Evolutionary Algorithm. Tech. Rep. 103, Gloriastrasse 35

Download references

Acknowledgments

The authors gratefully acknowledge the financial subsidy provided by the Spanish Department of Research under TIN2008-06681-C06-03 and P08-TIC-3720 Projects and FEDER fund.

Author information

Authors and Affiliations

Department of Computer Science and Numerical Analysis, University of Cordoba, Cordoba, Spain
Amelia Zafra & Sebastián Ventura

Authors

Amelia Zafra
View author publications
You can also search for this author in PubMed Google Scholar
Sebastián Ventura
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sebastián Ventura.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zafra, A., Ventura, S. Multi-objective approach based on grammar-guided genetic programming for solving multiple instance problems. Soft Comput 16, 955–977 (2012). https://doi.org/10.1007/s00500-011-0794-0

Download citation

Published: 08 December 2011
Issue Date: June 2012
DOI: https://doi.org/10.1007/s00500-011-0794-0

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Multi-objective approach based on grammar-guided genetic programming for solving multiple instance problems

Abstract

Access this article

Similar content being viewed by others

Instance selection by genetic-based biological algorithm

A Grammar-Guided Genetic Programming Algorithm for Multi-Label Classification

A Logarithmic Distance-Based Multi-Objective Genetic Programming Approach for Classification of Imbalanced Data

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Multi-objective approach based on grammar-guided genetic programming for solving multiple instance problems

Abstract

Access this article

Similar content being viewed by others

Instance selection by genetic-based biological algorithm

A Grammar-Guided Genetic Programming Algorithm for Multi-Label Classification

A Logarithmic Distance-Based Multi-Objective Genetic Programming Approach for Classification of Imbalanced Data

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation