skip to main content
10.1145/1830483.1830647acmconferencesArticle/Chapter ViewAbstractPublication PagesgeccoConference Proceedingsconference-collections
research-article

Grammar guided genetic programming for multiple instance learning: an experimental study

Published: 07 July 2010 Publication History

Abstract

This paper introduces a new Grammar-Guided Genetic Programming algorithm for solving multi-instance Learning problems. This algorithm, called G3P-MI, is evaluated and compared with other Multi-Instance classification techniques on different application domains. Computational experiments show that the G3P-MI often obtains consistently better results than other algorithms in terms of accuracy, sensitivity and specificity. Moreover, it adds comprehensibility and clarity into the knowledge discovery process, expressing the information in the form of IF-THEN rules. Our results confirm that evolutionary algorithms are appropriate for dealing with multi-instance learning problems.

References

[1]
S. Andrews, I. Tsochantaridis, and T. Hofmann. Support vector machines for multiple-instance learning. In NIPS'02: Proceedings of Neural Information Processing System, pages 561--568, Vancouver, Canada, 2002.
[2]
P. Auer and R. Ortner. A boosting approach to multiple instance learning. In ECML'04: Proceedings of the 5th European Conference on Machine Learning. Lecture Notes in Computer Science, volume 3201, pages 63--74, Pisa, Italy, 2004.
[3]
T. Back, D. Fogel, and Z. Michalewicz. Handbook of Evolutionary Computation. Oxford University Press, Oxford, UK, 1997.
[4]
H. Blockeel, D. Page, and A. Srinivasan. Multi-instance tree learning. In ICML '05: Proceedings of the 22nd international conference on Machine learning, pages 57--64, New York, NY, USA, 2005. ACM Press.
[5]
C. C. Bojarczuk, H. S. Lopes, and A. A. Freitas. Genetic programming for knowledge discovery in chest-pain diagnosis. IEEE Engineering in Medicine and Biology Magazine, 19(4):38--44, July-Aug. 2000.
[6]
Y.-M. Chai and Z.-W. Yang. A multi-instance learning algorithm based on normalized radial basis function network. In ISSN'07: Proceedings of the 4th International Symposium on Neural Networks. Lecture Notes in Computer Science, volume 4491, pages 1162--1172, Nanjing, China, 2007.
[7]
C.-M. Chen and M.-C. Chen. Mobile formative assessment tool based on data mining techniques for supporting web-based learning. Computer and Education, 52(1):256--273, 2009.
[8]
Y. Chen, J. Bi, and J. Wang. Miles: Multiple-instance learning via embedded instance selection. IEEE Transactions on Pattern Analysis and Machine Intelligence, 28(12):1931--1947, 2006.
[9]
Y.-Z. Chevaleyre and J.-D. Zucker. Solving multiple-instance and multiple-part learning problems with decision trees and decision rules. Application to the mutagenesis problem. In AI'01: Proceedings of the 14th of the Canadian Society for Computational Studies of Intelligence, Lecture Notes in Computer Science, volume 2056, pages 204--214, Ottawa, Canada, 2001.
[10]
J. Demšar. Statistical comparisons of classifiers over multiple data sets. Journal of Machine Learning Research, 7:1--30, 2006.
[11]
T. G. Dietterich, R. H. Lathrop, and T. Lozano-Perez. Solving the multiple instance problem with axis-parallel rectangles. Artifical Intelligence, 89(1--2):31--71, 1997.
[12]
O. J. Dunn. Multiple comparisons among means. Journal of the American Statistical Association, 56(293):52--64, 1961.
[13]
T. Gartner, P. A. Flach, A. Kowalczyk, and A. J. Smola. Multi-instance kernels. In ICML'02: Proceedings of the 19th International Conference on Machine Learning, pages 179--186, Sydney, Australia, 2002. Morgan Kaufmann.
[14]
T. Gartner, J. W. Lloyd, and P. A. Flach. Kernels and distances for structured data. Machine Learning, 57(3):205--232, 2004.
[15]
A. Geyer-Schulz. Fuzzy Rule-Based Expert Systems and Genetic Machine Learning. Physica Verlag, Heidelberg, Germany, First edition, 1995.
[16]
Z. Gu, T. Mei, J. Tang, X. Wu, and X. Hua. MILC2: A multi-layer multi-instance learning approach to video concept detection. In MMM'08: Proceedings of the 14th International Conference of Multimedia Modeling, pages 24--34, Kyoto, Japan, 2008.
[17]
G. Herman, G. Ye, J. Xu, and B. Zhang. Region-based image categorization with reduced feature set. In Proceedings of the 10th IEEE Workshop on Multimedia Signal Processing, pages 586--591, Cairns, Qld, 2008.
[18]
S. Keerthi, S. Shevade, C. Bhattacharyya, and K. Murthy. Improvements to platt's smo algorithm for svm classifier design. Neural Computation, 13(3):637--649, 2001.
[19]
P. Kouchakpour, A. Zaknich, and T. Bräunl. Dynamic population variation in genetic programming. Information Sciences, 179(8):1078--1091, 2009.
[20]
J. R. Koza. Genetic Programming: On the Programming of Computers by Means of Natural Selection. MIT Press, 1992.
[21]
T. Lensberg, A. Eilifsen, and T. E. McKee. Bankruptcy theory development and classification via genetic programming. European Journal of Operational Research, 169(2):677--697, 2006.
[22]
O. Maron and T. Lozano-Pérez. A framework for multiple-instance learning. In NIPS'97: Proceedings of Neural Information Processing System 10, pages 570--576, Denver, Colorado, USA, 1997.
[23]
J. Pang, Q. Huang, and S. Jiang. Multiple instance boost using graph embedding based decision stump for pedestrian detection. In ECCV'08: Proceedings of the 10th European Conference on Computer Vision. LNCS, volume 5305, pages 541--552, Berlin, Heidelberg, 2008. Springer-Verlag.
[24]
H. T. Pao, S. C. Chuang, Y. Y. Xu, and H. Fu. An EM based multiple instance learning method for image classification. Expert Systems with Applications, 35(3):1468--1472, 2008.
[25]
X. Qi and Y. Han. Incorporating multiple svms for automatic image annotation. Pattern Recognition, 40(2):728--741, 2007.
[26]
K. Ron. A study of cross-validation and bootstrap for accuracy estimation and model selection. In IJCAI'95: International Joint Conference on Artificial Intelligence, pages 1137--1145, Canada, 1995.
[27]
K. Tan, A. Tay, T. Lee, and C. Heng. Mining multiple comprehensible classification rules using genetic programming. In CEC'02: Proceedings of the Congress on Evolutionary Computation, volume 2, pages 1302--1307, Honolulu, HI, USA, May 2002.
[28]
A. Tsakonas. A comparison of classification accuracy of four genetic programming-evolved intelligent structures. Information Sciences, 176(6):691--724, 2006.
[29]
S. Ventura, C. Romero, A. Zafra, J. A. Delgado, and C. Hervás. JCLEC: A java framework for evolutionary computation soft computing. Soft Computing, 12(4):381--392, 2007.
[30]
J. Wang and J.-D. Zucker. Solving the multiple-instance problem: A lazy learning approach. In ICML'00: Proceedings of the 17th International Conference on Machine Learning, pages 1119--1126, Standord, CA, USA, 2000.
[31]
P. A. Whigham. Grammatically-based genetic programming. In Proceedings of the Workshop on Genetic Programming: From Theory to Real-World Applications, pages 33--41, California, USA.
[32]
P. A. Whigham. Grammatical Bias for Evolutionary Learning. PhD thesis, School of Computer Science, University College, University of New South Wales, Australian Defence Force Academy, Canberra, Australia, 1996.
[33]
T. S. Wiens, B. C. Dale, M. S. Boyce, and P. G. Kershaw. Three way k-fold cross-validation of resource selection functions. Ecological Modelling, 212(3-4):244--255, April 2008.
[34]
I. Witten and E. Frank. Data Mining: Practical machine learning tools and techniques. 2nd Edition. Morgan Kaufmann, San Francisco, 2005.
[35]
I. H. Witten and E. Frank. Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations. Morgan Kaufmann, 1999.
[36]
X. Xu. Statistical learning in multiple instance problems. PhD thesis, Department of Computer Science. University of Waikato, 2003.
[37]
X. Xu and E. Frank. Logistic regression and boosting for labeled bags of instances. In PAKDD'04: Proceedings of the 8th Conference of Pacific-Asia. Lecture Notes in Computer Science, volume 3056, pages 272--281, Sydney, Australia, 2004.
[38]
C. Yang, M. Dong, and F. Fotouhi. Region based image annotation through multiple-instance learning. In Multimedia'05: Proceedings of the 13th Annual ACM International Conference on Multimedia, pages 435--438, New York, USA, 2005.
[39]
C. Yang and T. Lozano-Perez. Image database retrieval with multiple-instance learning techniques. In ICDE '00: Proceedings of the 16th International Conference on Data Engineering, pages 233--243, Washington, DC, USA, 2000. IEEE Computer Society.
[40]
A. Zafra and S. Ventura. Predicting student grades in learning management systems with multiple instance genetic programming. In EDM'09: Proceedings of the 2sd Conference on Educational Data Mining, pages 309--319, Cordoba, Spain, 2009.
[41]
A. Zafra, S. Ventura, C. Romero, and E. Herrera-Viedma. Multi-instance genetic programming for web index recommendation. Expert Systems with Applications, 36:11470--11479, 2009.
[42]
M.-L. Zhang and Z.-H. Zhou. Ensembles of multi-instance Neural Networks. In IIP'04: International Conference on Intelligent Information Processing II, volume 163 of IFIP, pages 471--474, Beijing, China, 2005.
[43]
M.-L. Zhang and Z.-H. Zhou. Adapting RBF Neural Networks to multi-instance learning. Neural Processing Letters, 23(1):1--26, 2006.
[44]
Q. Zhang and S. Goldman. EM-DD: An improved multiple-instance learning technique. In NIPS'01: Proceedings of Neural Information Processing System 14, pages 1073--1080, Vancouver, Canada, 2001.
[45]
Z.-H. Zhou and M.-L. Zhang. Solving multi-instance problems with classifier ensemble based on constructive clustering. Knowledge and Information Systems, 11(2):155--170, 2007.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
GECCO '10: Proceedings of the 12th annual conference on Genetic and evolutionary computation
July 2010
1520 pages
ISBN:9781450300728
DOI:10.1145/1830483
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 July 2010

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. grammar guided genetic programming
  2. multiple instance learning
  3. rule learning

Qualifiers

  • Research-article

Conference

GECCO '10
Sponsor:

Acceptance Rates

Overall Acceptance Rate 1,669 of 4,410 submissions, 38%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 139
    Total Downloads
  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)0
Reflects downloads up to 13 Feb 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media