skip to main content
10.1145/2464576.2482712acmconferencesArticle/Chapter ViewAbstractPublication PagesgeccoConference Proceedingsconference-collections
tutorial

Goldenberry: EDA visual programming in orange

Published:06 July 2013Publication History

ABSTRACT

Orange is an open-source component-based software framework, featuring visual and scripting interfaces for many machine learning algorithms. Currently it does not support Estimation of Distribution Algorithms (EDA) or other methods for black-box optimization. Here we introduce Goldenberry, an Orange toolbox of EDA visual components for stochastic search-based optimization. Its main purpose is to provide an user-friendly workbench for researchers and practitioners, building upon the versatile visual front-end of Orange, and the powerful reuse and glue principles of component-based software development. Architecture of the toolbox and implementation details are given, including description and working examples for the components included in its first release: cGA, UMDA, PBIL, TILDA, UMDAc, PBILc, BMDA, CostFunctionBuilder and BlackBoxTester. Goldenberry is open-source and freely available at: http://goldenberry.codeplex.com.

References

  1. F. Bachmann, L. Bass, and R. Nord. Modifiability tactics. Technical Report Carnegie Mellon University/SEI-2007-TR-002, Software Engineering Institute, Carnegie Mellon University, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  2. S. Baluja and R. Caruana. Removing the genetics from the standard genetic algorithm. Technical Report Carnegie Mellon University-CS-95-141, Carnegie-Mellon University, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. F. Buschmann, R. Meunier, H. Rohnert, P. Sommerlad, and M. Stal. Pattern-Oriented Software Architecture Volume 1: A System of Patterns. Wiley, volume 1 edition, Aug. 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. T. Curk, J. Demsar, Q. Xu, G. Leban, U. Petrovic, I. Bratko, G. Shaulsky, and B. Zupan. Microarray data mining with visual programming. Bioinformatics, 21(3):396--398, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. E. Gamma, R. Helm, R. Johnson, and J. Vlissides. Design Patterns: Elements of Reusable Object-Oriented Software. Addison-Wesley Professional, 1 edition, Nov. 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. C. Gonzáles, J. A. Lozano, and P. Larrañaga. Mathematical modelling of UMDAc algorithm with tournament selection. International Journal of Approximate Reasoning, 31(3):313--340, 2002.Google ScholarGoogle ScholarCross RefCross Ref
  7. M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann, and I. Witten. The WEKA data mining software. SIGKDD Explor. Newsl., 11(1):10--18, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. G. R. Harik and F. G. Lobo. The compact genetic algorithm. IEEE Transactions on Evolutionary Computation, 3:523--528, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. IBM. IBM SPSS® Algorithms Guide. 2012.Google ScholarGoogle Scholar
  10. P. Larrañaga and J. A. Lozano, editors. Estimation of Distribution Algorithms: A New Tool for Evolutionary Computation. Springer, 2001.Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. S. Luke. Essentials of Metaheuristics. Lulu, 2009. Available for free at http://cs.gmu.edu/~sean/book/metaheuristics/.Google ScholarGoogle Scholar
  12. I. Mierswa, M. Wurst, R. Klinkenberg, M. Scholz, and T. Euler. Yale: Rapid prototyping for complex data mining tasks. In L. Ungar, M. Craven, D. Gunopulos, and T. Eliassi-Rad, editors, Proceedings of the 12th ACM SIGKDD, pages 935--940, NY, USA, August 2006. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. H. Mühlenbein. The equation for response to selection and its use for prediction. Evolutionary Computation, 5(3):303--346, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. M. Pelikan and H. Müehlenbein. The bivariate marginal distribution algorithm. In R. Roy, T. Furuhashi, and P. Chawdhry, editors, Advances in Soft Computing. Springer London, 1999.Google ScholarGoogle ScholarCross RefCross Ref
  15. M. Pelikan, K. Sastry, and E. Cantú-Paz, editors. Scalable Optimization via Probabilistic Modeling: From Algorithms to Applications. Springer-Verlag, NJ, USA, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. S. Rojas-Galeano, E. Hsieh, D. Agranoff, S. Krishna, and D. Fernandez-Reyes. Estimation of relevant variables on high-dimensional biological patterns using iterated weighted kernel functions. PLoS ONE, 3(3), 2008.Google ScholarGoogle Scholar
  17. S. Rojas-Galeano and N. Rodriguez. A memory efficient and continuous-valued compact EDA for large scale problems. In Proceedings of GECCO 2012, pages 281--288, NY, USA, 2012. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Y. Saeys, I. n. Inza, and P. Larrañaga. A review of feature selection techniques in bioinformatics. Bioinformatics, 23(19):2507--2517, Sept. 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. M. Sebag and A. Ducoulombier. Extending Population-Based incremental learning to continuous search spaces. Lecture Notes in Computer Science, 1498, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Goldenberry: EDA visual programming in orange

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Conferences
            GECCO '13 Companion: Proceedings of the 15th annual conference companion on Genetic and evolutionary computation
            July 2013
            1798 pages
            ISBN:9781450319645
            DOI:10.1145/2464576
            • Editor:
            • Christian Blum,
            • General Chair:
            • Enrique Alba

            Copyright © 2013 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 6 July 2013

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • tutorial

            Acceptance Rates

            Overall Acceptance Rate1,669of4,410submissions,38%

            Upcoming Conference

            GECCO '24
            Genetic and Evolutionary Computation Conference
            July 14 - 18, 2024
            Melbourne , VIC , Australia

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader