skip to main content
10.1145/3319619.3322058acmconferencesArticle/Chapter ViewAbstractPublication PagesgeccoConference Proceedingsconference-collections
research-article

Evolving convolutional neural networks through grammatical evolution

Published:13 July 2019Publication History

ABSTRACT

The use of Convolutional Neural Networks (CNNs) have proven to be a solid approach often used to solve many Machine Learning problems, such as image classification and natural language processing tasks. The manual design of CNNs is a hard task, due to the high number of configurable parameters and possible configurations. Recent studies around the automatic design of CNNs have shown positive results. In this work, we propose to explore the design of CNN architectures through the use of Grammatical Evolution (GE), where a BNF grammar is used to define the CNN components and structural rules. Experiments were performed using the MNIST and CIFAR-10 datasets. The obtained results show that the presented approach achieved competitive results and maintaining relatively small architectures when compared with similar state-of-the-art approaches.

References

  1. James Bergstra and Yoshua Bengio. 2012. Random search for hyper-parameter optimization. Journal of Machine Learning Research 13, Feb (2012), 281--305. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. James Bergstra, Daniel Yamins, and David Daniel Cox. 2013. Making a science of model search: Hyperparameter optimization in hundreds of dimensions for vision architectures. (2013).Google ScholarGoogle Scholar
  3. Tsung-Han Chan, Kui Jia, Shenghua Gao, Jiwen Lu, Zinan Zeng, and Yi Ma. 2015. PCANet: A simple deep learning baseline for image classification? IEEE Transactions on Image Processing 24, 12 (2015), 5017--5032.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).Google ScholarGoogle Scholar
  5. Alex Krizhevsky and Geoffrey Hinton. 2009. Learning multiple layers of features from tiny images. Technical Report. Citeseer.Google ScholarGoogle Scholar
  6. Yann LeCun, Léon Bottou, Yoshua Bengio, and Patrick Haffner. 1998. Gradient-based learning applied to document recognition. Proc. IEEE 86, 11 (1998), 2278--2324.Google ScholarGoogle ScholarCross RefCross Ref
  7. Ilya Loshchilov and Frank Hutter. 2016. CMA-ES for hyperparameter optimization of deep neural networks. arXiv preprint arXiv:1604.07269 (2016).Google ScholarGoogle Scholar
  8. Risto Miikkulainen, Jason Liang, Elliot Meyerson, Aditya Rawal, Daniel Fink, Olivier Francon, Bala Raju, Hormoz Shahrzad, Arshak Navruzyan, Nigel Duffy, et al. 2019. Evolving deep neural networks. In Artificial Intelligence in the Age of Neural Networks and Brain Computing. Elsevier, 293--312.Google ScholarGoogle Scholar
  9. Masanori Suganuma, Shinichi Shirakawa, and Tomoharu Nagao. 2017. A genetic programming approach to designing convolutional neural network architectures. In Proceedings of the Genetic and Evolutionary Computation Conference. ACM, 497--504. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Yanan Sun, Bing Xue, and Mengjie Zhang. 2017. Evolving deep convolutional neural networks for image classification. arXiv preprint arXiv:1710.10741 (2017).Google ScholarGoogle Scholar
  11. Yanan Sun, Gary G Yen, and Zhang Yi. 2018. Evolving Unsupervised Deep Neural Networks for Learning Meaningful Representations. IEEE Transactions on Evolutionary Computation (2018).Google ScholarGoogle Scholar

Index Terms

  1. Evolving convolutional neural networks through grammatical evolution

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        GECCO '19: Proceedings of the Genetic and Evolutionary Computation Conference Companion
        July 2019
        2161 pages
        ISBN:9781450367486
        DOI:10.1145/3319619

        Copyright © 2019 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 13 July 2019

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        Overall Acceptance Rate1,669of4,410submissions,38%

        Upcoming Conference

        GECCO '24
        Genetic and Evolutionary Computation Conference
        July 14 - 18, 2024
        Melbourne , VIC , Australia

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader