research-article

Generating Datasets for Classification Task and Predicting Best Classifiers with Conditional Generative Adversarial Networks

Authors:

Ilya Kachalsky,

Alexey Zabashta,

Andrey Filchenkov,

Georgiy KorneevAuthors Info & Claims

ICAAI '19: Proceedings of the 3rd International Conference on Advances in Artificial Intelligence

Pages 97 - 101

https://doi.org/10.1145/3369114.3369153

Published: 21 January 2020 Publication History

Abstract

We focus on the algorithm selection problem and closely related dataset synthesis problem. We present conditional deep convolutional generative adversarial network we call LM-GAN, generator of which is capable of synthesizing dataset for classification in the matrix form with numeric features and discriminator of which can perform the best classifier prediction for a new never seen dataset. We also suggest a technique for transforming matrices representing datasets to a canonical form. Experimental evaluation shows that the presented network working with matrices in the canonical form outperforms baseline solutions in dataset synthesis and the best classifier prediction.

References

[1]

John R Rice. The algorithm selection problem. In Advances in computers, volume 15, pages 65--118. Elsevier, 1976.

[2]

David H Wolpert and William G Macready. No free lunch theorems for optimization. IEEE transactions on evolutionary computation, 1(1):67--82, 1997.

Digital Library

[3]

Pavel Brazdil, Christophe Giraud Carrier, Carlos Soares, and Ricardo Vilalta. Metalearning: Applications to data mining. Springer Science & Business Media, 2008.

[4]

Matthias Feurer, Jost Tobias Springenberg, and Frank Hutter. Initializing bayesian hyperparameter optimization via meta-learning. In Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015.

Digital Library

[5]

Kate Smith-Miles and Thomas T Tan. Measuring algorithm footprints in instance space. In 2012 IEEE Congress on Evolutionary Computation, pages 1--8. IEEE, 2012.

[6]

Jim Young, Patrick Graham, and Richard Penny. Using bayesian networks to create synthetic data. Journal of Official Statistics, 25(4):549, 2009.

[7]

Kate Smith-Miles, Davaatseren Baatar, Brendan Wreford, and Rhyd Lewis. Towards objective measures of algorithm performance across instance space. Computers & Operations Research, 45:12--24, 2014.

Digital Library

[8]

Alexey Zabashta and Andrey Filchenkov. NDSE: instance generation for classification by given meta-feature description. In CEUR Workshop Proceedings, volume 1998, pages 102--104, 2017.

[9]

Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. Generative adversarial nets. In Advances in neural information processing systems, pages 2672--2680, 2014.

Digital Library

[10]

Yuki Saito, Shinnosuke Takamichi, and Hiroshi Saruwatari. Statistical parametric speech synthesis incorporating generative adversarial networks. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 26(1):84--96, 2017.

[11]

Hao-Wen Dong, Wen-Yi Hsiao, Li-Chia Yang, and Yi-Hsuan Yang. Musegan: Multi-track sequential generative adversarial networks for symbolic music generation and accompaniment. In Thirty-Second AAAI Conference on Artificial Intelligence, 2018.

[12]

Evgeny Putin, Arip Asadulaev, Quentin Vanhaelen, Yan Ivanenkov, Anastasia V. Aladinskaya, Alex Aliper, and Alex Zhavoronkov. Adversarial threshold neural computer for molecular de novo design. Molecular Pharmaceutics, 15(10):4386--4397, 2018.

[13]

Alex Zhavoronkov, Yan A Ivanenkov, Alex Aliper, Mark S Veselov, Vladimir A Aladinskiy, Anastasiya V Aladinskaya, Victor A Terentiev, Daniil A Polykovskiy, Maksim D Kuznetsov, Arip Asadulaev, et al. Deep learning enables rapid identification of potent ddr1 kinase inhibitors. Nature biotechnology, pages 1--4, 2019.

[14]

Emily L Denton, Soumith Chintala, Rob Fergus, et al. Deep generative image models using a laplacian pyramid of adversarial networks. In Advances in neural information processing systems, pages 1486--1494, 2015.

Digital Library

[15]

Scott Reed, Zeynep Akata, Xinchen Yan, Lajanugen Logeswaran, Bernt Schiele, and Honglak Lee. Generative adversarial text to image synthesis. arXiv preprint arXiv:1605.05396, 2016.

Digital Library

[16]

Martin Arjovsky, Soumith Chintala, and Léon Bottou. Wasserstein generative adversarial networks. In International conference on machine learning, pages 214--223, 2017.

Digital Library

[17]

Tero Karras, Samuli Laine, and Timo Aila. A style-based generator architecture for generative adversarial networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 4401--4410, 2019.

[18]

Jon Gauthier. Conditional generative adversarial nets for convolutional face generation. Class Project for Stanford CS231N: Convolutional Neural Networks for Visual Recognition, Winter semester, 2014(5):2, 2014.

[19]

Nikhil Ketkar. Introduction to PyTorch, pages 195--208. Apress, Berkeley, CA, 2017.

[20]

Joaquin Vanschoren, Jan N Van Rijn, Bernd Bischl, and Luis Torgo. Openml: networked science in machine learning. ACM SIGKDD Explorations Newsletter, 15(2):49--60, 2014.

Digital Library

[21]

Andrey Filchenkov and Arseniy Pendryak. Datasets meta-feature description for recommending feature selection algorithm. In 2015 Artificial Intelligence and Natural Language and Information Extraction, Social Media and Web Search FRUCT Conference (AINL-ISMW FRUCT), pages 11--18, 2015.

[22]

Fabian Pedregosa, Gaël Varoquaux, Alexandre Gramfort, Vincent Michel, Bertrand Thirion, Olivier Grisel, Mathieu Blondel, Peter Prettenhofer, Ron Weiss, Vincent Dubourg, et al. Scikit-learn: Machine learning in python. Journal of machine learning research, 12(Oct):2825--2830, 2011.

Digital Library

[23]

Alexey Zabashta and Andrey Filchenkov. Active dataset generation for meta-learning system quality improvement. In International Conference on Intelligent Data Engineering and Automated Learning. Springer, 2019. in press.

Digital Library

Cited By

Drozdov GZabashta AFilchenkov A(2020)Graph Convolutional Network Based Generative Adversarial Networks for the Algorithm Selection Problem in ClassificationProceedings of the 2020 1st International Conference on Control, Robotics and Intelligent System10.1145/3437802.3437818(88-92)Online publication date: 27-Oct-2020
https://dl.acm.org/doi/10.1145/3437802.3437818
Sahipov IZabashta AFilchenkov A(2020)Stabilization of Dataset Matrix Form for Classification Dataset Generation and Algorithm SelectionIntelligent Data Engineering and Automated Learning – IDEAL 202010.1007/978-3-030-62365-4_7(66-75)Online publication date: 27-Oct-2020
https://doi.org/10.1007/978-3-030-62365-4_7

Index Terms

Generating Datasets for Classification Task and Predicting Best Classifiers with Conditional Generative Adversarial Networks
1. Computing methodologies
  1. Machine learning
    1. Learning paradigms
      1. Reinforcement learning
        Adversarial learning
      2. Supervised learning
        Supervised learning by classification
    2. Machine learning approaches
      1. Neural networks

Recommendations

Graph Convolutional Network Based Generative Adversarial Networks for the Algorithm Selection Problem in Classification
CCRIS '20: Proceedings of the 2020 1st International Conference on Control, Robotics and Intelligent System

In this work, we address the algorithm selection problem for classification via meta-learning and generative adversarial networks. We focus on the dataset representation question. The matrix representation of classification dataset is not sensitive to ...
AdaBoost classifiers for pecan defect classification

Highlights The performance of AdaBoost algorithms were compared with support vector machine and Bayesian classifiers for pecan defect classification. AdaBoost classifiers took least time and gave best classification accuracy. AdaBoost classifiers ...
Multi-class classification via heterogeneous ensemble of one-class classifiers

In this paper, a multi-class classification method based on heterogeneous ensemble of one-class classifiers is proposed. The proposed method consists of two phases: training heterogeneous one-class classifiers for each class using various one-class ...

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences

ICAAI '19: Proceedings of the 3rd International Conference on Advances in Artificial Intelligence

October 2019

253 pages

ISBN:9781450372534

DOI:10.1145/3369114

Copyright © 2019 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

In-Cooperation

Northumbria University: University of Northumbria at Newcastle

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 21 January 2020

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed limited

Conference

ICAAI 2019

ICAAI 2019: 2019 The 3rd International Conference on Advances in Artificial Intelligence

October 26 - 28, 2019

Istanbul, Turkey

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

2
Total Citations
View Citations
79
Total Downloads

Downloads (Last 12 months)1
Downloads (Last 6 weeks)0

Reflects downloads up to 20 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

Drozdov GZabashta AFilchenkov A(2020)Graph Convolutional Network Based Generative Adversarial Networks for the Algorithm Selection Problem in ClassificationProceedings of the 2020 1st International Conference on Control, Robotics and Intelligent System10.1145/3437802.3437818(88-92)Online publication date: 27-Oct-2020
https://dl.acm.org/doi/10.1145/3437802.3437818
Sahipov IZabashta AFilchenkov A(2020)Stabilization of Dataset Matrix Form for Classification Dataset Generation and Algorithm SelectionIntelligent Data Engineering and Automated Learning – IDEAL 202010.1007/978-3-030-62365-4_7(66-75)Online publication date: 27-Oct-2020
https://doi.org/10.1007/978-3-030-62365-4_7

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents