research-article

EnsembleGAN: Adversarial Learning for Retrieval-Generation Ensemble Model on Short-Text Conversation

Authors:

Rui Yan

SIGIR'19: Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval

Pages 435 - 444

https://doi.org/10.1145/3331184.3331193

Published: 18 July 2019

Abstract

Generating high-quality responses has always been a challenge for human-computer dialogue systems. Existing dialogue systems are generally either retrieval-based or generation-based, and each approach has its own pros and cons. Although an ensemble of the two is a natural idea, existing ensemble methods focus only on leveraging one approach to enhance the other; we argue that the two can be mutually enhanced with a proper training strategy. In this paper, we propose ensembleGAN, an adversarial learning framework for enhancing a retrieval-generation ensemble model in the open-domain conversation scenario. It consists of a language-model-like generator, a ranker generator, and a ranker discriminator. Aiming to produce responses that approximate the ground truth and receive high ranking scores from the discriminator, the two generators learn to generate improved, highly relevant responses and competitive unobserved candidates, respectively, while the discriminative ranker is trained to distinguish true responses from adversarial ones, thus featuring the merits of both generator counterparts. Experimental results on a large short-text conversation dataset demonstrate the effectiveness of ensembleGAN through improvements in both human and automatic evaluation metrics.
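The interplay described in the abstract can be illustrated with a minimal sketch. The abstract does not give the loss functions or model architectures, so everything below is an assumption for illustration: `overlap_score` is a hypothetical word-overlap stand-in for the neural ranker discriminator, the pairwise hinge loss is one common ranking objective (not necessarily the one used in the paper), and the two candidate lists stand in for the outputs of the language-model-like generator and the ranker generator.

```python
from typing import Callable, List

Scorer = Callable[[str, str], float]

def overlap_score(query: str, response: str) -> float:
    """Hypothetical ranker: Jaccard overlap of word sets (stand-in for a neural discriminator)."""
    q, r = set(query.split()), set(response.split())
    return len(q & r) / len(q | r) if (q | r) else 0.0

def pairwise_hinge_loss(score: Scorer, query: str, truth: str,
                        adversarial: List[str], margin: float = 0.1) -> float:
    """Assumed discriminator objective: rank the true response above each
    adversarial candidate by at least `margin`."""
    s_true = score(query, truth)
    losses = [max(0.0, margin - (s_true - score(query, a))) for a in adversarial]
    return sum(losses) / len(losses) if losses else 0.0

def generator_rewards(score: Scorer, query: str, samples: List[str]) -> List[float]:
    """Assumed generator signal: each sample is rewarded with the discriminator's score."""
    return [score(query, s) for s in samples]

query = "how is the weather today"
truth = "the weather is sunny today"
generated = ["i do not know"]            # stand-in for the language-model-like generator
retrieved = ["weather today is fine"]    # stand-in for the ranker generator's pick
adversarial = generated + retrieved

loss = pairwise_hinge_loss(overlap_score, query, truth, adversarial)
rewards = generator_rewards(overlap_score, query, adversarial)
print(loss, rewards)
```

In this toy setting the true response already outranks both adversarial candidates by more than the margin, so the discriminator loss is zero, while the retrieved candidate earns a higher reward than the generated one. In an adversarial training loop, such rewards would be fed back to the generators, for example through a policy-gradient update.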

Index Terms

EnsembleGAN: Adversarial Learning for Retrieval-Generation Ensemble Model on Short-Text Conversation
1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing
2. Information systems
  1. Information retrieval
    1. Retrieval models and ranking
  2. World Wide Web
    1. Web applications

Information & Contributors

Information

Published In

SIGIR'19: Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval

July 2019

1512 pages

ISBN:9781450361729

DOI:10.1145/3331184

General Chairs:
Benjamin Piwowarski, CNRS - Sorbonne Universite, France
Max Chevalier, Universite de Toulouse, CNRS, France
Eric Gaussier, Universite Grenoble Alpes, CNRS, France

Program Chairs:
Yoelle Maarek, Amazon Research, Israel
Jian-Yun Nie, University of Montreal, Canada
Falk Scholer, RMIT University, Australia

Copyright © 2019 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGIR: ACM Special Interest Group on Information Retrieval

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Research-article

Funding Sources

National Key Research and Development Program of China
National Science Foundation of China

Conference

SIGIR '19

Sponsor:

SIGIR

SIGIR '19: The 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval

July 21 - 25, 2019

Paris, France

Acceptance Rates

SIGIR'19 Paper Acceptance Rate 84 of 426 submissions, 20%;

Overall Acceptance Rate 792 of 3,983 submissions, 20%

Bibliometrics & Citations

Bibliometrics

Article Metrics

Total Citations: 9
Total Downloads: 604
Downloads (Last 12 months): 8
Downloads (Last 6 weeks): 3

Reflects downloads up to 28 Feb 2025

Citations

Cited By

Bouafoud C, Zine-Dine K, Madani A. (2024). The Evolution of Transformers in Education: A Literature Review. 2024 International Conference on Circuit, Systems and Communication (ICCSC), 1-7. Online publication date: 28-Jun-2024.
https://doi.org/10.1109/ICCSC62074.2024.10617128
Fu T, Zhao X, Yan R, Singh A, Sun Y, Akoglu L, Gunopulos D, Yan X, Kumar R, Ozcan F, Ye J. (2023). Delving into Global Dialogue Structures: Structure Planning Augmented Response Selection for Multi-turn Conversations. Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 495-505. Online publication date: 6-Aug-2023.
https://dl.acm.org/doi/10.1145/3580305.3599304
Firdaus M, Thangavelu N, Ekbal A, Bhattacharyya P. (2023). I Enjoy Writing and Playing, Do You?: A Personalized and Emotion Grounded Dialogue Agent Using Generative Adversarial Network. IEEE Transactions on Affective Computing, 14:3, 2127-2138. Online publication date: 1-Jul-2023.
https://doi.org/10.1109/TAFFC.2022.3155105
Ling Y, Liang Z, Wang T, Cai F, Chen H. (2022). Sequential or jumping: context-adaptive response generation for open-domain dialogue systems. Applied Intelligence, 53:9, 11251-11266. Online publication date: 2-Sep-2022.
https://doi.org/10.1007/s10489-022-04067-1
Li J, Liu C, Tao C, Chan Z, Zhao D, Zhang M, Yan R. (2021). Dialogue History Matters! Personalized Response Selection in Multi-Turn Retrieval-Based Chatbots. ACM Transactions on Information Systems, 39:4, 1-25. Online publication date: 17-Aug-2021.
https://dl.acm.org/doi/10.1145/3453183
Li Y, Wen G, Hu Y, Luo M, Fan B, Wang C, Yang P. (2021). Multi-source Seq2seq guided by knowledge for Chinese healthcare consultation. Journal of Biomedical Informatics, 117, 103727. Online publication date: May-2021.
https://doi.org/10.1016/j.jbi.2021.103727
Li M, Fu P, Lin Z, Wang W, Zang W. (2021). Exemplar Guided Latent Pre-trained Dialogue Generation. Computational Science – ICCS 2021, 118-132. Online publication date: 9-Jun-2021.
https://doi.org/10.1007/978-3-030-77964-1_10
Huang M, Zhu X, Gao J. (2020). Challenges in Building Intelligent Open-domain Dialog Systems. ACM Transactions on Information Systems, 38:3, 1-32. Online publication date: 9-Apr-2020.
https://dl.acm.org/doi/10.1145/3383123
Yang M, Liu J, Shen Y, Zhao Z, Chen X, Wu Q, Li C. (2020). An Ensemble of Generation- and Retrieval-Based Image Captioning With Dual Generator Generative Adversarial Network. IEEE Transactions on Image Processing, 29, 9627-9640. Online publication date: 2020.
https://doi.org/10.1109/TIP.2020.3028651
Zhang L, Yang Y, Zhou J, Chen C, He L. (2020). Retrieval-Polished Response Generation for Chatbot. IEEE Access, 1-1. Online publication date: 2020.
https://doi.org/10.1109/ACCESS.2020.3004152
