skip to main content
10.1145/3331184.3331193acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article

EnsembleGAN: Adversarial Learning for Retrieval-Generation Ensemble Model on Short-Text Conversation

Published: 18 July 2019 Publication History

Abstract

Generating qualitative responses has always been a challenge for human-computer dialogue systems. Existing dialogue systems generally derive from either retrieval-based or generative-based approaches, both of which have their own pros and cons. Despite the natural idea of an ensemble model of the two, existing ensemble methods only focused on leveraging one approach to enhance another, we argue however that they can be further mutually enhanced with a proper training strategy. In this paper, we propose ensembleGAN, an adversarial learning framework for enhancing a retrieval-generation ensemble model in open-domain conversation scenario. It consists of a language-model-like generator, a ranker generator, and one ranker discriminator. Aiming at generating responses that approximate the ground-truth and receive high ranking scores from the discriminator, the two generators learn to generate improved highly relevant responses and competitive unobserved candidates respectively, while the discriminative ranker is trained to identify true responses from adversarial ones, thus featuring the merits of both generator counterparts. The experimental results on a large short-text conversation data demonstrate the effectiveness of the ensembleGAN by the amelioration on both human and automatic evaluation metrics.

References

[1]
Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2015. Neural Machine Translation by Jointly Learning to Align and Translate. In ICLR.
[2]
Hongshen Chen, Xiaorui Liu, Dawei Yin, and Jiliang Tang. 2017. A Survey on Dialogue Systems: Recent Advances and New Frontiers. SIGKDD Explorations, Vol. 19, 2 (2017), 25--35.
[3]
Bo Dai, Sanja Fidler, Raquel Urtasun, and Dahua Lin. 2017. Towards Diverse and Natural Image Descriptions via a Conditional GAN. In ICCV. 2989--2998.
[4]
Joseph L Fleiss. 1971. Measuring nominal scale agreement among many raters. Psychological Bulletin, Vol. 76, 5 (1971), 378--382.
[5]
Ian J. Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron C. Courville, and Yoshua Bengio. 2014. Generative Adversarial Nets. In NIPS. 2672--2680.
[6]
R. Herbrich. 2008. Large margin rank boundaries for ordinal regression. Advances in Large Margin Classifiers, Vol. 88 (2008).
[7]
Sepp Hochreiter and Jü rgen Schmidhuber. 1997. Long Short-Term Memory. Neural Computation, Vol. 9, 8 (1997), 1735--1780.
[8]
Diederik P. Kingma and Jimmy Ba. 2015. Adam: A Method for Stochastic Optimization. In ICLR.
[9]
Jiwei Li, Michel Galley, Chris Brockett, Jianfeng Gao, and Bill Dolan. 2016. A Diversity-Promoting Objective Function for Neural Conversation Models. In NAACL. 110--119.
[10]
Jiwei Li, Will Monroe, Tianlin Shi, Sébastien Jean, Alan Ritter, and Dan Jurafsky. 2017. Adversarial Learning for Neural Dialogue Generation. In EMNLP. 2157--2169.
[11]
Kevin Lin, Dianqi Li, Xiaodong He, Ming-Ting Sun, and Zhengyou Zhang. 2017. Adversarial Ranking for Language Generation. In NIPS. 3158--3168.
[12]
Chia-Wei Liu, Ryan Lowe, Iulian Serban, Michael Noseworthy, Laurent Charlin, and Joelle Pineau. 2016. How NOT To Evaluate Your Dialogue System: An Empirical Study of Unsupervised Evaluation Metrics for Dialogue Response Generation. In EMNLP. 2122--2132.
[13]
Linqing Liu, Yao Lu, Min Yang, Qiang Qu, Jia Zhu, and Hongyan Li. 2018. Generative Adversarial Network for Abstractive Text Summarization. In AAAI.
[14]
Mehdi Mirza and Simon Osindero. 2014. Conditional Generative Adversarial Nets. CoRR, Vol. abs/1411.1784 (2014).
[15]
Lili Mou, Yiping Song, Rui Yan, Ge Li, Lu Zhang, and Zhi Jin. 2016. Sequence to Backward and Forward Sequences: A Content-Introducing Approach to Generative Short-Text Conversation. In COLING. 3349--3358.
[16]
Kishore Papineni, Salim Roukos, Todd Ward, and Wei-Jing Zhu. 2002. BLEU: a method for automatic evaluation of machine translation. In ACL. 311--318.
[17]
Lifeng Shang, Zhengdong Lu, and Hang Li. 2015. Neural Responding Machine for Short-Text Conversation. In ACL. 1577--1586.
[18]
Yiping Song, Cheng-Te Li, Ming Zhang, Dongyan Zhao, and Rui Yan. 2018. An Ensemble of Retrieval-Based and Generation-Based Human-Computer Conversation Systems. In IJCAI. 4382--4388.
[19]
Richard S. Sutton, David A. McAllester, Satinder P. Singh, and Yishay Mansour. 1999. Policy Gradient Methods for Reinforcement Learning with Function Approximation. In NIPS. 1057--1063.
[20]
Chongyang Tao, Shen Gao, Mingyue Shang, Wei Wu, Dongyan Zhao, and Rui Yan. 2018a. Get the Point of My Utterance! Learning Towards Effective Responses with Multi-head Attention Mechanism. In IJCAI. 4418--4424.
[21]
Chongyang Tao, Lili Mou, Dongyan Zhao, and Rui Yan. 2018b. RUBER: An Unsupervised Method for Automatic Evaluation of Open-Domain Dialog Systems. In AAAI. 722--729.
[22]
Chongyang Tao, Wei Wu, Can Xu, Wenpeng Hu, Dongyan Zhao, and Rui Yan. 2019. Multi-Representation Fusion Network for Multi-Turn Response Selection in Retrieval-Based Chatbots. In WSDM. 267--275.
[23]
Ashwin K. Vijayakumar, Michael Cogswell, Ramprasaath R. Selvaraju, Qing Sun, Stefan Lee, David J. Crandall, and Dhruv Batra. 2018. Diverse Beam Search for Improved Description of Complex Scenes. In AAAI. 7371--7379.
[24]
Chenglong Wang, Feijun Jiang, and Hongxia Yang. 2017. A Hybrid Framework for Text Modeling with Convolutional RNN. In SIGKDD. 2061--2069.
[25]
Hao Wang, Zhengdong Lu, Hang Li, and Enhong Chen. 2013. A Dataset for Research on Short-Text Conversations. In EMNLP. 935--945.
[26]
Jun Wang, Lantao Yu, Weinan Zhang, Yu Gong, Yinghui Xu, Benyou Wang, Peng Zhang, and Dell Zhang. 2017. IRGAN: A Minimax Game for Unifying Generative and Discriminative Information Retrieval Models. In SIGIR. 515--524.
[27]
J. Weston, E. Dinan, and A. H. Miller. 2018. Retrieve and Refine: Improved Sequence Generation Models For Dialogue. CoRR, Vol. abs/1808.04776 (2018).
[28]
Yu Wu, Furu Wei, Shaohan Huang, Zhoujun Li, and Ming Zhou. 2018. Response Generation by Context-aware Prototype Editing. CoRR, Vol. abs/1806.07042 (2018).
[29]
Yu Wu, Wei Wu, Chen Xing, Ming Zhou, and Zhoujun Li. 2017. Sequential matching network: A new architecture for multi-turn response selection in retrieval-based chatbots. In ACL. 496--505.
[30]
Yu Wu, Wei Wu, Dejian Yang, Can Xu, and Zhoujun Li. 2018. Neural Response Generation With Dynamic Vocabularies. In AAAI. 5594--5601.
[31]
Chen Xing, Wei Wu, Yu Wu, Jie Liu, Yalou Huang, Ming Zhou, and Wei-Ying Ma. 2017. Topic Aware Neural Response Generation. In AAAI. 3351--3357.
[32]
Jingjing Xu, Xu Sun, Xuancheng Ren, Junyang Lin, Bingzhen Wei, and Wei Li. 2018. DP-GAN: Diversity-Promoting Generative Adversarial Network for Generating Informative and Diversified Text. In EMNLP. 3940--3949.
[33]
Rui Yan, Yiping Song, and Hua Wu. 2016. Learning to Respond with Deep Neural Networks for Retrieval-Based Human-Computer Conversation System. In SIGIR. 55--64.
[34]
Rui Yan and Dongyan Zhao. 2018. Coupled context modeling for deep chit-chat: towards conversations between human and computer. In SIGKDD. 2574--2583.
[35]
Rui Yan, Dongyan Zhao, and Weinan E. 2017. Joint learning of response ranking and next utterance suggestion in human-computer conversation system. In SIGIR. 685--694.
[36]
Zhen Yang, Wei Chen, Feng Wang, and Bo Xu. 2018. Improving Neural Machine Translation with Conditional Sequence Generative Adversarial Nets. In NAACL.
[37]
Lili Yao, Yaoyuan Zhang, Yansong Feng, Dongyan Zhao, and Rui Yan. 2017. Towards Implicit Content-Introducing for Generative Short-Text Conversation Systems. In EMNLP. 2190--2199.
[38]
T. Young, E. Cambria, I. Chaturvedi, M. Huang, H. Zhou, and S. Biswas. 2018. Augmenting End-to-End Dialogue Systems with Commonsense Knowledge. In AAAI. 4970--4977.
[39]
Lantao Yu, Weinan Zhang, Jun Wang, and Yong Yu. 2017. SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient. In AAAI. 2852--2858.
[40]
Tiancheng Zhao, Ran Zhao, and Maxine Eskenazi. 2017. Learning Discourse-level Diversity for Neural Dialog Models using Conditional Variational Autoencoders. In ACL. 654--664.
[41]
Jun-Yan Zhu, Taesung Park, Phillip Isola, and Alexei A. Efros. 2017. Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks. In ICCV, Vol. 2223--2232.

Cited By

View all
  • (2024)The Evolution of Transformers in Education: A Literature Review2024 International Conference on Circuit, Systems and Communication (ICCSC)10.1109/ICCSC62074.2024.10617128(1-7)Online publication date: 28-Jun-2024
  • (2023)Delving into Global Dialogue Structures: Structure Planning Augmented Response Selection for Multi-turn ConversationsProceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3580305.3599304(495-505)Online publication date: 6-Aug-2023
  • (2023)I Enjoy Writing and Playing, Do You?: A Personalized and Emotion Grounded Dialogue Agent Using Generative Adversarial NetworkIEEE Transactions on Affective Computing10.1109/TAFFC.2022.315510514:3(2127-2138)Online publication date: 1-Jul-2023
  • Show More Cited By

Index Terms

  1. EnsembleGAN: Adversarial Learning for Retrieval-Generation Ensemble Model on Short-Text Conversation

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM Conferences
        SIGIR'19: Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval
        July 2019
        1512 pages
        ISBN:9781450361729
        DOI:10.1145/3331184
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Sponsors

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 18 July 2019

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. ensemble method
        2. generation-based conversation
        3. generative adversarial network
        4. retrieval-based conversation
        5. short-text conversation

        Qualifiers

        • Research-article

        Funding Sources

        • National Key Research and Development Program of China
        • National Science Foundation of China

        Conference

        SIGIR '19
        Sponsor:

        Acceptance Rates

        SIGIR'19 Paper Acceptance Rate 84 of 426 submissions, 20%;
        Overall Acceptance Rate 792 of 3,983 submissions, 20%

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)8
        • Downloads (Last 6 weeks)3
        Reflects downloads up to 28 Feb 2025

        Other Metrics

        Citations

        Cited By

        View all
        • (2024)The Evolution of Transformers in Education: A Literature Review2024 International Conference on Circuit, Systems and Communication (ICCSC)10.1109/ICCSC62074.2024.10617128(1-7)Online publication date: 28-Jun-2024
        • (2023)Delving into Global Dialogue Structures: Structure Planning Augmented Response Selection for Multi-turn ConversationsProceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3580305.3599304(495-505)Online publication date: 6-Aug-2023
        • (2023)I Enjoy Writing and Playing, Do You?: A Personalized and Emotion Grounded Dialogue Agent Using Generative Adversarial NetworkIEEE Transactions on Affective Computing10.1109/TAFFC.2022.315510514:3(2127-2138)Online publication date: 1-Jul-2023
        • (2022)Sequential or jumping: context-adaptive response generation for open-domain dialogue systemsApplied Intelligence10.1007/s10489-022-04067-153:9(11251-11266)Online publication date: 2-Sep-2022
        • (2021)Dialogue History Matters! Personalized Response Selection in Multi-Turn Retrieval-Based ChatbotsACM Transactions on Information Systems10.1145/345318339:4(1-25)Online publication date: 17-Aug-2021
        • (2021)Multi-source Seq2seq guided by knowledge for Chinese healthcare consultationJournal of Biomedical Informatics10.1016/j.jbi.2021.103727117(103727)Online publication date: May-2021
        • (2021)Exemplar Guided Latent Pre-trained Dialogue GenerationComputational Science – ICCS 202110.1007/978-3-030-77964-1_10(118-132)Online publication date: 9-Jun-2021
        • (2020)Challenges in Building Intelligent Open-domain Dialog SystemsACM Transactions on Information Systems10.1145/338312338:3(1-32)Online publication date: 9-Apr-2020
        • (2020)An Ensemble of Generation- and Retrieval-Based Image Captioning With Dual Generator Generative Adversarial NetworkIEEE Transactions on Image Processing10.1109/TIP.2020.302865129(9627-9640)Online publication date: 2020
        • (2020)Retrieval-Polished Response Generation for ChatbotIEEE Access10.1109/ACCESS.2020.3004152(1-1)Online publication date: 2020

        View Options

        Login options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Figures

        Tables

        Media

        Share

        Share

        Share this Publication link

        Share on social media