Judging Medical Q&A Alignments in Multiple Aspects

  • Conference paper
  • First Online:
Artificial Intelligence (CICAI 2021)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 13070))


Abstract

Question and answer (Q&A) matching is a widely studied task that has attracted many prior works. Previous work typically assigns a single overall label indicating whether a question matches an answer. This approach, however, relies mainly on detecting identical or similar keywords in the question and answer, which is ill-suited to medical text: for a given drug, patients' questions may concern usage, side effects, symptoms, or price, so it is unreasonable to judge an answer as matching merely because it mentions the same drug. We argue that a better solution is to judge alignment in two aspects, entity and intention. To this end, we propose a novel model consisting of two modules: an extractor module derives matching features from the text inputs, and a discriminator module assigns alignment labels for both aspects. An adversarial mechanism is designed to disentangle the entity-matching and intention-matching features, reducing mutual interference. Experimental results show that our method outperforms other baselines, including BERT. Further analysis demonstrates the effectiveness and interpretability of the proposed method.

This research was supported by the Guangdong Basic and Applied Basic Research Foundation (No. 2019A1515011387).
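The abstract's core idea, judging a Q&A pair on two aspects (entity and intention) rather than with a single overall label, can be sketched with a toy example. The keyword lists and the overlap heuristic below are invented for illustration only; the paper's actual model learns these matching features with neural extractor and discriminator modules trained under an adversarial disentanglement objective.

```python
# Toy two-aspect Q&A alignment: output (entity_match, intention_match)
# instead of one overall match label. All vocabularies are hypothetical.

ENTITIES = {"aspirin", "ibuprofen"}  # hypothetical drug entities

INTENTIONS = {  # hypothetical intention cue words
    "usage": ("take", "dose"),
    "side_effects": ("side effect", "nausea"),
    "price": ("price", "cost"),
}

def entities_of(text):
    """Drug entities mentioned in the text (case-insensitive)."""
    text = text.lower()
    return {e for e in ENTITIES if e in text}

def intentions_of(text):
    """Intention labels whose cue words appear in the text."""
    text = text.lower()
    return {label for label, cues in INTENTIONS.items()
            if any(c in text for c in cues)}

def judge(question, answer):
    """Two-aspect judgment: does the answer match the question's
    entity, and does it match the question's intention?"""
    entity_match = bool(entities_of(question) & entities_of(answer))
    intention_match = bool(intentions_of(question) & intentions_of(answer))
    return entity_match, intention_match

q = "What is the price of aspirin?"
a_good = "Aspirin usually costs a few dollars per bottle."
a_bad = "Take aspirin twice a day after meals."

print(judge(q, a_good))  # same drug, same intention -> (True, True)
print(judge(q, a_bad))   # same drug, wrong intention -> (True, False)
```

The second answer shows why a single label is inadequate in this domain: a keyword-overlap matcher would accept it because the drug name recurs, while the two-aspect judgment exposes the intention mismatch.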



Author information

Correspondence to Jin Xu or Yujiu Yang.



Copyright information

© 2021 Springer Nature Switzerland AG

About this paper


Cite this paper

Si, P., Deng, Q., Wang, Y., Zhong, B., Xu, J., Yang, Y. (2021). Judging Medical Q&A Alignments in Multiple Aspects. In: Fang, L., Chen, Y., Zhai, G., Wang, J., Wang, R., Dong, W. (eds) Artificial Intelligence. CICAI 2021. Lecture Notes in Computer Science(), vol 13070. Springer, Cham. https://doi.org/10.1007/978-3-030-93049-3_23

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-93049-3_23

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-93048-6

  • Online ISBN: 978-3-030-93049-3

  • eBook Packages: Computer Science (R0)
