Chinese medical question answer selection via hybrid models based on CNN and GRU

Zhang, Yuteng; Lu, Wenpeng; Ou, Weihua; Zhang, Guoqiang; Zhang, Xu; Cheng, Jinyong; Zhang, Weiyu

doi:10.1007/s11042-019-7240-1

Chinese medical question answer selection via hybrid models based on CNN and GRU

Published: 01 March 2019

Volume 79, pages 14751–14776, (2020)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Yuteng Zhang¹,
Wenpeng Lu ORCID: orcid.org/0000-0002-1840-3540¹,
Weihua Ou²,
Guoqiang Zhang³,
Xu Zhang¹,
Jinyong Cheng¹ &
…
Weiyu Zhang¹

1284 Accesses
57 Citations
Explore all metrics

Abstract

Question answer selection in the Chinese medical field is very challenging since it requires effective text representations to capture the complex semantic relationships between Chinese questions and answers. Recent approaches on deep learning, e.g., CNN and RNN, have shown their potential in improving the selection quality. However, these existing methods can only capture a part or one-side of semantic relationships while ignoring the other rich and sophisticated ones, leading to limited performance improvement. In this paper, a series of neural network models are proposed to address Chinese medical question answer selection issue. In order to model the complex relationships between questions and answers, we develop both single and hybrid models with CNN and GRU to combine the merits of different neural network architectures. This is different from existing works that can onpy capture partial relationships by utilizing a single network structure. Extensive experimental results on cMedQA dataset demonstrate that the proposed hybrid models, especially BiGRU-CNN, significantly outperform the state-of-the-art methods. The source codes of our models are available in the GitHub (https://github.com/zhangyuteng/MedicalQA-CNN-BiGRU).

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Co-attention fusion based deep neural network for Chinese medical answer selection

Article 08 February 2021

Chinese Medical Question Answer Matching with Stack-CNN

BERTDAN: Question-Answer Dual Attention Fusion Networks with Pre-trained Models for Answer Selection

Notes

References

Abacha AB, Zweigenbaum P (2012) Medical question answering: translating medical questions into sparql queries. In: Proceedings of the 2nd ACM SIGHIT international health informatics symposium, pp 41–50. ACM
Athenikos SJ, Han H, Brooks AD (2009) A framework of a logic-based question-answering system for the medical domain (loqas-med). In: Proceedings of the ACM symposium on applied computing, pp 847–851. ACM, p 2009
Bromley J, Guyon I, LeCun Y, Säckinger E, Shah R (1994) Signature verification using a ‘siamese’ time delay neural network. In: Advances in neural information processing systems, pp 737–744
Cairns BL, Nielsen RD, Masanz JJ, Martin JH, Palmer MS, Ward WH, Savova GK (2011) The mipacq clinical question answering system. In: AMIA annual symposium proceedings, vol 2011, pp 171. American medical informatics association
Chao L (2016) Research and application on intelligent disease guidance and medical question answering method. Master’s thesis, Dalian University of Technology
Cho K, Van Merrienboer B, Gulcehre C, Bahdanau D, Bougares F, Schwenk H, Bengio Y (2014) Learning phrase representations using rnn encoder-decoder for statistical machine translation. In: Methods in natural language processing (EMNLP), October 25-29, 2014, Doha, Qatar, pp 1724–1734. https://www.aclweb.org/anthology/D14-1179
Gao L, Guo Z, Zhang H, Xu X, Shen HT (2017) Video captioning with attention-based lstm and semantic consistency. IEEE Trans Multimedia 19(9):2045–2055
Article Google Scholar
Heilman M, Smith NA (2010) Tree edit models for recognizing textual entailments, paraphrases, and answers to questions. In: Human language technologies: conference of the North American chapter of the association of computational linguistics, pp 1011–1019
Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735–1780
Article Google Scholar
Hu B, Lu Z, Li H, Chen Q (2014) Convolutional neural network architectures for matching natural language sentences. In: Advances in neural information processing systems, pp 2042–2050
Jain S, Dodiya T (2014) Rule based architecture for medical question answering system. In: Proceedings of the 2nd international conference on soft computing for problem solving (SocProS 2012), pp 1225–1233 Springer
LeCun Y, Chopra S, Hadsell R, Ranzato M, Huang F (2006) A tutorial on energy-based learning. Predicting Structured Data 1:1–59. http://yann.lecun.com/exdb/publis/pdf/lecun-06.pdf
Li S, Zhao Z, Hu R, Li W, Liu T, Du X (2018) Analogical reasoning on chinese morphological and semantic relations. In: Proceedings of the 56th annual meeting of the association for computational linguistics (Volume 2: Short Papers), pp 138–143 Association for Computational Linguistics
Lu H, Li Y, Chen M, Kim H, Serikawa S (2018) Brain intelligence: go beyond artificial intelligence. Mob Netw Appl 23(2):368–375
Article Google Scholar
Lu H, Li Y, Uemura T, Ge Z, Xu X, He L, Serikawa S, Kim H (2017) Fdcnet: filtering deep convolutional network for marine organism classification. Multimed Tools Appl 77(2):1–14
Google Scholar
Lu H, Li Y, Uemura T, Kim H, Serikawa S (2018) Low illumination underwater light field images reconstruction using deep convolutional neural networks. Future Generation Computer Systems
Lu W (2018) Word sense disambiguation based on dependency constraint knowledge. Clust Comput, pp 1–9. https://link.springer.com/article/10.1007/s10586-018-1899-3
Lu W, Huang H, Zhu C (2012) Feature words selection for knowledge-based word sense disambiguation with syntactic parsing. Przeglad Elektrotechniczny 88(1b):82–87
Google Scholar
Lu W, Wu H, Jian P, Huang Y, Huang H (2018) An empirical study of classifier combination based word sense disambiguation. IEICE Trans Inf Syst 101(1):225–233
Article Google Scholar
Mihalcea R, Textrank PT (2004) Bringing order into text. In: Proceedings of the 2004 conference on empirical methods in natural language processing
Moschitti A, Quarteroni S (2011) Linguistic kernels for answer re-ranking in question answering systems. Inf Process Manag 47(6):825–842
Article Google Scholar
Qiu X, Huang X (2015) Convolutional neural tensor network architecture for community-based question answering. In: Proceedings of international joint conferences on artificial intelligence, pp 1305–1311
Robertson S, Zaragoza H et al (2009) The probabilistic relevance framework: Bm25 and beyond. Foundations and Trends® in Information Retrieval 3(4):333–389
Article Google Scholar
Tan M, dos Santos C, Xiang B, Zhou B (2015) Lstm-based deep learning models for non-factoid answer selection. arXiv:1511.04108
Toba H, Ming Z-Y, Adriani M, Chua T-S (2014) Discovering high quality answers in community question answering archives using a hierarchy of classifiers. Inf Sci 261:101–115
Article MathSciNet Google Scholar
Tymoshenko K, Bonadiman D, Moschitti A (2016) Convolutional neural networks vs. convolution kernels: feature engineering for answer sentence reranking. In: Proceedings of the 2016 conference of the North American chapter of the association for computational linguistics: human language technologies, pp 1268–1278
Wang B, Niu J, Ma L, Zhang Y, Zhang L, Li J, Zhang P, Song D (2016) A chinese question answering approach integrating count-based and embedding-based features. In: Natural language understanding and intelligent applications, pp 934–941. Springer
Wang J, Man C, Zhao Y, Wang F (2016) An answer recommendation algorithm for medical community question answering systems. In: IEEE International Conference on Service Operations and Logistics, and Informatics, pp 139–144
Wang S, Cao L (2017) Inferring implicit rules by learning explicit and hidden item dependency. IEEE Transactions on Systems, Man, and Cybernetics: Systems. https://ieeexplore.ieee.org/abstract/document/8194854
Wang S, Hu L, Cao L, Huang X, Lian D, Liu W (2018) Attention-based transactional context embedding for next-item recommendation
Wang S, Liu W, Wu J, Cao L, Meng Q, Kennedy PJ (2016) Training deep neural networks on imbalanced data sets. In: International joint conference on neural networks (IJCNN), pp 4368–4374. IEEE, p 2016
Xiang L, Yu J, Yang C, Zeng D, Shen X (2018) A word-embedding-based steganalysis method for linguistic steganography via synonym substitution. IEEE Access 6:64131–64141
Article Google Scholar
Xu X, He L, Lu H, Gao L, Ji Y (2018) Deep adversarial metric learning for cross-modal retrieval. World Wide Web, pp 1–16. https://link.springer.com/article/10.1007/s11280-018-0541-x
Xu X, He L, Shimada A, Taniguchi R-I, Lu Hu (2016) Learning unified binary codes for cross-modal retrieval via latent semantic hashing. Neurocomputing 213:191–203
Article Google Scholar
Xu X, Shen F, Shen HT, Li X (2017) Learning discriminative binary codes for large-scale cross-modal retrieval. IEEE Trans Image Process Publication IEEE Signal Process Soc 26(5):2494–2507
Article MathSciNet Google Scholar
Xu X, Song J, Lu H, He L, Yang Y, Shen F (2018) Dual learning for visual question generation. In: IEEE international conference on multimedia and expo (ICME), pp 1–6. IEEE
Yao X, Van Durme B, Callison-Burch C, Clark P (2013) Answer extraction as sequence tagging with tree edit distance. In: Proceedings of the 2013 conference of the North American chapter of the association for computational linguistics: human language technologies, pp 858–867
Yen S-J, Wu Y-C, Yang J-C, Lee Y-S, Lee C-J, Liu J-J (2013) A support vector machine-based context-ranking model for question answering. Inf Sci 224:77–87
Article Google Scholar
Yin W, Schütze H, Xiang B, Zhou B (2015) Abcnn: Attention-based convolutional neural network for modeling sentence pairs. arXiv:1512.05193
Yu H, Lee M, Kaufman D, Ely J, Osheroff JA, Hripcsak G, Cimino J (2007) Development, implementation, and a cognitive evaluation of a definitional question answering system for physicians. J Biomed Inform 40(3):236–251
Article Google Scholar
Yuan L, Yuan A, Hasan S (2017) Improving clinical diagnosis inference through integration of structured and unstructured knowledge. In: Proceedings of the 1st workshop on sense, concept and entity representations and their applications, pp 31–36
Zhang M, Zhang Y, Che W, Liu T (2014) Character-level chinese dependency parsing. In: Proceedings of the 52nd annual meeting of the association for computational linguistics (Volume 1: Long Papers), vol 1, pp 1326–1336
Zhang S, Zhang X, Wang H, Cheng J, Li P, Ding Z (2017) Chinese medical question answer matching using end-to-end character-level multi-scale cnns. Appl Sci 7(8):767
Article Google Scholar
Zhou Q, Yang W, Gao G, Ou W, Lu H, Chen J, Latecki LJ (2018) Multi-scale deep context convolutional neural networks for semantic segmentation. World Wide Web, pp 1–16. https://link.springer.com/article/10.1007/s11280-018-0556-3
Zhou Q, Zheng B, Zhu W, Latecki LJ (2016) Multi-scale context for scene labeling via flexible segmentation graph. Pattern Recogn 59:312–324
Article Google Scholar

Download references

Acknowledgements

The research work is supported by the National Key R&D Program of China under Grant No.2018YFC0831704, National Nature Science Foundation of China under Grant No.61502259 and No.61762021, Natural Science Foundation of Guizhou Province under Grant No.2017[1130], Key Subjects Construction of Guizhou Province under Grant No.ZDXK[2016]8, Natural Science Foundation of Shandong Province under Grant No.ZR2017MF056. Taishan Scholar Program of Shandong Province in China (Directed by Prof. Yinglong Wang)

Author information

Authors and Affiliations

School of Computer Science and Technology, QiLu University of Technology (Shandong Academy of Sciences), Jinan, China
Yuteng Zhang, Wenpeng Lu, Xu Zhang, Jinyong Cheng & Weiyu Zhang
School of Big Data and Computer Science, Guizhou Normal University, Guiyang, China
Weihua Ou
Centre for Audio, Acoustics and Vibration, University of Technology Sydney, Sydney, Australia
Guoqiang Zhang

Authors

Yuteng Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Wenpeng Lu
View author publications
You can also search for this author in PubMed Google Scholar
Weihua Ou
View author publications
You can also search for this author in PubMed Google Scholar
Guoqiang Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Xu Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Jinyong Cheng
View author publications
You can also search for this author in PubMed Google Scholar
Weiyu Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Wenpeng Lu.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zhang, Y., Lu, W., Ou, W. et al. Chinese medical question answer selection via hybrid models based on CNN and GRU. Multimed Tools Appl 79, 14751–14776 (2020). https://doi.org/10.1007/s11042-019-7240-1

Download citation

Received: 30 September 2018
Revised: 14 January 2019
Accepted: 16 January 2019
Published: 01 March 2019
Issue Date: June 2020
DOI: https://doi.org/10.1007/s11042-019-7240-1

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Chinese medical question answer selection via hybrid models based on CNN and GRU

Abstract

Access this article

Similar content being viewed by others

Co-attention fusion based deep neural network for Chinese medical answer selection

Chinese Medical Question Answer Matching with Stack-CNN

BERTDAN: Question-Answer Dual Attention Fusion Networks with Pre-trained Models for Answer Selection

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Chinese medical question answer selection via hybrid models based on CNN and GRU

Abstract

Access this article

Similar content being viewed by others

Co-attention fusion based deep neural network for Chinese medical answer selection

Chinese Medical Question Answer Matching with Stack-CNN

BERTDAN: Question-Answer Dual Attention Fusion Networks with Pre-trained Models for Answer Selection

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation