Skip to main content
Log in

Chinese medical question answer selection via hybrid models based on CNN and GRU

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Question answer selection in the Chinese medical field is very challenging since it requires effective text representations to capture the complex semantic relationships between Chinese questions and answers. Recent approaches on deep learning, e.g., CNN and RNN, have shown their potential in improving the selection quality. However, these existing methods can only capture a part or one-side of semantic relationships while ignoring the other rich and sophisticated ones, leading to limited performance improvement. In this paper, a series of neural network models are proposed to address Chinese medical question answer selection issue. In order to model the complex relationships between questions and answers, we develop both single and hybrid models with CNN and GRU to combine the merits of different neural network architectures. This is different from existing works that can onpy capture partial relationships by utilizing a single network structure. Extensive experimental results on cMedQA dataset demonstrate that the proposed hybrid models, especially BiGRU-CNN, significantly outperform the state-of-the-art methods. The source codes of our models are available in the GitHub (https://github.com/zhangyuteng/MedicalQA-CNN-BiGRU).

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12

Similar content being viewed by others

Notes

  1. https://github.com/zhangyuteng/MedicalQA-CNN-BiGRU

  2. http://dxy.com

  3. http://www.xywy.com

  4. https://github.com/Embedding/Chinese-Word-Vectors

References

  1. Abacha AB, Zweigenbaum P (2012) Medical question answering: translating medical questions into sparql queries. In: Proceedings of the 2nd ACM SIGHIT international health informatics symposium, pp 41–50. ACM

  2. Athenikos SJ, Han H, Brooks AD (2009) A framework of a logic-based question-answering system for the medical domain (loqas-med). In: Proceedings of the ACM symposium on applied computing, pp 847–851. ACM, p 2009

  3. Bromley J, Guyon I, LeCun Y, Säckinger E, Shah R (1994) Signature verification using a ‘siamese’ time delay neural network. In: Advances in neural information processing systems, pp 737–744

  4. Cairns BL, Nielsen RD, Masanz JJ, Martin JH, Palmer MS, Ward WH, Savova GK (2011) The mipacq clinical question answering system. In: AMIA annual symposium proceedings, vol 2011, pp 171. American medical informatics association

  5. Chao L (2016) Research and application on intelligent disease guidance and medical question answering method. Master’s thesis, Dalian University of Technology

  6. Cho K, Van Merrienboer B, Gulcehre C, Bahdanau D, Bougares F, Schwenk H, Bengio Y (2014) Learning phrase representations using rnn encoder-decoder for statistical machine translation. In: Methods in natural language processing (EMNLP), October 25-29, 2014, Doha, Qatar, pp 1724–1734. https://www.aclweb.org/anthology/D14-1179

  7. Gao L, Guo Z, Zhang H, Xu X, Shen HT (2017) Video captioning with attention-based lstm and semantic consistency. IEEE Trans Multimedia 19(9):2045–2055

    Article  Google Scholar 

  8. Heilman M, Smith NA (2010) Tree edit models for recognizing textual entailments, paraphrases, and answers to questions. In: Human language technologies: conference of the North American chapter of the association of computational linguistics, pp 1011–1019

  9. Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735–1780

    Article  Google Scholar 

  10. Hu B, Lu Z, Li H, Chen Q (2014) Convolutional neural network architectures for matching natural language sentences. In: Advances in neural information processing systems, pp 2042–2050

  11. Jain S, Dodiya T (2014) Rule based architecture for medical question answering system. In: Proceedings of the 2nd international conference on soft computing for problem solving (SocProS 2012), pp 1225–1233 Springer

  12. LeCun Y, Chopra S, Hadsell R, Ranzato M, Huang F (2006) A tutorial on energy-based learning. Predicting Structured Data 1:1–59. http://yann.lecun.com/exdb/publis/pdf/lecun-06.pdf

  13. Li S, Zhao Z, Hu R, Li W, Liu T, Du X (2018) Analogical reasoning on chinese morphological and semantic relations. In: Proceedings of the 56th annual meeting of the association for computational linguistics (Volume 2: Short Papers), pp 138–143 Association for Computational Linguistics

  14. Lu H, Li Y, Chen M, Kim H, Serikawa S (2018) Brain intelligence: go beyond artificial intelligence. Mob Netw Appl 23(2):368–375

    Article  Google Scholar 

  15. Lu H, Li Y, Uemura T, Ge Z, Xu X, He L, Serikawa S, Kim H (2017) Fdcnet: filtering deep convolutional network for marine organism classification. Multimed Tools Appl 77(2):1–14

    Google Scholar 

  16. Lu H, Li Y, Uemura T, Kim H, Serikawa S (2018) Low illumination underwater light field images reconstruction using deep convolutional neural networks. Future Generation Computer Systems

  17. Lu W (2018) Word sense disambiguation based on dependency constraint knowledge. Clust Comput, pp 1–9. https://link.springer.com/article/10.1007/s10586-018-1899-3

  18. Lu W, Huang H, Zhu C (2012) Feature words selection for knowledge-based word sense disambiguation with syntactic parsing. Przeglad Elektrotechniczny 88(1b):82–87

    Google Scholar 

  19. Lu W, Wu H, Jian P, Huang Y, Huang H (2018) An empirical study of classifier combination based word sense disambiguation. IEICE Trans Inf Syst 101(1):225–233

    Article  Google Scholar 

  20. Mihalcea R, Textrank PT (2004) Bringing order into text. In: Proceedings of the 2004 conference on empirical methods in natural language processing

  21. Moschitti A, Quarteroni S (2011) Linguistic kernels for answer re-ranking in question answering systems. Inf Process Manag 47(6):825–842

    Article  Google Scholar 

  22. Qiu X, Huang X (2015) Convolutional neural tensor network architecture for community-based question answering. In: Proceedings of international joint conferences on artificial intelligence, pp 1305–1311

  23. Robertson S, Zaragoza H et al (2009) The probabilistic relevance framework: Bm25 and beyond. Foundations and Trends® in Information Retrieval 3(4):333–389

    Article  Google Scholar 

  24. Tan M, dos Santos C, Xiang B, Zhou B (2015) Lstm-based deep learning models for non-factoid answer selection. arXiv:1511.04108

  25. Toba H, Ming Z-Y, Adriani M, Chua T-S (2014) Discovering high quality answers in community question answering archives using a hierarchy of classifiers. Inf Sci 261:101–115

    Article  MathSciNet  Google Scholar 

  26. Tymoshenko K, Bonadiman D, Moschitti A (2016) Convolutional neural networks vs. convolution kernels: feature engineering for answer sentence reranking. In: Proceedings of the 2016 conference of the North American chapter of the association for computational linguistics: human language technologies, pp 1268–1278

  27. Wang B, Niu J, Ma L, Zhang Y, Zhang L, Li J, Zhang P, Song D (2016) A chinese question answering approach integrating count-based and embedding-based features. In: Natural language understanding and intelligent applications, pp 934–941. Springer

  28. Wang J, Man C, Zhao Y, Wang F (2016) An answer recommendation algorithm for medical community question answering systems. In: IEEE International Conference on Service Operations and Logistics, and Informatics, pp 139–144

  29. Wang S, Cao L (2017) Inferring implicit rules by learning explicit and hidden item dependency. IEEE Transactions on Systems, Man, and Cybernetics: Systems. https://ieeexplore.ieee.org/abstract/document/8194854

  30. Wang S, Hu L, Cao L, Huang X, Lian D, Liu W (2018) Attention-based transactional context embedding for next-item recommendation

  31. Wang S, Liu W, Wu J, Cao L, Meng Q, Kennedy PJ (2016) Training deep neural networks on imbalanced data sets. In: International joint conference on neural networks (IJCNN), pp 4368–4374. IEEE, p 2016

  32. Xiang L, Yu J, Yang C, Zeng D, Shen X (2018) A word-embedding-based steganalysis method for linguistic steganography via synonym substitution. IEEE Access 6:64131–64141

    Article  Google Scholar 

  33. Xu X, He L, Lu H, Gao L, Ji Y (2018) Deep adversarial metric learning for cross-modal retrieval. World Wide Web, pp 1–16. https://link.springer.com/article/10.1007/s11280-018-0541-x

  34. Xu X, He L, Shimada A, Taniguchi R-I, Lu Hu (2016) Learning unified binary codes for cross-modal retrieval via latent semantic hashing. Neurocomputing 213:191–203

    Article  Google Scholar 

  35. Xu X, Shen F, Shen HT, Li X (2017) Learning discriminative binary codes for large-scale cross-modal retrieval. IEEE Trans Image Process Publication IEEE Signal Process Soc 26(5):2494–2507

    Article  MathSciNet  Google Scholar 

  36. Xu X, Song J, Lu H, He L, Yang Y, Shen F (2018) Dual learning for visual question generation. In: IEEE international conference on multimedia and expo (ICME), pp 1–6. IEEE

  37. Yao X, Van Durme B, Callison-Burch C, Clark P (2013) Answer extraction as sequence tagging with tree edit distance. In: Proceedings of the 2013 conference of the North American chapter of the association for computational linguistics: human language technologies, pp 858–867

  38. Yen S-J, Wu Y-C, Yang J-C, Lee Y-S, Lee C-J, Liu J-J (2013) A support vector machine-based context-ranking model for question answering. Inf Sci 224:77–87

    Article  Google Scholar 

  39. Yin W, Schütze H, Xiang B, Zhou B (2015) Abcnn: Attention-based convolutional neural network for modeling sentence pairs. arXiv:1512.05193

  40. Yu H, Lee M, Kaufman D, Ely J, Osheroff JA, Hripcsak G, Cimino J (2007) Development, implementation, and a cognitive evaluation of a definitional question answering system for physicians. J Biomed Inform 40(3):236–251

    Article  Google Scholar 

  41. Yuan L, Yuan A, Hasan S (2017) Improving clinical diagnosis inference through integration of structured and unstructured knowledge. In: Proceedings of the 1st workshop on sense, concept and entity representations and their applications, pp 31–36

  42. Zhang M, Zhang Y, Che W, Liu T (2014) Character-level chinese dependency parsing. In: Proceedings of the 52nd annual meeting of the association for computational linguistics (Volume 1: Long Papers), vol 1, pp 1326–1336

  43. Zhang S, Zhang X, Wang H, Cheng J, Li P, Ding Z (2017) Chinese medical question answer matching using end-to-end character-level multi-scale cnns. Appl Sci 7(8):767

    Article  Google Scholar 

  44. Zhou Q, Yang W, Gao G, Ou W, Lu H, Chen J, Latecki LJ (2018) Multi-scale deep context convolutional neural networks for semantic segmentation. World Wide Web, pp 1–16. https://link.springer.com/article/10.1007/s11280-018-0556-3

  45. Zhou Q, Zheng B, Zhu W, Latecki LJ (2016) Multi-scale context for scene labeling via flexible segmentation graph. Pattern Recogn 59:312–324

    Article  Google Scholar 

Download references

Acknowledgements

The research work is supported by the National Key R&D Program of China under Grant No.2018YFC0831704, National Nature Science Foundation of China under Grant No.61502259 and No.61762021, Natural Science Foundation of Guizhou Province under Grant No.2017[1130], Key Subjects Construction of Guizhou Province under Grant No.ZDXK[2016]8, Natural Science Foundation of Shandong Province under Grant No.ZR2017MF056. Taishan Scholar Program of Shandong Province in China (Directed by Prof. Yinglong Wang)

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Wenpeng Lu.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhang, Y., Lu, W., Ou, W. et al. Chinese medical question answer selection via hybrid models based on CNN and GRU. Multimed Tools Appl 79, 14751–14776 (2020). https://doi.org/10.1007/s11042-019-7240-1

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-019-7240-1

Keywords

Navigation