Skip to main content

A Semantic Expansion-Based Joint Model for Answer Ranking in Chinese Question Answering Systems

  • Conference paper
  • First Online:
  • 431 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 12004))

Abstract

Answer ranking is one of essential steps in open domain question answering systems. The ranking of the retrieved answers directly affects user satisfaction. This paper proposes a new joint model for answer ranking by leveraging context semantic features, which balances both question-answer similarities and answer ranking scores. A publicly available dataset containing 40,000 Chinese questions and 369,919 corresponding answer passages from Sogou Lab is used for experiments. Evaluation on the joint model shows a Precison@1 of 72.6%, which outperforms the state-of-the-art baseline methods.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    http://www.sogou.com/labs/.

  2. 2.

    http://radimrehurek.com/gensim/similarities/docsim.html.

References

  1. Zhao, Z., Lu, H., Zheng, V.W., Cai, D., He, X., Zhuang, Y.: Community-based question answering via asymmetric multi-faceted ranking network learning. In: The 31th Conference on Artificial Intelligence, pp. 3532–3538 (2017)

    Google Scholar 

  2. Freihat, A.A., Qwaider, M.R.H., Giunchiglia, F.: Using grice maxims in ranking community question answers. In: International Conference on Information, Process, and Knowledge Management. pp. 38–43 (2018)

    Google Scholar 

  3. Zhou, Z.-M., Lan, M., Niu, Z.-Y., Lu, Y.: Exploiting user profile information for answer ranking in cQA. In: The 21st International Conference Companion on World Wide Web, pp. 767–774 (2012)

    Google Scholar 

  4. Li, Z., Huang, J., Zhou, Z., Zhang, H., Chang, S., Huang, Z.: LSTM-based deep learning models for answer ranking. In: 2016 IEEE First International Conference on Data Science in Cyberspace. pp. 90–97 (2016)

    Google Scholar 

  5. Agarwal, A., et al.: Learning to rank for robust question answering. In: The 21st ACM International Conference on Information and Knowledge Management, pp. 833–842 (2012)

    Google Scholar 

  6. Yulianti, E., Chen, R.-C., Scholer, F., Croft, W.B., Sanderson, M.: Ranking documents by answer-passage quality. In: The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, pp. 335–344 (2018)

    Google Scholar 

  7. Ko, J., Nyberg, E., Si, L.: A Probabilistic graphical model for joint answer ranking in question answering. In: The 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 343–350. ACM (2007)

    Google Scholar 

  8. Severyn, A., Nicosia, M., Moschitti, A.: Building structures from classifiers for passage reranking. In: The 22nd ACM International Conference on Information & Knowledge Management, pp. 969–978. ACM (2013)

    Google Scholar 

  9. Moschitti, A., Quarteroni, S., Basili, R., Manandhar, S.: Exploiting syntactic and shallow semantic kernels for question answer classification. In: The 45th Annual Meeting of the Association of Computational Linguistics, pp. 776–783 (2007)

    Google Scholar 

  10. Ko, J., Mitamura, T., Nyberg, E.: Language-independent probabilistic answer ranking for question answering. In: The 45th Annual Meeting of the Association of Computational Linguistics, pp. 784–791 (2007)

    Google Scholar 

  11. Ferrucci, D., et al.: Building watson: an overview of the DeepQA project. AI Mag. 31(3), 59–79 (2010)

    Article  Google Scholar 

  12. Bhowan, U., McCloskey, D.J.: Genetic programming for feature selection and question-answer ranking in IBM watson. In: Machado, P., et al. (eds.) EuroGP 2015. LNCS, vol. 9025, pp. 153–166. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-16501-1_13

    Chapter  Google Scholar 

  13. Bilotti, M.W., Elsas, J., Carbonell, J., Nyberg, E.: rank learning for factoid question answering with linguistic and semantic constraints. In: The 19th ACM International Conference on Information and Knowledge Managementm pp. 459–468 (2010)

    Google Scholar 

  14. Yang, L., et al.: Beyond factoid QA: effective methods for non-factoid answer sentence retrieval. In: Ferro, N., et al. (eds.) ECIR 2016. LNCS, vol. 9626, pp. 115–128. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-30671-1_9

    Chapter  Google Scholar 

  15. Metzler, D., Kanungo, T.: Machine learned sentence selection strategies for query-biased summarization. In: Sigir Learning to Rank Workshopm, pp. 40–47 (2008)

    Google Scholar 

  16. Jeon, J., Croft, W.B., Lee, J.H., Park, S.: A framework to predict the quality of answers with non-textual features. In: The 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 228–235 (2006)

    Google Scholar 

  17. Liu, Y., Bian, J., Agichtein, E.: Predicting information seeker satisfaction in community question answering. In: The 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 483–490 (2008)

    Google Scholar 

  18. Blooma, M.J., Goh, D.H.: A predictive framework for retrieving the best answer. In: The 2008 ACM Symposium on Applied Computing, pp. 1107–1111 (2008)

    Google Scholar 

  19. Shah, C., Pomerantz, J.: Evaluating and predicting answer quality in community QA. In: The 33rd international ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 411–418 (2010)

    Google Scholar 

  20. Liu, M., Liu, Y., Yang, Q.: Predicting best answerers for new questions in community question answering. In: Chen, L., Tang, C., Yang, J., Gao, Y. (eds.) WAIM 2010. LNCS, vol. 6184, pp. 127–138. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-14246-8_15

    Chapter  Google Scholar 

  21. Bian, J., Agichtein, E., Liu, Y., Zha, H.: Finding the right facts in the crowd: factoid question answering over social media categories and subject descriptors. In: The 17th International Conference on World Wide Web, pp. 467–476 (2008)

    Google Scholar 

  22. Severyn, A., Nicosia, M., Moschitti, A.: Learning adaptable patterns for passage reranking. In: The Seventeenth Conference on Computational Natural Language Learning, pp. 75–83 (2013)

    Google Scholar 

  23. Severyn, A., Moschitti, A.: Automatic feature engineering for answer selection and extraction. In: Empirical Methods in Natural Language Processing, pp. 458–467 (2013)

    Google Scholar 

  24. Tymoshenko, K., Bonadiman, D., Moschitti, A.: Convolutional neural networks vs convolution kernels : feature engineering for answer sentence reranking. In: North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 1268–1278 (2016)

    Google Scholar 

  25. Severyn, A., Moschitti, A.: Learning to rank short text pairs with convolutional deep neural networks. In: The 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 373–382 (2015)

    Google Scholar 

  26. Shen, Y., et al.: Knowledge-aware attentive neural network for ranking question answer pairs. In: International ACM SIGIR Conference on Research & Development in Information Retrieval, pp. 901–904 (2018)

    Google Scholar 

  27. Cohen, D., Croft, W.B.: A hybrid embedding approach to noisy answer passage retrieval. In: Pasi, G., Piwowarski, B., Azzopardi, L., Hanbury, A. (eds.) ECIR 2018. LNCS, vol. 10772, pp. 127–140. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-76941-7_10

    Chapter  Google Scholar 

  28. Mikolov, T., Chen, K., Corrado, G., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems, pp. 3111–3119 (2013)

    Google Scholar 

  29. Wang, H., Xi, H.: Chinese synonyms toolkit (2017). https://github.com/huyingxi/Synonyms

  30. Garcia, E.: Latent Semantic Indexing (LSI) A Fast Track Tutorial. Grossman and Frieders Information Retrieval, Algorithms and Heuristics (2006)

    Google Scholar 

  31. Cao, Y., Huang, T., Tian, Y.: A ranking SVM based fusion model for cross-media meta-search engine. J. Zhejiang Univ. Sci. C 11(11), 903–910 (2010)

    Article  Google Scholar 

  32. Joachims, T.: Training linear SVMs in linear time. In: The 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 217–226 (2006)

    Google Scholar 

Download references

Acknowledgements

This work was supported by National Natural Science Foundation of China (No.61772146), the OUHK 2018/19 S&T School Research Fund (R5077), and Natural Science Foundation of Guangdong Province (2018A030310051).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tianyong Hao .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Xie, W., Wong, LP., Lee, LK., Au, O., Hao, T. (2020). A Semantic Expansion-Based Joint Model for Answer Ranking in Chinese Question Answering Systems. In: Wang, F., et al. Information Retrieval Technology. AIRS 2019. Lecture Notes in Computer Science(), vol 12004. Springer, Cham. https://doi.org/10.1007/978-3-030-42835-8_3

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-42835-8_3

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-42834-1

  • Online ISBN: 978-3-030-42835-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics