A Semantic Expansion-Based Joint Model for Answer Ranking in Chinese Question Answering Systems

Xie, Wenxiu; Wong, Leung-Pun; Lee, Lap-Kei; Au, Oliver; Hao, Tianyong

doi:10.1007/978-3-030-42835-8_3

Conference paper
First Online: 27 February 2020

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 12004)

Abstract

Answer ranking is one of the essential steps in open-domain question answering systems. The ranking of the retrieved answers directly affects user satisfaction. This paper proposes a new joint model for answer ranking that leverages contextual semantic features and balances question-answer similarities with answer ranking scores. A publicly available dataset from Sogou Lab, containing 40,000 Chinese questions and 369,919 corresponding answer passages, is used for the experiments. Evaluation of the joint model shows a Precision@1 of 72.6%, which outperforms the state-of-the-art baseline methods.
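For readers unfamiliar with the metric reported in the abstract, the following is a minimal sketch of how Precision@1 is commonly computed: the fraction of questions whose top-ranked candidate is labelled correct. The function name `precision_at_1`, the input layout, and the toy scores are assumptions made for illustration; they are not taken from the paper, and the authors' evaluation script may differ (for example, in how ties or questions without candidates are handled).

```python
from typing import Dict, List, Tuple


def precision_at_1(ranked: Dict[str, List[Tuple[float, bool]]]) -> float:
    """Fraction of questions whose highest-scored candidate is labelled correct.

    `ranked` maps a question id to (model_score, is_correct) pairs.
    This layout is hypothetical and chosen only for the example.
    """
    if not ranked:
        raise ValueError("no questions to evaluate")
    hits = 0
    for candidates in ranked.values():
        if not candidates:
            continue  # a question with no candidates counts as a miss
        # Pick the candidate with the highest model score.
        _, top_is_correct = max(candidates, key=lambda c: c[0])
        hits += int(top_is_correct)
    return hits / len(ranked)


# Toy data (invented): four questions, three with a correct top answer.
toy = {
    "q1": [(0.9, True), (0.4, False)],
    "q2": [(0.2, True), (0.7, False)],
    "q3": [(0.6, False), (0.8, True), (0.1, False)],
    "q4": [(0.5, True)],
}
print(precision_at_1(toy))  # 0.75
```

Under this definition, a Precision@1 of 72.6% would mean that the top-ranked passage is a correct answer for roughly 73 out of every 100 questions.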

Acknowledgements

This work was supported by the National Natural Science Foundation of China (No. 61772146), the OUHK 2018/19 S&T School Research Fund (R5077), and the Natural Science Foundation of Guangdong Province (2018A030310051).

Author information

Authors and Affiliations

Department of Linguistics and Translation, City University of Hong Kong, Hong Kong, China
Wenxiu Xie
School of Science and Technology, The Open University of Hong Kong, Hong Kong, China
Leung-Pun Wong, Lap-Kei Lee & Oliver Au
School of Computer Science, South China Normal University, Guangzhou, China
Tianyong Hao

Corresponding author

Correspondence to Tianyong Hao.

Editor information

Editors and Affiliations

Open University of Hong Kong, Hong Kong, China
Fu Lee Wang
The Education University of Hong Kong, Hong Kong, China
Haoran Xie
Chinese University of Hong Kong, Hong Kong, China
Wai Lam
Nanyang Technological University, Singapore, Singapore
Aixin Sun
Institute of Information Science, Academia Sinica, Taipei, Taiwan
Lun-Wei Ku
South China Normal University, Guangzhou, China
Tianyong Hao
Chinese Academy of Agricultural Sciences, Beijing, China
Wei Chen
Douglas College, New Westminster, BC, Canada
Tak-Lam Wong
University of Southern Queensland, Toowoomba, QLD, Australia
Xiaohui Tao

About this paper

Cite this paper

Xie, W., Wong, LP., Lee, LK., Au, O., Hao, T. (2020). A Semantic Expansion-Based Joint Model for Answer Ranking in Chinese Question Answering Systems. In: Wang, F., et al. Information Retrieval Technology. AIRS 2019. Lecture Notes in Computer Science, vol 12004. Springer, Cham. https://doi.org/10.1007/978-3-030-42835-8_3

DOI: https://doi.org/10.1007/978-3-030-42835-8_3
Published: 27 February 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-42834-1
Online ISBN: 978-3-030-42835-8
eBook Packages: Computer Science, Computer Science (R0)
