Abstract
When the neural network model is applied to solve the question retrieval task of community question and answer, it needs a large corpus and long retrieval time. To address these problems, this paper proposes a two-stage question retrieval algorithm. In the second stage, the multi-feature fusion method is adopted to comprehensively judge the retrieved results according to the similarity of the query sentence to the candidate question sentence in lexical features and semantic features, as well as the answer quality features in the candidate answers. Experimental results ranked second with 78.3 on SemEval-2016 Task3 test set and ranked first with 48.20 on SemEval-2017 Task3 test set and and only took 500 ms to get the results from 1000 pieces of data. These results show that this algorithm can significantly improve the question retrieval effect while ensuring the retrieval efficiency.



Similar content being viewed by others
Data availability
The data used to support the findings of this study are available from the corresponding author upon request.
References
AlZu’bi S, Alsmadiv A, AlQatawneh S, et al. (2019) A brief analysis of amazon online reviews[C]//2019 sixth international conference on social networks analysis, management and security (SNAMS). IEEE, 555–560
Alzubi S, Hawashin B, Mughaid A, et al. (2020) Whats trending? An efficient trending research topics extractor and recommender[C]//2020 11th international conference on information and communication systems (ICICS). IEEE, 191–196
Charlet D, Damnati G (2017) Simbow at semeval-2017 task 3: Soft-cosine semantic similarity between questions for community question answering[C]//Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017), 315–319
Chen Q, Zhu X, Ling ZH, et al. (2017) Enhanced LSTM for Natural Language Inference[A]. Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)[C], 1657–1668
Employing Deep Learning Methods for Predicting Helpful Reviews (n.d.)
Franco-Salvador M, Kar S, Solorio T, et al. (2018) Uh-prhlt at semeval-2016 task 3: Combining lexical and semantic-based features for community question answering[J]. arXiv preprint arXiv:1807.11584
Goyal N (2017) Learningtoquestion at semeval 2017 task 3: Ranking similar questions by learning to rank using rich features[C]//Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017), 310–314
Guo J, Fan Y, Ai Q, et al. (2016) A deep relevance matching model for ad-hoc retrieval[C]//Proceedings of the 25th ACM international on conference on information and knowledge management, 55–64
Guo T, Lin T, Lu Y (2018) An interpretable LSTM neural network for autoregressive exogenous model[J]. arXiv preprint arXiv:1804.05251
Guo J, Fan Y, Ji X, et al. (2019) Matchzoo: A learning, practicing, and developing system for neural text matching[C]//Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 1297–1300
Joachims T (2002) Optimizing search engines using clickthrough data[C]//Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, 133–142
Joachims T (2006) Training linear SVMs in linear time[C]//Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining, 217–226
Kingma DP, Ba J (2014) Adam: A method for stochastic optimization[J]. arXiv preprint arXiv:1412.6980
Li H, Xu J (2014) Semantic matching in search[J]. Found Trends Inf Retr 7(5):343–469
Mitra B, Diaz F, Craswell N (2017) Learning to match using local and distributed representations of text for web search[C]//Proceedings of the 26th International Conference on World Wide Web, 1291–1299
Nakov P, Màrquez L, Moschitti A, et al. (2016) SemEval-2016 Task 3: Community Question Answering[A]. Proceedings of the 10th International Workshop on Semantic Evaluation[C], 525–545
Nakov P, Hoogeveen D, Màrquez L, et al. (2017) SemEval-2017 Task 3: Community Question Answering[A]. Proceedings of the 11th International Workshop on Semantic Evaluation, 27–48
Palangi H, Deng L, Shen Y, et al. (2014) Semantic modelling with long-short-term memory for information retrieval[J]. arXiv preprint arXiv:1412.6629
Peters M E, Neumann M, Iyyer M, et al. (2018) Deep contextualized word representations[J]. arXiv preprint arXiv:1802.05365
Robertson S, Zaragoza H (2009) The probabilistic relevance framework: BM25 and beyond[M]. Now Publishers Inc
Shen Y, He X, Gao J, et al. (2014) Learning semantic representations using convolutional neural networks for web search[C]//Proceedings of the 23rd international conference on world wide web, 373–374
Wang Z, Hamza W, Florian R et al. (2017) Bilateral multi-perspective matching for natural language sentences[a]. Twenty-Sixth International Joint Conference on Artificial Intelligence 4144–4150
Wu Q, Burges CJC, Svore KM, Gao J (2010) Adapting boosting for information retrieval measures[J]. Inf Retr 13(3):254–270
Xiong C, Dai Z, Callan J, et al. (2017) End-to-end neural ad-hoc ranking with kernel pooling[C]//Proceedings of the 40th International ACM SIGIR conference on research and development in information retrieval, 55–64
Yu BG, Xu QT (2018) Research on answer selection algorithm based on collaboration attention mechanism[A]. Chinese Academy of Management, Fudan Premium Fund of Management. Proceedings of the 13th (2018) China management conference[C]. Chinese Academy of Management, Fudan Premium Fund of Management: Chinese Academy of Management, 7
Zhang K, Wu W, Wu H, et al. (2014) Question retrieval with high quality answers in community question answering[C]//Proceedings of the 23rd ACM international conference on conference on information and knowledge management, 371–380
Zhou G, Cai L, Zhao J, et al. (2011) Phrase-based translation model for question retrieval in community question answer archives[C]//Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 653–662
Funding
Supported by the National Natural Science Funds (No.62041305 and No.62072053) and Xizang Natural Science Foundation (No.XZ202001ZR0065G).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Disclosures
The authors declare no conflicts of interest.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Sun, Y., Song, J., Song, X. et al. Research on question retrieval method for community question answering. Multimed Tools Appl 82, 24309–24325 (2023). https://doi.org/10.1007/s11042-023-14458-2
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-023-14458-2