ABSTRACT
Community Question Answering (CQA) has emerged as a popular type of service where users ask and answer questions and access historical question-answer pairs. CQA archives contain very large volumes of questions organized into a hierarchy of categories. As an essential function of CQA services, question retrieval in a CQA archive aims to retrieve historical question-answer pairs that are relevant to a query question. In this paper, we present a new approach to exploiting category information of questions for improving the performance of question retrieval, and we apply the approach to existing question retrieval models, including a state-of-the-art question retrieval model. Experiments conducted on real CQA data demonstrate that the proposed techniques are capable of outperforming a variety of baseline methods significantly.
- E. Agichtein, C. Castillo, D. Donato, A. Gionis, and G. Mishne. Finding high-quality content in social media. In WSDM, pp. 183--194, 2008. Google ScholarDigital Library
- A. Berger, R. Caruana, D. Cohn, D. Freitag, and V. Mittal. Bridging the lexical chasm: statistical approaches to answer-finding. In SIGIR, pp. 192--199, 2000. Google ScholarDigital Library
- J. Bian, Y. Liu, E. Agichtein, and H. Zha. Finding the right facts in the crowd: factoid question answering over social media. In WWW, pp. 467--476, 2008. Google ScholarDigital Library
- R. D. Burke, K. J. Hammond, V. A. Kulyukin, S. L. Lytinen, N. Tomuro, and S. Schoenberg. Question answering from frequently asked question files: Experiences with the faq finder system. AI Magazine, 18(2):57--66, 1997.Google ScholarDigital Library
- X. Cao, G. Cong, B. Cui, C. S. Jensen, and C. Zhang. The use of categorization information in language models for question retrieval. In CIKM, pp. 265--274, 2009. Google ScholarDigital Library
- C. Chekuri, M. H. Goldwasser, P. Raghavan, and E. Upfal. Web search using automatic classification. In WWW, 1997.Google Scholar
- H. Duan, Y. Cao, C.-Y. Lin, and Y. Yu. Searching questions by identifying question topic and question focus. In ACL-HLT, pp. 156--164, 2008.Google Scholar
- H. Fang, T. Tao, and C. Zhai. A formal study of information retrieval heuristics. In SIGIR, pp. 49--56, 2004. Google ScholarDigital Library
- J. Jeon, W. B. Croft, and J. H. Lee. Finding semantically similar questions based on their answers. In SIGIR, pp. 617--618, 2005. Google ScholarDigital Library
- J. Jeon, W. B. Croft, and J. H. Lee. Finding similar questions in large question and answer archives. In CIKM, pp. 84--90,2005. Google ScholarDigital Library
- V. Jijkoun and M. de Rijke. Retrieving answers from frequently asked questions pages on the web. In CIKM, pp. 76--83, 2005. Google ScholarDigital Library
- W. Lam, M. Ruiz, and P. Srinivasan. Automatic text categorization and its application to text retrieval. IEEE TKDE, 11(6):865--879, 1999. Google ScholarDigital Library
- Y. Liu, J. Bian, and E. Agichtein. Predicting information seeker satisfaction in community question answering. In SIGIR, pp. 483--490, 2008. Google ScholarDigital Library
- S. Riezler, A. Vasserman, I. Tsochantaridis, V. O. Mittal, and Y. Liu. Statistical machine translation for query expansion in answer retrieval. In ACL, pp. 464--471, 2007.Google Scholar
- S. Robertson, S. Walker, S. Jones, M. Hancock-Beaulieu, and M. Gatford. Okapi at trec-3. In TREC, pp. 109--126, 1994.Google Scholar
- A. Singhal, C. Buckley, and M. Mitra. Pivoted document length normalization. In SIGIR, pp. 21--29, 1996. Google ScholarDigital Library
- R. Soricut and E. Brill. Automatic question answering: Beyond the factoid. In HLT-NAACL, pp. 57--64, 2004.Google Scholar
- K. Wang, Z. Ming, and T.-S. Chua. A syntactic tree matching approach to finding similar questions in community-based qa services. In SIGIR, pp. 187--194, 2009. Google ScholarDigital Library
- X. Xue, J. Jeon, and W. B. Croft. Retrieval models for question and answer archives. In SIGIR, pp. 475--482, 2008. Google ScholarDigital Library
- C. Zhai and J. Lafferty. A study of smoothing methods for language models applied to information retrieval. ACM TOIS, 22(2):179--214, 2004. Google ScholarDigital Library
- J. Zobel and A. Moffat. Inverted files for text search engines. ACM Computing Surveys, 38(2), 56 pages, 2006. Google ScholarDigital Library
Index Terms
- A generalized framework of exploring category information for question retrieval in community question answer archives
Recommendations
Question Retrieval with High Quality Answers in Community Question Answering
CIKM '14: Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge ManagementThis paper studies the problem of question retrieval in community question answering (CQA). To bridge lexical gaps in questions, which is regarded as the biggest challenge in retrieval, state-of-the-art methods learn translation models using answers ...
The use of categorization information in language models for question retrieval
CIKM '09: Proceedings of the 18th ACM conference on Information and knowledge managementCommunity Question Answering (CQA) has emerged as a popular type of service meeting a wide range of information needs. Such services enable users to ask and answer questions and to access existing question-answer pairs. CQA archives contain very large ...
Approaches to Exploring Category Information for Question Retrieval in Community Question-Answer Archives
Community Question Answering (CQA) is a popular type of service where users ask questions and where answers are obtained from other users or from historical question-answer pairs. CQA archives contain large volumes of questions organized into a ...
Comments