research-article

A generalized framework of exploring category information for question retrieval in community question answer archives

Authors:
Xin Cao

Aalborg University, Aalborg, Denmark

Aalborg University, Aalborg, Denmark
View Profile

,
Gao Cong

Aalborg University, Aalborg, Denmark

Aalborg University, Aalborg, Denmark
View Profile

,
Bin Cui

Peking University, Beijing, China

Peking University, Beijing, China
View Profile

,
Christian S. Jensen

Aalborg University, Aalborg, Denmark

Aalborg University, Aalborg, Denmark
View Profile

WWW '10: Proceedings of the 19th international conference on World wide webApril 2010Pages 201–210https://doi.org/10.1145/1772690.1772712

Published:26 April 2010Publication History

WWW '10: Proceedings of the 19th international conference on World wide web

Pages 201–210

ABSTRACT

Community Question Answering (CQA) has emerged as a popular type of service where users ask and answer questions and access historical question-answer pairs. CQA archives contain very large volumes of questions organized into a hierarchy of categories. As an essential function of CQA services, question retrieval in a CQA archive aims to retrieve historical question-answer pairs that are relevant to a query question. In this paper, we present a new approach to exploiting category information of questions for improving the performance of question retrieval, and we apply the approach to existing question retrieval models, including a state-of-the-art question retrieval model. Experiments conducted on real CQA data demonstrate that the proposed techniques are capable of outperforming a variety of baseline methods significantly.

References

E. Agichtein, C. Castillo, D. Donato, A. Gionis, and G. Mishne. Finding high-quality content in social media. In WSDM, pp. 183--194, 2008. Google ScholarDigital Library
A. Berger, R. Caruana, D. Cohn, D. Freitag, and V. Mittal. Bridging the lexical chasm: statistical approaches to answer-finding. In SIGIR, pp. 192--199, 2000. Google ScholarDigital Library
J. Bian, Y. Liu, E. Agichtein, and H. Zha. Finding the right facts in the crowd: factoid question answering over social media. In WWW, pp. 467--476, 2008. Google ScholarDigital Library
R. D. Burke, K. J. Hammond, V. A. Kulyukin, S. L. Lytinen, N. Tomuro, and S. Schoenberg. Question answering from frequently asked question files: Experiences with the faq finder system. AI Magazine, 18(2):57--66, 1997.Google ScholarDigital Library
X. Cao, G. Cong, B. Cui, C. S. Jensen, and C. Zhang. The use of categorization information in language models for question retrieval. In CIKM, pp. 265--274, 2009. Google ScholarDigital Library
C. Chekuri, M. H. Goldwasser, P. Raghavan, and E. Upfal. Web search using automatic classification. In WWW, 1997.Google Scholar
H. Duan, Y. Cao, C.-Y. Lin, and Y. Yu. Searching questions by identifying question topic and question focus. In ACL-HLT, pp. 156--164, 2008.Google Scholar
H. Fang, T. Tao, and C. Zhai. A formal study of information retrieval heuristics. In SIGIR, pp. 49--56, 2004. Google ScholarDigital Library
J. Jeon, W. B. Croft, and J. H. Lee. Finding semantically similar questions based on their answers. In SIGIR, pp. 617--618, 2005. Google ScholarDigital Library
J. Jeon, W. B. Croft, and J. H. Lee. Finding similar questions in large question and answer archives. In CIKM, pp. 84--90,2005. Google ScholarDigital Library
V. Jijkoun and M. de Rijke. Retrieving answers from frequently asked questions pages on the web. In CIKM, pp. 76--83, 2005. Google ScholarDigital Library
W. Lam, M. Ruiz, and P. Srinivasan. Automatic text categorization and its application to text retrieval. IEEE TKDE, 11(6):865--879, 1999. Google ScholarDigital Library
Y. Liu, J. Bian, and E. Agichtein. Predicting information seeker satisfaction in community question answering. In SIGIR, pp. 483--490, 2008. Google ScholarDigital Library
S. Riezler, A. Vasserman, I. Tsochantaridis, V. O. Mittal, and Y. Liu. Statistical machine translation for query expansion in answer retrieval. In ACL, pp. 464--471, 2007.Google Scholar
S. Robertson, S. Walker, S. Jones, M. Hancock-Beaulieu, and M. Gatford. Okapi at trec-3. In TREC, pp. 109--126, 1994.Google Scholar
A. Singhal, C. Buckley, and M. Mitra. Pivoted document length normalization. In SIGIR, pp. 21--29, 1996. Google ScholarDigital Library
R. Soricut and E. Brill. Automatic question answering: Beyond the factoid. In HLT-NAACL, pp. 57--64, 2004.Google Scholar
K. Wang, Z. Ming, and T.-S. Chua. A syntactic tree matching approach to finding similar questions in community-based qa services. In SIGIR, pp. 187--194, 2009. Google ScholarDigital Library
X. Xue, J. Jeon, and W. B. Croft. Retrieval models for question and answer archives. In SIGIR, pp. 475--482, 2008. Google ScholarDigital Library
C. Zhai and J. Lafferty. A study of smoothing methods for language models applied to information retrieval. ACM TOIS, 22(2):179--214, 2004. Google ScholarDigital Library
J. Zobel and A. Moffat. Inverted files for text search engines. ACM Computing Surveys, 38(2), 56 pages, 2006. Google ScholarDigital Library

Index Terms

A generalized framework of exploring category information for question retrieval in community question answer archives
1. Information systems
  1. Information retrieval

Recommendations

Question Retrieval with High Quality Answers in Community Question Answering
CIKM '14: Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management

This paper studies the problem of question retrieval in community question answering (CQA). To bridge lexical gaps in questions, which is regarded as the biggest challenge in retrieval, state-of-the-art methods learn translation models using answers ...
Read More
The use of categorization information in language models for question retrieval
CIKM '09: Proceedings of the 18th ACM conference on Information and knowledge management

Community Question Answering (CQA) has emerged as a popular type of service meeting a wide range of information needs. Such services enable users to ask and answer questions and to access existing question-answer pairs. CQA archives contain very large ...
Read More
Approaches to Exploring Category Information for Question Retrieval in Community Question-Answer Archives

Community Question Answering (CQA) is a popular type of service where users ask questions and where answers are obtained from other users or from historical question-answer pairs. CQA archives contain large volumes of questions organized into a ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
WWW '10: Proceedings of the 19th international conference on World wide web
April 2010
1407 pages
ISBN:9781605587998
DOI:10.1145/1772690
General Chairs:
Michael Rappa
North Carolina State University, USA
,
Paul Jones
University of North Carolina at Chapel Hill, USA
,
Program Chairs:
Juliana Freire
University of Utah, USA
,
Soumen Chakrabarti
Indian Institute of Technology, India
Copyright © 2010 International World Wide Web Conference Committee (IW3C2)
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 26 April 2010
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
categorization
question retrieval
question-answering services
Qualifiers
- research-article
Conference

Acceptance Rates
Overall Acceptance Rate1,899of8,196submissions,23%
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 95
  Total Citations
  View Citations
- 1,079
  Total Downloads
- Downloads (Last 12 months)13
- Downloads (Last 6 weeks)2
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

ePub

View this article in ePub.

View ePub

A generalized framework of exploring category information for question retrieval in community question answer archives

WWW '10: Proceedings of the 19th international conference on World wide web

ABSTRACT

References

Cited By

Index Terms

Recommendations

Question Retrieval with High Quality Answers in Community Question Answering

The use of categorization information in language models for question retrieval

Approaches to Exploring Category Information for Question Retrieval in Community Question-Answer Archives