skip to main content
10.1145/1772690.1772712acmotherconferencesArticle/Chapter ViewAbstractPublication PageswwwConference Proceedingsconference-collections
research-article

A generalized framework of exploring category information for question retrieval in community question answer archives

Published:26 April 2010Publication History

ABSTRACT

Community Question Answering (CQA) has emerged as a popular type of service where users ask and answer questions and access historical question-answer pairs. CQA archives contain very large volumes of questions organized into a hierarchy of categories. As an essential function of CQA services, question retrieval in a CQA archive aims to retrieve historical question-answer pairs that are relevant to a query question. In this paper, we present a new approach to exploiting category information of questions for improving the performance of question retrieval, and we apply the approach to existing question retrieval models, including a state-of-the-art question retrieval model. Experiments conducted on real CQA data demonstrate that the proposed techniques are capable of outperforming a variety of baseline methods significantly.

References

  1. E. Agichtein, C. Castillo, D. Donato, A. Gionis, and G. Mishne. Finding high-quality content in social media. In WSDM, pp. 183--194, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. A. Berger, R. Caruana, D. Cohn, D. Freitag, and V. Mittal. Bridging the lexical chasm: statistical approaches to answer-finding. In SIGIR, pp. 192--199, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. J. Bian, Y. Liu, E. Agichtein, and H. Zha. Finding the right facts in the crowd: factoid question answering over social media. In WWW, pp. 467--476, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. R. D. Burke, K. J. Hammond, V. A. Kulyukin, S. L. Lytinen, N. Tomuro, and S. Schoenberg. Question answering from frequently asked question files: Experiences with the faq finder system. AI Magazine, 18(2):57--66, 1997.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. X. Cao, G. Cong, B. Cui, C. S. Jensen, and C. Zhang. The use of categorization information in language models for question retrieval. In CIKM, pp. 265--274, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. C. Chekuri, M. H. Goldwasser, P. Raghavan, and E. Upfal. Web search using automatic classification. In WWW, 1997.Google ScholarGoogle Scholar
  7. H. Duan, Y. Cao, C.-Y. Lin, and Y. Yu. Searching questions by identifying question topic and question focus. In ACL-HLT, pp. 156--164, 2008.Google ScholarGoogle Scholar
  8. H. Fang, T. Tao, and C. Zhai. A formal study of information retrieval heuristics. In SIGIR, pp. 49--56, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. J. Jeon, W. B. Croft, and J. H. Lee. Finding semantically similar questions based on their answers. In SIGIR, pp. 617--618, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. J. Jeon, W. B. Croft, and J. H. Lee. Finding similar questions in large question and answer archives. In CIKM, pp. 84--90,2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. V. Jijkoun and M. de Rijke. Retrieving answers from frequently asked questions pages on the web. In CIKM, pp. 76--83, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. W. Lam, M. Ruiz, and P. Srinivasan. Automatic text categorization and its application to text retrieval. IEEE TKDE, 11(6):865--879, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Y. Liu, J. Bian, and E. Agichtein. Predicting information seeker satisfaction in community question answering. In SIGIR, pp. 483--490, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. S. Riezler, A. Vasserman, I. Tsochantaridis, V. O. Mittal, and Y. Liu. Statistical machine translation for query expansion in answer retrieval. In ACL, pp. 464--471, 2007.Google ScholarGoogle Scholar
  15. S. Robertson, S. Walker, S. Jones, M. Hancock-Beaulieu, and M. Gatford. Okapi at trec-3. In TREC, pp. 109--126, 1994.Google ScholarGoogle Scholar
  16. A. Singhal, C. Buckley, and M. Mitra. Pivoted document length normalization. In SIGIR, pp. 21--29, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. R. Soricut and E. Brill. Automatic question answering: Beyond the factoid. In HLT-NAACL, pp. 57--64, 2004.Google ScholarGoogle Scholar
  18. K. Wang, Z. Ming, and T.-S. Chua. A syntactic tree matching approach to finding similar questions in community-based qa services. In SIGIR, pp. 187--194, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. X. Xue, J. Jeon, and W. B. Croft. Retrieval models for question and answer archives. In SIGIR, pp. 475--482, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. C. Zhai and J. Lafferty. A study of smoothing methods for language models applied to information retrieval. ACM TOIS, 22(2):179--214, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. J. Zobel and A. Moffat. Inverted files for text search engines. ACM Computing Surveys, 38(2), 56 pages, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. A generalized framework of exploring category information for question retrieval in community question answer archives

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      WWW '10: Proceedings of the 19th international conference on World wide web
      April 2010
      1407 pages
      ISBN:9781605587998
      DOI:10.1145/1772690

      Copyright © 2010 International World Wide Web Conference Committee (IW3C2)

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 26 April 2010

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate1,899of8,196submissions,23%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    ePub

    View this article in ePub.

    View ePub