skip to main content
10.1145/2983323.2983893acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
short-paper

Learning to Re-Rank Questions in Community Question Answering Using Advanced Features

Published:24 October 2016Publication History

ABSTRACT

We study the impact of different types of features for question ranking in community Question Answering: bag-of-words models (BoW), syntactic tree kernels (TKs) and rank features. It should be noted that structural kernels have never been applied to the question reranking task, i.e., question to question similarity, where they have to model paraphrase relations. Additionally, the informal text, typically present in forums, poses new challenges to the use of TKs. We compare our learning to rank (L2R) algorithms against a strong baseline given by the Google rank (GR). The results show that (i) our shallow structures used in TKs are robust enough to noisy data and (ii) improving GR requires effective BoW features and TKs along with an accurate model of GR features in the used L2R algorithm.

References

  1. L. Allison and T. Dix. A bit-string longest-common-subsequence algorithm. Inf. Process. Lett., 23(6):305--310, Dec. 1986. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. A. Barrón-Cedeno, G. Da San Martino, S. Joty, A. Moschitti, F. Al-Obaidli, S. Romeo, K. Tymoshenko, and A. Uva. ConvKN at SemEval-2016 Task 3: Answer and question selection for question answering on arabic and english fora. In Proceedings of SemEval '16, pages 896--903, San Diego, California, June 2016. ACL.Google ScholarGoogle Scholar
  3. beginflushleftX. Cao, G. Cong, B. Cui, C. S. Jensen, and C. Zhang. The use of categorization information in language models for question retrieval. In CIKM, pages 265--274, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. H. Duan, Y. Cao, C.-Y. Lin, and Y. Yu. Searching questions by identifying question topic and question focus. In ACL, pages 156--164, 2008.Google ScholarGoogle Scholar
  5. S. Filice, D. Croce, A. Moschitti, and R. Basili. Kelp at semeval-2016 task 3: Learning semantic relations between questions and answers. In Proceedings of SemEval '16, pages 1116--1123, San Diego, California, June 2016. ACL.Google ScholarGoogle ScholarCross RefCross Ref
  6. M. Franco-Salvador, S. Kar, T. Solorio, and P. Rosso. UH-PRHLT at SemEval-2016 Task 3: Combining lexical and semantic-based features for community question answering. In Proceedings of SemEval '16, pages 814--821, San Diego, California, June 2016. ACL.Google ScholarGoogle ScholarCross RefCross Ref
  7. T. Joachims. Optimizing search engines using clickthrough data. KDD, pages 133--142, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. C. Lyon, J. Malcolm, and B. Dickerson. Detecting short passages of similar text in large document collections. EMNLP, pages 118--125, 2001.Google ScholarGoogle Scholar
  9. A. Moschitti. Efficient Convolution Kernels for Dependency and Constituent Syntactic Trees. In ECML, pages 318--329. 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. P. Nakov, L. Màrquez, A. Moschitti, W. Magdy, H. Mubarak, A. A. Freihat, J. Glass, and B. Randeree. SemEval-2016 task 3: Community question answering. In Proceedings of SemEval '16. ACL, 2016.Google ScholarGoogle ScholarCross RefCross Ref
  11. A. Severyn and A. Moschitti. Structural relationships for large-scale learning of answer re-ranking. SIGIR, pages 741--750, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. K. Tymoshenko and A. Moschitti. Assessing the impact of syntactic and semantic structures for answer passages reranking. In Proceedings of CIKM '15, pages 1451--1460, New York, NY, USA, 2015. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. K. Wang, Z. Ming, and T.-S. Chua. A syntactic tree matching approach to finding similar questions in community-based qa services. In SIGIR, pages 187--194, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. M. Wise. Yap3: Improved detection of similarities in computer program and other texts. In SIGCSE, pages 130--134, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. G. Zhou, L. Cai, J. Zhao, and K. Liu. Phrase-based translation model for question retrieval in community question answer archives. In ACL, pages 653--662, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Learning to Re-Rank Questions in Community Question Answering Using Advanced Features

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          CIKM '16: Proceedings of the 25th ACM International on Conference on Information and Knowledge Management
          October 2016
          2566 pages
          ISBN:9781450340731
          DOI:10.1145/2983323

          Copyright © 2016 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 24 October 2016

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • short-paper

          Acceptance Rates

          CIKM '16 Paper Acceptance Rate160of701submissions,23%Overall Acceptance Rate1,861of8,427submissions,22%

          Upcoming Conference

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader