short-paper

Learning to Re-Rank Questions in Community Question Answering Using Advanced Features

Authors:
Giovanni Da San Martino

Hamad bin Khalifa University, Doha, Qatar

Hamad bin Khalifa University, Doha, Qatar
View Profile

,
Alberto Barrón Cedeño

Hamad bin Khalifa University, Doha, Qatar

Hamad bin Khalifa University, Doha, Qatar
View Profile

,
Salvatore Romeo

Hamad bin Khalifa University, Doha, Qatar

Hamad bin Khalifa University, Doha, Qatar
View Profile

,
Antonio Uva

University of Trento, Trento, Italy

University of Trento, Trento, Italy
View Profile

,
Alessandro Moschitti

Hamad bin Khalifa University, Doha, Qatar

Hamad bin Khalifa University, Doha, Qatar
View Profile

CIKM '16: Proceedings of the 25th ACM International on Conference on Information and Knowledge ManagementOctober 2016Pages 1997–2000https://doi.org/10.1145/2983323.2983893

Published:24 October 2016Publication History

CIKM '16: Proceedings of the 25th ACM International on Conference on Information and Knowledge Management

Pages 1997–2000

ABSTRACT

We study the impact of different types of features for question ranking in community Question Answering: bag-of-words models (BoW), syntactic tree kernels (TKs) and rank features. It should be noted that structural kernels have never been applied to the question reranking task, i.e., question to question similarity, where they have to model paraphrase relations. Additionally, the informal text, typically present in forums, poses new challenges to the use of TKs. We compare our learning to rank (L2R) algorithms against a strong baseline given by the Google rank (GR). The results show that (i) our shallow structures used in TKs are robust enough to noisy data and (ii) improving GR requires effective BoW features and TKs along with an accurate model of GR features in the used L2R algorithm.

References

L. Allison and T. Dix. A bit-string longest-common-subsequence algorithm. Inf. Process. Lett., 23(6):305--310, Dec. 1986. Google ScholarDigital Library
A. Barrón-Cedeno, G. Da San Martino, S. Joty, A. Moschitti, F. Al-Obaidli, S. Romeo, K. Tymoshenko, and A. Uva. ConvKN at SemEval-2016 Task 3: Answer and question selection for question answering on arabic and english fora. In Proceedings of SemEval '16, pages 896--903, San Diego, California, June 2016. ACL.Google Scholar
beginflushleftX. Cao, G. Cong, B. Cui, C. S. Jensen, and C. Zhang. The use of categorization information in language models for question retrieval. In CIKM, pages 265--274, 2009. Google ScholarDigital Library
H. Duan, Y. Cao, C.-Y. Lin, and Y. Yu. Searching questions by identifying question topic and question focus. In ACL, pages 156--164, 2008.Google Scholar
S. Filice, D. Croce, A. Moschitti, and R. Basili. Kelp at semeval-2016 task 3: Learning semantic relations between questions and answers. In Proceedings of SemEval '16, pages 1116--1123, San Diego, California, June 2016. ACL.Google ScholarCross Ref
M. Franco-Salvador, S. Kar, T. Solorio, and P. Rosso. UH-PRHLT at SemEval-2016 Task 3: Combining lexical and semantic-based features for community question answering. In Proceedings of SemEval '16, pages 814--821, San Diego, California, June 2016. ACL.Google ScholarCross Ref
T. Joachims. Optimizing search engines using clickthrough data. KDD, pages 133--142, 2002. Google ScholarDigital Library
C. Lyon, J. Malcolm, and B. Dickerson. Detecting short passages of similar text in large document collections. EMNLP, pages 118--125, 2001.Google Scholar
A. Moschitti. Efficient Convolution Kernels for Dependency and Constituent Syntactic Trees. In ECML, pages 318--329. 2006. Google ScholarDigital Library
P. Nakov, L. Màrquez, A. Moschitti, W. Magdy, H. Mubarak, A. A. Freihat, J. Glass, and B. Randeree. SemEval-2016 task 3: Community question answering. In Proceedings of SemEval '16. ACL, 2016.Google ScholarCross Ref
A. Severyn and A. Moschitti. Structural relationships for large-scale learning of answer re-ranking. SIGIR, pages 741--750, 2012. Google ScholarDigital Library
K. Tymoshenko and A. Moschitti. Assessing the impact of syntactic and semantic structures for answer passages reranking. In Proceedings of CIKM '15, pages 1451--1460, New York, NY, USA, 2015. ACM. Google ScholarDigital Library
K. Wang, Z. Ming, and T.-S. Chua. A syntactic tree matching approach to finding similar questions in community-based qa services. In SIGIR, pages 187--194, 2009. Google ScholarDigital Library
M. Wise. Yap3: Improved detection of similarities in computer program and other texts. In SIGCSE, pages 130--134, 1996. Google ScholarDigital Library
G. Zhou, L. Cai, J. Zhao, and K. Liu. Phrase-based translation model for question retrieval in community question answer archives. In ACL, pages 653--662, 2011. Google ScholarDigital Library

Index Terms

Learning to Re-Rank Questions in Community Question Answering Using Advanced Features
1. Computing methodologies
  1. Machine learning
    1. Learning paradigms
      1. Supervised learning
        Learning to rank
    2. Machine learning approaches
      1. Kernel methods
        Support vector machines
2. Information systems
  1. Information retrieval
    1. Retrieval tasks and goals
      1. Question answering

Recommendations

Improving search relevance for short queries in community question answering
WSDM '14: Proceedings of the 7th ACM international conference on Web search and data mining

Relevant question retrieval and ranking is a typical task in community question answering (CQA). Existing methods mainly focus on long and syntactically structured queries. However, when an input query is short, the task becomes challenging, due to a ...
Read More
A community question-answering refinement system
HT '11: Proceedings of the 22nd ACM conference on Hypertext and hypermedia

Community Question Answering (CQA) websites, which archive millions of questions and answers created by CQA users to provide a rich resource of information that is missing at web search engines and QA websites, have become increasingly popular. Web ...
Read More
On Application of Learning to Rank for E-Commerce Search
SIGIR '17: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval

E-Commerce (E-Com) search is an emerging important new application of information retrieval. Learning to Rank (LETOR) is a general effective strategy for optimizing search engines, and is thus also a key technology for E-Com search. While the use of ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
CIKM '16: Proceedings of the 25th ACM International on Conference on Information and Knowledge Management
October 2016
2566 pages
ISBN:9781450340731
DOI:10.1145/2983323
General Chairs:
Snehasis Mukhopadhyay
Indiana University Purdue University Indianapolis, USA
,
ChengXiang Zhai
University of Illinois at Urbana-Champaign, USA
,
Program Chairs:
Elisa Bertino
Purdue University
,
Fabio Crestani
University of Lugano
,
Javed Mostafa
University of North Carolina
,
Jie Tang
Tsinghua University
,
Luo Si
Alibaba Group Inc & Purdue University
,
Xiaofang Zhou
University of Queensland
,
Yi Chang
Yahoo Research
,
Yunyao Li
IBM Research - Almaden
,
Parikshit Sondhi
WalmartLabs
Copyright © 2016 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 24 October 2016
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
community question answering
learning to rank
syntactic structures
Qualifiers
- short-paper
Conference

Acceptance Rates
CIKM '16 Paper Acceptance Rate160of701submissions,23%Overall Acceptance Rate1,861of8,427submissions,22%
More
Upcoming Conference
CIKM '24

Sponsor:

sigir

sigir

The 33rd ACM International Conference on Information and Knowledge Management

October 21 - 25, 2024

Boise , ID , USA
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 8
  Total Citations
  View Citations
- 267
  Total Downloads
- Downloads (Last 12 months)6
- Downloads (Last 6 weeks)1
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Learning to Re-Rank Questions in Community Question Answering Using Advanced Features

CIKM '16: Proceedings of the 25th ACM International on Conference on Information and Knowledge Management

ABSTRACT

References

Cited By

Index Terms

Recommendations

Improving search relevance for short queries in community question answering

A community question-answering refinement system

On Application of Learning to Rank for E-Commerce Search