research-article

Tapping on the potential of q&a community by recommending answer providers

Authors:

Yong YuAuthors Info & Claims

CIKM '08: Proceedings of the 17th ACM conference on Information and knowledge management

Pages 921 - 930

https://doi.org/10.1145/1458082.1458204

Published: 26 October 2008 Publication History

Abstract

The rapidly increasing popularity of community-based Question Answering (cQA) services, e.g. Yahoo! Answers, Baidu Zhidao, etc. have attracted great attention from both academia and industry. Besides the basic problems, like question searching and answer finding, it should be noted that the low participation rate of users in cQA service is the crucial problem which limits its development potential. In this paper, we focus on addressing this problem by recommending answer providers, in which a question is given as a query and a ranked list of users is returned according to the likelihood of answering the question. Based on the intuitive idea for recommendation, we try to introduce topic-level model to improve heuristic term-level methods, which are treated as the baselines. The proposed approach consists of two steps: (1) discovering latent topics in the content of questions and answers as well as latent interests of users to build user profiles; (2) recommending question answerers for new arrival questions based on latent topics and term-level model. Specifically, we develop a general generative model for questions and answers in cQA, which is then altered to obtain a novel computationally tractable Bayesian network model. Experiments are carried out on a real-world data crawled from Yahoo! Answers during Jun 12 2007 to Aug 04 2007, which consists of 118510 questions, 772962 answers and 150324 users. The experimental results reveal significant improvements over the baseline methods and validate the positive influence of topic-level information.

References

[1]

C. Fellbaum. WordNet: An Electronic Lexical Database. MIT Press, 1998.

[2]

Ricardo Bae-Yates and Berthier Ribeiro-Neto. Modern Information Retrieval. Addison Wesley. 1999.

Digital Library

[3]

Stephen Robertson, Hugo Zaragoza and Michael Taylor. Simple BM25 Extension to Multiple Weighted Fields. In Proc. of CIKM'04, pages 42--49, 2004.

Digital Library

[4]

Ricardo Baeza-Yates and Alessandro Tiberi. Extracting Semantic Relations from Query Logs. In Proc. of KDD'07, pages 76--85, 2007.

Digital Library

[5]

James Surowiecki. The Wisdom of Crowds: Why the Many Are Smarter Than the Few and How Collective Wisdom Shapes Business, Economies, Societies and Nations, Little and Brown, 2004.

Digital Library

[6]

Jiwoon Jeon, W. Bruce Croft, Joon Ho Lee and Soyeon Park. A Framework to Predict the Quality of Answers with Non-Textual Features. In Proc. of SIGIR'06, pages 228--235, 2006.

Digital Library

[7]

Jiwoon Jeon, W. Bruce Croft and Joon Ho Lee. Finding Similar Questions in Large Question and Answer Archives. In Proc. of CIKM'05, pages 84--90, 2005.

Digital Library

[8]

Jiwoon Jeon, W. Bruce Croft and Joon Ho Lee. Finding Semantically Similar Questions Based on Their Answers. In Proc. of SIGIR'05, pages 617--618, 2005.

Digital Library

[9]

Yupeng Fu, Rongjing Xiang, Yiqun Liu, Min Zhang and Shaoping Ma. A CDD-based Formal Model for Expert Finding. In Proc. of CIKM'07, pages 881--884, 2007.

Digital Library

[10]

Y. Cao, H. Duan, Chin-Yew Lin, Y. Yu and Hsiao-Wuen Hon. Recommending Questions Using the MDL-based Tree Cut Model. In Proc. of WWW'08, pages 81--90, 2008.

Digital Library

[11]

A. Berger, R. Caruana, D. Cohn, D. Freitag, and V. Mittal. Bridging the lexical chasm: statistical approaches to answer-finding. In Proc. of SIGIR'00, pages 192--199, 2000.

Digital Library

[12]

R. D. Burke, K. J. Hammond, V. A. Kulyukin, S. L. Lytinen, N. Tomuro, and S. Schoenberg. Question answering from frequently asked question files: Experiences with the faq finder system. Technical report, 1997.

Digital Library

[13]

M. A. Pasca and S. M. Harabagiu. High performances question/answering. In Proc. of SIGIR'01, pages 366--374, 2001.

Digital Library

[14]

E. Sneiders. Automated question answering using question templates that cover the conceptual model of the database. In Proc. of NLDB'02, pages 235--239, 2002.

Digital Library

[15]

E. M. Voorhees. Overview of the TREC 2004 question answering track. In Proc. of the TREC'04.

[16]

Krisztian Balog, Leif Azzopardi, and Maarten de Rijke. Formal models for expert finding in enterprise corpora. In Proc. of SIGIR'06, pages 43--50, 2006.

Digital Library

[17]

C. S. Campbell, P. P. Maglio, A. Cozzi, and B. Dom. Expertise identification using email communications. In Proc. of CIKM'03, pages 528--531, 2003.

Digital Library

[18]

N. Craswell, D. Hawking, A. M. Vercoustre, and P. Wilkins. P@noptic expert: Searching for experts not just for documents. In Proc. of Ausweb'01.

[19]

Byron Dom, Iris Eiron, Alex Cozzi and Yi Zhang. Graph-based ranking algorithms for e-mail expertise analysis. In Proc. of SIGMOD workshop, pages 42--48, 2003.

Digital Library

[20]

Audris Mockus and James D. Herbsleb. Expertise browser: a quantitative approach to identifying expertise. In Proc. of ICSE'02, pages 503--512, 2002.

Digital Library

[21]

J. Zhang, L. A. Adamic, E. Bakshy and Mark S. Ackerman. Everyone knows something: Examining knowledge sharing on Yahoo Answers. In Proc. of WWW'08, pages 665--674, 2008.

Digital Library

[22]

S. Deerwester, S. Dumais, T. Landauer, G. Furnas, and R. Harshman. Indexing by latent semantic analysis. JASIS, 41(6):391---407, 1990.

[23]

T. Hofmann. Probabilistic latent semantic indexing. In Proc. of SIGIR'99.

Digital Library

[24]

D. M. Blei, A. Y. Ng, and M. I. Jordan (2003). Latent Dirichlet Allocation. Journal of Machine Learning Research, 3 : 993--1022

Digital Library

[25]

X. Wu, L. Zhang, and Y. Yu. Exploring social annotations for the semantic web. In Proc. of WWW'06, pages 417--426, 2006.

Digital Library

[26]

M. Rosen-Zvi, T. Griffiths, M. Steyvers, and P. Smyth. The author-topic model for authors and documents. In Proc. of UAI '04, pages 487--494, 2004.

Digital Library

[27]

T. Griffiths and M. Steyvers. Finding scientific topics. In National Academy of Sciences, 2004.

[28]

D. Zhou, J. Bian, S. Zheng, H. Zha, and C. L. Giles. Exploring social annotation for information retrieval. In Proc. of WWW'08, pages 715--724, 2008.

Digital Library

[29]

G. Casella and E. I. George. Explaining the Gibbs Sampler. The American Statistician, Aug, 1992, Vol, 46, No. 3.

[30]

X. Wang and A. McCallum. Topic over Time: A Non-Markov Continuous-Time Model of Topical Trends. In Proc. of SIGKDD'06, pages 424--433, 2006.

Digital Library

[31]

N. Agarwal, H. Liu, L. Tang and Philip S. Yu. Identifying the influential Bloggers in a community. In Proc. of WSDM'08, pages 207--218, 2008.

Digital Library

[32]

M. Zhou, S. Bao, X. Wu and Y. Yu. An unsupervised model for exploring hierarchical semantics from social annotation. In Proc. of ISWC'07, pages 680--693, 2007.

Digital Library

[33]

E. Agichtein, C. Castillo, D. Donato, A. Gionis and G. Mishne. Finding High-Quality Content in Social Media. In Proc. of WSDM'08, pages 183--194

Digital Library

[34]

S. Auer, C. Bizer, G. Kobilarov, J. Lehmann, R. Cyganiak and Z. Ives: DBpedia: A Nucleus for a Web of Open Data. In Proc. of ISWC'07, pages 722--735, 2007.

Digital Library

[35]

M. Pasca. Weakly-Supervised Discovery of Named Entities Using Web Search Queries. In Proc. of CIKM'07, pages 683--690, 2007.

Digital Library

[36]

M. Pasca. Organizing and Searching the World Wide Web of Facts - Step Two: Harnessing the Wisdom of the Crowds. In Proc. of WWW'07, pages 101--110, 2007.

Digital Library

[37]

P. Jurczyk and E. Agichtein. Discovering Authorities in Question Answer Communities by Using Link Analysis. In Proc. of CIKM'07, pages 919--922, 2007.

Digital Library

[38]

Tom Griffiths. Gibbs sampling in the generative model of Latent Dirichlet Allocation. http://www-psych.stanford.edu/~gruffydd/cogsci02/lda.ps

Cited By

Zhang LShi YQi KWu DWang XYan ZChen Z(2025)Cross-space topological contrastive learning for knowledge graph-aware issue recommendationKnowledge and Information Systems10.1007/s10115-025-02355-zOnline publication date: 18-Feb-2025
https://doi.org/10.1007/s10115-025-02355-z
Luo GLi C(2024)Research on Mental Health Problem Identification Algorithm of College Students2024 IEEE 2nd International Conference on Image Processing and Computer Applications (ICIPCA)10.1109/ICIPCA61593.2024.10708980(506-510)Online publication date: 28-Jun-2024
https://doi.org/10.1109/ICIPCA61593.2024.10708980
Wu XLaufer ELi HKhomh FSrinivasan SLuo J(2024)Characterizing and classifying developer forum posts with their intentionsEmpirical Software Engineering10.1007/s10664-024-10487-z29:4Online publication date: 5-Jun-2024
https://doi.org/10.1007/s10664-024-10487-z
Show More Cited By

Index Terms

Tapping on the potential of q&a community by recommending answer providers
1. Information systems
  1. Information retrieval
    1. Information retrieval query processing
  2. World Wide Web
    1. Web applications
    2. Web services
2. Mathematics of computing
  1. Probability and statistics

Recommendations

Evaluating and predicting answer quality in community QA
SIGIR '10: Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval

Question answering (QA) helps one go beyond traditional keywords-based querying and retrieve information in more precise form than given by a document or a list of documents. Several community-based QA (CQA) services have emerged allowing information ...
Exploring heterogeneous features for query-focused summarization of categorized community answers

Community-based question answering (cQA) is a popular type of online knowledge-sharing web service where users ask questions and obtain answers contributed by others. To enhance knowledge sharing, cQA also provides users with a retrieval function to ...
Novelty based Ranking of Human Answers for Community Questions
SIGIR '16: Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval

Questions and their corresponding answers within a community based question answering (CQA) site are frequently presented as top search results forWeb search queries and viewed by millions of searchers daily. The number of answers for CQA questions ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

CIKM '08: Proceedings of the 17th ACM conference on Information and knowledge management

October 2008

1562 pages

ISBN:9781595939913

DOI:10.1145/1458082

General Chair:
James G. Shanahan
Church and Duncan Group Inc, USA
,
Program Chairs:
Sihem Amer-Yahia
Yahoo! Research, USA
,
Ioana Manolescu
INRIA, France
,
Yi Zhang
University of California, Santa Cruz, USA
,
David A. Evans
JustSystems Evans Research, USA
,
Alek Kolcz
Microsoft Live Labs, USA
,
Key-Sun Choi
KAIST, Korea
,
Abdur Chowdury
Twitter, USA

Copyright © 2008 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 26 October 2008

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Conference

CIKM08

Sponsor:

CIKM08: Conference on Information and Knowledge Management

October 26 - 30, 2008

California, Napa Valley, USA

Acceptance Rates

Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

CIKM '25

Sponsor:
sigir
sigir

The 34th ACM International Conference on Information and Knowledge Management

November 10 - 14, 2025

Seoul , Republic of Korea

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

116
Total Citations
View Citations
1,390
Total Downloads

Downloads (Last 12 months)9
Downloads (Last 6 weeks)1

Reflects downloads up to 01 Mar 2025

Other Metrics

View Author Metrics

Citations

Cited By

Zhang LShi YQi KWu DWang XYan ZChen Z(2025)Cross-space topological contrastive learning for knowledge graph-aware issue recommendationKnowledge and Information Systems10.1007/s10115-025-02355-zOnline publication date: 18-Feb-2025
https://doi.org/10.1007/s10115-025-02355-z
Luo GLi C(2024)Research on Mental Health Problem Identification Algorithm of College Students2024 IEEE 2nd International Conference on Image Processing and Computer Applications (ICIPCA)10.1109/ICIPCA61593.2024.10708980(506-510)Online publication date: 28-Jun-2024
https://doi.org/10.1109/ICIPCA61593.2024.10708980
Wu XLaufer ELi HKhomh FSrinivasan SLuo J(2024)Characterizing and classifying developer forum posts with their intentionsEmpirical Software Engineering10.1007/s10664-024-10487-z29:4Online publication date: 5-Jun-2024
https://doi.org/10.1007/s10664-024-10487-z
Kasela PPasi GPerego R(2023)SE-PEF: a Resource for Personalized Expert FindingProceedings of the Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific Region10.1145/3624918.3625335(288-309)Online publication date: 26-Nov-2023
https://dl.acm.org/doi/10.1145/3624918.3625335
Costa GOrtale R(2023) Ask and Ye shall be AnsweredInformation Fusion10.1016/j.inffus.2023.10185699:COnline publication date: 1-Nov-2023
https://dl.acm.org/doi/10.1016/j.inffus.2023.101856
Li YWang WPeng QLiu HShao MJiao P(2023)Expertise-Oriented Explainable Question RoutingCollaborative Computing: Networking, Applications and Worksharing10.1007/978-3-031-24383-7_3(41-57)Online publication date: 25-Jan-2023
https://doi.org/10.1007/978-3-031-24383-7_3
Modi ARajanala SSingh M(2022)DACE: Did I Catch You at a Good Time?Recent Challenges in Intelligent Information and Database Systems10.1007/978-981-19-8234-7_25(313-326)Online publication date: 24-Nov-2022
https://doi.org/10.1007/978-981-19-8234-7_25
Askari AVerberne SPasi G(2022)Expert Finding in Legal Community Question AnsweringAdvances in Information Retrieval10.1007/978-3-030-99739-7_3(22-30)Online publication date: 5-Apr-2022
https://doi.org/10.1007/978-3-030-99739-7_3
de Campos LFernández-Luna JHuete JRedondo-Expósito L(2021)LDA-based term profiles for expert finding in a political settingJournal of Intelligent Information Systems10.1007/s10844-021-00636-xOnline publication date: 23-Mar-2021
https://doi.org/10.1007/s10844-021-00636-x
Kundu DPal RMandal D(2021)Time-aware hybrid expertise retrieval system in community question answering servicesApplied Intelligence10.1007/s10489-020-02177-2Online publication date: 17-Feb-2021
https://doi.org/10.1007/s10489-020-02177-2
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten