skip to main content
10.1145/2396761.2398493acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
short-paper

Topic-sensitive probabilistic model for expert finding in question answer communities

Authors Info & Claims
Published:29 October 2012Publication History

ABSTRACT

In this paper, we address the problem of expert finding in community question answering (CQA). Most of the existing approaches attempt to find experts in CQA by means of link analysis techniques. However, these traditional techniques only consider the link structure while ignore the topical similarity among users (askers and answerers) and user expertise and user reputation. In this study, we propose a topic-sensitive probabilistic model, which is an extension of PageRank algorithm to find experts in CQA. Compared to the traditional link analysis techniques, our proposed method is more effective because it finds the experts by taking into account both the link structure and the topical similarity among users. We conduct experiments on real world data set from Yahoo! Answers. Experimental results show that our proposed method significantly outperforms the traditional link analysis techniques and achieves the state-of-the-art performance for expert finding in CQA.

References

  1. E. Agichtein, C. Castillo, D. Donato, A. Gionis, and G. Mishne. 2008. Finding high-quality content in social media. In WSDM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. D. Blei, A. Ng, and M. Jordan. 2003. Latent dirichlet allocation. Journal of Machine Learning Research, 3:993--1022. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. M. Bouguessa, B. Dumoulin, and S. Wang. 2008. Identifying authoritative actors in question-answering forums - the case of Yahoo! Answers. In KDD, pages 866--874. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. A. P. Dempster, N. M. Laird, D. B. Rubin. 1977. Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society. Series B (Methodological), 39(1):1--38.Google ScholarGoogle ScholarCross RefCross Ref
  5. T. Griffiths and M. Steyvers. 2004. Finding scientific topics. The National Academy of Sciences, 101:5228--5235.Google ScholarGoogle ScholarCross RefCross Ref
  6. T. H. Haveliwala. 2002. Topic-sensitive pagerank. In WWW. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. P. Jurczyk and E. Agichtein. 2007. Discovering authorities in question answer communities by using link analysis. In CIKM, pages 919--922. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. W. Kao, D. Liu, and S. Wang. 2010. Expert finding in question-answering websites: a novel hybrid approach. In SAC, pages 867--871. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. J. Kleinberg. 1999. Authoritative sources in a hyperlinked environment. Journal of the ACM, 46(5):604--632. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. S. Kullback and R. A. Leibler. 1951. On information and sufficiency. Annals of Mathematical Statistics 22 (1): 79--86.Google ScholarGoogle ScholarCross RefCross Ref
  11. J. Lafferty and C. Zhai. 2003. Probabilistic relevance models based on document and query generation. Language Modeling and Information Retrieval, Kluwer International Series on Information Retrieval.Google ScholarGoogle Scholar
  12. B. Li and I. King. 2010. Routing questions to appropriate answerers in community question answering services. In CIKM, pages 1585--1588. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Z. Liu, W. Huang, Y. Zheng, and M. Sun. 2010. Automatic keyphrase extraction via topic decomposition. In EMNLP, pages 366--376. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. J. Liu, Y. -I. Song, and C. -Y. Lin. 2011. Competition-based user expertise score estimation. In SIGIR, pages 425--434. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. L. Nie, B. D. Davison, and X. Qi. 2006. Topic link analysis for web search. In SIGIR. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. L. Page, S. Brin, R. Motwani, and T. Winograd. 1998. The pagerank citation ranking: bringing order to the web. Stanford Digtital Library Technologies Project.Google ScholarGoogle Scholar
  17. A. Pal and S. Counts. 2011. Identifying topical authorities in microblogs. In WSDM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. A. Pal and J. Konstan. 2010. Expert identification in community question answering: exploring question selection bias. In CIKM, pages 1505--1508. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. I. Porteous, D. Newman, A. Ihler, A. Asuncion, P. Smyth, and M. Welling. 2008. Fast collapsed gibbs sampling for latent dirichlet allocation. In KDD, pages 569--577. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. M. Rosen-Zvi, T. Griffiths, M. Steyvers, and P. Smyth. 2004. The author-topic model for authors and documents. In UAI, pages 487--494. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. D. Schall and F. Skopik. 2011. An Analysis of the Structure and Dynamics of Large-Scale Q/A Communities. In ADBIS, pages 285--301. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. M. Steyvers, P. Smyth, and T. Griffiths. 2002. Probabilistic author-topic models for informaiton discovery. In KDD. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. J. Weng, E. -P. Lim, J. Jiang, and Q. He. 2010. TwitterRank: finding topic-sensitive influential twitterers. In WSDM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. J. Zhang, M. Ackerman, and L. Adamic. 2007. Expertise networks in online commmunities: structure and algorithm. In WWW. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. G. Zhou, L. Cai, J. Zhao, and K. Liu. 2011. Phrase-based translation model for question retrieval in community question answer archives. In ACL, pages 653--662. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Topic-sensitive probabilistic model for expert finding in question answer communities

            Recommendations

            Comments

            Login options

            Check if you have access through your login credentials or your institution to get full access on this article.

            Sign in
            • Published in

              cover image ACM Conferences
              CIKM '12: Proceedings of the 21st ACM international conference on Information and knowledge management
              October 2012
              2840 pages
              ISBN:9781450311564
              DOI:10.1145/2396761

              Copyright © 2012 ACM

              Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

              Publisher

              Association for Computing Machinery

              New York, NY, United States

              Publication History

              • Published: 29 October 2012

              Permissions

              Request permissions about this article.

              Request Permissions

              Check for updates

              Qualifiers

              • short-paper

              Acceptance Rates

              Overall Acceptance Rate1,861of8,427submissions,22%

              Upcoming Conference

            PDF Format

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader