Skip to main content

Identifying Local Questions in Community Question Answering

  • Conference paper
Book cover Information Retrieval Technology (AIRS 2012)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7675))

Included in the following conference series:

Abstract

Community Question Answering (CQA) services, such as Yahoo! Answers, WikiAnswers, and Quora, have recently proliferated on the Internet. A large number of questions in CQA ask for information about a certain place (e.g., a city). Answering such local questions requires some local knowledge; therefore, it is probably beneficial to treat them differently from global questions for answer retrieval and answerer recommendation etc. In this paper, we address the problem of automatically identifying local questions in CQA through machine learning. The challenge is that manually labelling questions as local or global for training would be costly. Realising that we could find many local questions reliably from a few location-related categories (e.g., “Travel”), we propose to build local/global question classifiers in the framework of PU learning (i.e., learning from positive and unlabelled examples), and thus remove the need of manually labelling questions. In addition to standard text features of questions, we also make use of locality features which are extracted by the geo-parsing tool Yahoo! Placemaker. Our experiments on real-world datasets (collected from Yahoo! Answers and WikiAnswers) show that the probability estimation approach to PU learning outperforms S-EM (spy EM) and Biased-SVM for this task. Furthermore, we demonstrate that the spatial scope of a local question can be inferred accurately even if it does not mention any place name. This is particularly helpful in a mobile environment as users would be able to ask local questions via their GPS-equipped mobile phones without explicitly mentioning their current location and intended search radius.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Broder, A.: A taxonomy of web search. SIGIR Forum 36, 3–10 (2002)

    Article  Google Scholar 

  2. Chen, L., Zhang, D., Levene, M.: Understanding user intent in community question answering. In: Proceedings of the 21st International Conference Companion on World Wide Web, WWW 2012 Companion, pp. 823–828. ACM, New York (2012)

    Chapter  Google Scholar 

  3. Elkan, C., Noto, K.: Learning classifiers from only positive and unlabeled data. In: Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2008, pp. 213–220. ACM, New York (2008)

    Chapter  Google Scholar 

  4. Gravano, L., Hatzivassiloglou, V., Lichtenstein, R.: Categorizing web queries according to geographical locality. In: Proceedings of the Twelfth International Conference on Information and Knowledge Management, CIKM 2003, pp. 325–333. ACM, New York (2003)

    Chapter  Google Scholar 

  5. Li, B., King, I., Lyu, M.R.: Question routing in community question answering: putting category in its place. In: Proceedings of the 20th ACM International Conference on Information and Knowledge Management, CIKM 2011, pp. 2041–2044. ACM, New York (2011)

    Google Scholar 

  6. Liu, B., Dai, Y., Li, X., Lee, W.S., Yu, P.S.: Building text classifiers using positive and unlabeled examples. In: Proceedings of the Third IEEE International Conference on Data Mining, ICDM 2003, pp. 179–188. IEEE Computer Society, Washington, DC, USA (2003), http://dl.acm.org/citation.cfm?id=951949.952139

    Chapter  Google Scholar 

  7. Liu, B., Lee, W.S., Yu, P.S., Li, X.: Partially supervised classification of text documents. In: Proceedings of the Nineteenth International Conference on Machine Learning, ICML 2002, pp. 387–394. Morgan Kaufmann Publishers Inc., San Francisco (2002)

    Google Scholar 

  8. Manning, C.D., Raghavan, P., Schtze, H.: Introduction to Information Retrieval. Cambridge University Press, New York (2008)

    Book  MATH  Google Scholar 

  9. McCallum, A., Nigam, K.: A comparison of event models for naive bayes text classification. In: AAAI 1998 Workshop on Learning for Text Categorization, pp. 41–48. AAAI Press (1998)

    Google Scholar 

  10. Platt, J.C.: Fast training of support vector machines using sequential minimal optimization. In: Advances in Kernel Methods, pp. 185–208. MIT Press, Cambridge (1999)

    Google Scholar 

  11. Yang, Y., Liu, X.: A re-examination of text categorization methods. In: Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 1999, pp. 42–49. ACM, New York (1999)

    Chapter  Google Scholar 

  12. Zhou, T.C., Lyu, M.R., King, I.: A classification-based approach to question routing in community question answering. In: Proceedings of the 21st International Conference Companion on World Wide Web, WWW 2012 Companion, pp. 783–790. ACM, New York (2012)

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Chen, L., Zhang, D., Levene, M. (2012). Identifying Local Questions in Community Question Answering. In: Hou, Y., Nie, JY., Sun, L., Wang, B., Zhang, P. (eds) Information Retrieval Technology. AIRS 2012. Lecture Notes in Computer Science, vol 7675. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35341-3_26

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-35341-3_26

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-35340-6

  • Online ISBN: 978-3-642-35341-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics