Abstract
Community Question Answering (CQA) services, such as Yahoo! Answers, WikiAnswers, and Quora, have recently proliferated on the Internet. A large number of questions in CQA ask for information about a certain place (e.g., a city). Answering such local questions requires some local knowledge; therefore, it is probably beneficial to treat them differently from global questions for answer retrieval and answerer recommendation etc. In this paper, we address the problem of automatically identifying local questions in CQA through machine learning. The challenge is that manually labelling questions as local or global for training would be costly. Realising that we could find many local questions reliably from a few location-related categories (e.g., “Travel”), we propose to build local/global question classifiers in the framework of PU learning (i.e., learning from positive and unlabelled examples), and thus remove the need of manually labelling questions. In addition to standard text features of questions, we also make use of locality features which are extracted by the geo-parsing tool Yahoo! Placemaker. Our experiments on real-world datasets (collected from Yahoo! Answers and WikiAnswers) show that the probability estimation approach to PU learning outperforms S-EM (spy EM) and Biased-SVM for this task. Furthermore, we demonstrate that the spatial scope of a local question can be inferred accurately even if it does not mention any place name. This is particularly helpful in a mobile environment as users would be able to ask local questions via their GPS-equipped mobile phones without explicitly mentioning their current location and intended search radius.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Broder, A.: A taxonomy of web search. SIGIR Forum 36, 3–10 (2002)
Chen, L., Zhang, D., Levene, M.: Understanding user intent in community question answering. In: Proceedings of the 21st International Conference Companion on World Wide Web, WWW 2012 Companion, pp. 823–828. ACM, New York (2012)
Elkan, C., Noto, K.: Learning classifiers from only positive and unlabeled data. In: Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2008, pp. 213–220. ACM, New York (2008)
Gravano, L., Hatzivassiloglou, V., Lichtenstein, R.: Categorizing web queries according to geographical locality. In: Proceedings of the Twelfth International Conference on Information and Knowledge Management, CIKM 2003, pp. 325–333. ACM, New York (2003)
Li, B., King, I., Lyu, M.R.: Question routing in community question answering: putting category in its place. In: Proceedings of the 20th ACM International Conference on Information and Knowledge Management, CIKM 2011, pp. 2041–2044. ACM, New York (2011)
Liu, B., Dai, Y., Li, X., Lee, W.S., Yu, P.S.: Building text classifiers using positive and unlabeled examples. In: Proceedings of the Third IEEE International Conference on Data Mining, ICDM 2003, pp. 179–188. IEEE Computer Society, Washington, DC, USA (2003), http://dl.acm.org/citation.cfm?id=951949.952139
Liu, B., Lee, W.S., Yu, P.S., Li, X.: Partially supervised classification of text documents. In: Proceedings of the Nineteenth International Conference on Machine Learning, ICML 2002, pp. 387–394. Morgan Kaufmann Publishers Inc., San Francisco (2002)
Manning, C.D., Raghavan, P., Schtze, H.: Introduction to Information Retrieval. Cambridge University Press, New York (2008)
McCallum, A., Nigam, K.: A comparison of event models for naive bayes text classification. In: AAAI 1998 Workshop on Learning for Text Categorization, pp. 41–48. AAAI Press (1998)
Platt, J.C.: Fast training of support vector machines using sequential minimal optimization. In: Advances in Kernel Methods, pp. 185–208. MIT Press, Cambridge (1999)
Yang, Y., Liu, X.: A re-examination of text categorization methods. In: Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 1999, pp. 42–49. ACM, New York (1999)
Zhou, T.C., Lyu, M.R., King, I.: A classification-based approach to question routing in community question answering. In: Proceedings of the 21st International Conference Companion on World Wide Web, WWW 2012 Companion, pp. 783–790. ACM, New York (2012)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chen, L., Zhang, D., Levene, M. (2012). Identifying Local Questions in Community Question Answering. In: Hou, Y., Nie, JY., Sun, L., Wang, B., Zhang, P. (eds) Information Retrieval Technology. AIRS 2012. Lecture Notes in Computer Science, vol 7675. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35341-3_26
Download citation
DOI: https://doi.org/10.1007/978-3-642-35341-3_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35340-6
Online ISBN: 978-3-642-35341-3
eBook Packages: Computer ScienceComputer Science (R0)