skip to main content
10.1145/2063576.2063598acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

Improving context-aware query classification via adaptive self-training

Authors Info & Claims
Published:24 October 2011Publication History

ABSTRACT

Topical classification of user queries is critical for general-purpose web search systems. It is also a challenging task, due to the sparsity of query terms and the lack of labeled queries. On the other hand, search contexts embedded in query sessions and unlabeled queries free on the web have not been fully utilized in most query classification systems. In this work, we leverage these information to improve query classification accuracy.

We first incorporate search contexts into our framework using a Conditional Random Field (CRF) model. Discriminative training of CRFs is favored over the traditional maximum likelihood training because of its robustness to noise. We then adapt self-training with our model to exploit the information in unlabeled queries. By investigating different confidence measurements and model selection strategies, we effectively avoid the error-reinforcing nature of self-training. In extensive experiments on real search logs, we have averaged around 20% improvement in classification accuracy over other state-of-the-art baselines.

References

  1. S. Beitzel, E. Jensen, O. Frieder, D. Lewis, A. Chowdhury, and A. Kołcz. Improving automatic query classification via semi-supervised learning. In Proc. ICDM, pages 42--49, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. M. Belkin, I. Matveeva, and P. Niyogi. Regularization and semi-supervised learning on large graphs. Learning theory, pages 624--638, 2004.Google ScholarGoogle Scholar
  3. S. Benson, L. McInnes, J. Moré, and J. Sarich. TAO user manual (revision 1.9). Mathematics and Computer Science Division, Argonne National Laboratory, Tech. Rep. ANL/MCS-TM-242, 2005.Google ScholarGoogle Scholar
  4. A. Blum and T. Mitchell. Combining labeled and unlabeled data with co-training. In Proc. COLT, pages 92--100, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. C. Burges. A tutorial on support vector machines for pattern recognition. Data mining and knowledge discovery, 2(2):121--167, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. H. Cao, D. Hu, D. Shen, D. Jiang, J. Sun, E. Chen, and Q. Yang. Context-aware query classification. In Proc. SIGIR, pages 3--10, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. L. Catledge and J. Pitkow. Characterizing browsing strategies in the World-Wide Web. Computer Networks and ISDN systems, 27(6):1065--1073, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. O. Chapelle, B. Schölkopf, A. Zien, et al. Semi-supervised learning. MIT press Cambridge, MA, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. M. Chen, C. Y., M. Brent, and A. Tenney. Gradient-Based Feature Selection for Conditional Random Fields and Its Applications in Computational Genetics. In Proc. ICTAI, pages 750--757, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. B. Croft et al. The role of context and adaptation in user interfaces. Journal of Man-Machine Studies, 21(4):283--292, 1984. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. H. Cui, J. Wen, J. Nie, and W. Ma. Probabilistic query expansion using query logs. In Proc. WWW, pages 325--332, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. K. Gimpel and N. Smith. Softmax-margin crfs: Training log-linear models with cost functions. In Proc. ACL, pages 733--736, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. A. Goker. Context learning in Okapi. Journal of Documentation, 53(1):80--83, 1997.Google ScholarGoogle ScholarCross RefCross Ref
  14. B. Jansen, A. Spink, C. Blakely, and S. Koshman. Defining a session on web search engines. Journal of the American Society for Information Science and Technology, 58(6):862--871, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. F. Jiao, S. Wang, C. Lee, R. Greiner, and D. Schuurmans. Semi-supervised conditional random fields for improved sequence segmentation and labeling. In Proc. ACL, pages 209--216, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. T. Joachims. Learning to classify text using support vector machines: Methods, theory, and algorithms. Computational Linguistics, 29(4):656--664, 2002.Google ScholarGoogle Scholar
  17. R. Jones and K. Klinkner. Beyond the session timeout: automatic hierarchical segmentation of search topics in query logs. In Proc. CIKM, pages 699--708, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. J. Lafferty, A. McCallum, and F. Pereira. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proc. ICML, pages 282--289, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. X. Li, Y. Wang, and A. Acero. Learning query intent from regularized click graphs. In Proc. SIGIR, pages 339--346, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. G. Mann and A. McCallum. Simple, robust, scalable semi-supervised learning via expectation regularization. In Proc. ICML, pages 593--600. ACM, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. N. Seshadri and C. Sundberg. List Viterbi decoding algorithms with applications. Communications, IEEE Transactions on, 42(234):313--323, 2002.Google ScholarGoogle Scholar
  22. F. Sha and F. Pereira. Shallow parsing with conditional random fields. In Proc. Human Language Technology - NAACL, pages 134--141, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. F. Sha and L. Saul. Large margin hidden Markov models for automatic speech recognition. In Proc. NIPS, pages 1249--1256, 2007.Google ScholarGoogle Scholar
  24. C. Silverstein, H. Marais, M. Henzinger, and M. Moricz. Analysis of a very large web search engine query log. In ACM SIGIR Forum, volume 33, pages 6--12, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. C. Sutton and A. McCallum. An Introduction to Conditional Random Fields for Relational Learning. Introduction to statistical relational learning, page 93, 2007.Google ScholarGoogle Scholar
  26. S. Talja, H. Keso, and T. Pietil\"ainen. The production of 'context' in information seeking research: a metatheoretical view. Information Processing and Management, 35(6):751--763, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. B. Taskar, C. Guestrin, and D. Koller. Max-margin Markov networks. In Proc. NIPS, 2003.Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. I. Tsochantaridis, T. Hofmann, T. Joachims, and Y. Altun. Support vector machine learning for interdependent and structured output spaces. In Proc. ICML, page 104, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. V. Vapnik and V. Vapnik. Statistical learning theory. Wiley New York, 1998.Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. D. Yarowsky. Unsupervised word sense disambiguation rivaling supervised methods. In Proc. ACL, pages 189--196, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. T. Zhang and F. Oles. A probability analysis on the value of unlabeled data for classification problems. In Proc. ICML, pages 1191--1198, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. X. Zhu. Semi-supervised learning literature survey. Computer Science, University of Wisconsin-Madison, 2006.Google ScholarGoogle Scholar

Index Terms

  1. Improving context-aware query classification via adaptive self-training

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Conferences
            CIKM '11: Proceedings of the 20th ACM international conference on Information and knowledge management
            October 2011
            2712 pages
            ISBN:9781450307178
            DOI:10.1145/2063576

            Copyright © 2011 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 24 October 2011

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article

            Acceptance Rates

            Overall Acceptance Rate1,861of8,427submissions,22%

            Upcoming Conference

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader