skip to main content
10.1145/1963405.1963411acmotherconferencesArticle/Chapter ViewAbstractPublication PageswwwConference Proceedingsconference-collections
research-article

Sparse hidden-dynamics conditional random fields for user intent understanding

Authors Info & Claims
Published:28 March 2011Publication History

ABSTRACT

Understanding user intent from her sequential search behaviors, i.e. predicting the intent of each user query in a search session, is crucial for modern Web search engines. However, due to the huge number of user behavior variables and coarse level intent labels defined by human editors, it is very difficult to directly model user behavioral dynamics or user intent dynamics in user search sessions. In this paper, we propose a novel Sparse Hidden-Dynamic Conditional Random Fields (SHDCRF) model for user intent learning from their search sessions. Through incorporating the proposed hidden state variables, SHDCRF aims to learn a substructure, i.e. a set of related hidden variables, for each intent label and they are used to model the intermediate dynamics between user intent labels and user behavioral variables. In addition, SHDCRF learns a sparse relation between the hidden variables and intent labels to make the hidden state variables explainable. Extensive experiment results, on real user search sessions from a popular commercial search engine show that the proposed SHDCRF model significantly outperforms in terms of intent prediction results that those classical solutions such as Support Vector Machine (SVM), Conditional Random Field (CRF) and Latnet-Dynamic Conditional Random Field (LDCRF).

References

  1. E. Agichtein, Eric Brill and S. Dumais. Improving Web search ranking by incorporating user behavior information. In SIGIR' 06, pp. 19--26. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. E. Agichtein, Eric Brill, S. Dumais and R. Ragno. Learning User Interaction Models for Predicting Web Search Result Preferences, In SIGIR'06, pp. 3--10. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Z. B. Andrei, M. Fontoura, E. Gabrilovich, A. Joshi, V. Josifovski and T. Zhang. Robust Classification of Rare Queries Using Web Knowledge. In SIGIR'07, pp. 231--238. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. H. Cao, D. Jiang, J. Pei, Q. He and Z. Liao, E. Chen and H. Li. Context-Aware Query Suggestion By Mining Click-Through and session Data. In SIGKDD'08. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. H. Cao, D. Jiang, J. Pei, E. Chen and H. Li. Towards context-aware search by learning a very large variable length hidden markov model from search logs. In WWW'09. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. H. Cao, D. H. Hu, D. Shen, D. Jiang, J. T. Sun, E. Chen and Q. Yang. Context-aware query classification. In SIGIR'09. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Z. Cheng, B. Gao and T. Y. Liu. Actively predicting diverse search intent from user browsing behaviors. In WWW'10. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. C. Cortes and V. Vapnik. Support-Vector Networks. Machine Learning, 1995, Vol. 20, pp.273--297. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. E. R. Daniel and Danny Levinson. Understanding User Goals in Web Search. In WWW'04, pp. 13--19. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. G. David, D. Nichols, M. Brain and O. D. Terry. Using collaborative filtering to weave an information tapestry. Communications of the ACM, 1992, vol.12, pp. 61--70. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. V. Ganti, A.K. Christian and X. Li. Precomputing search features for fast and accurate query classification. In WSDM'10. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Gunawardana, M. Milind, A. Alex, and J.C. Platt. Hidden Conditional Random Fields for Phone Classification. International Conference on Speech Communication and Technology(ICSCT), 2005.Google ScholarGoogle Scholar
  13. Hassan, R. Jones, and K. L. Klinkner. Beyond DCG, User Behavior as a Predictor of a Successful Search. In WSDM'10. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Q. He, D. Jiang, Z. Liao, C. H. Steven, K. Chang, E. P. Lim and H. Li. Web Query Recommendation via Sequential Query Prediction. In ICDE'09. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. J. Hu, G. Wang, F. Lochovsky, J. T. Sun and Z. Chen. Understanding user's query intent with wikipedia. In WWW'09. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. D. H. Hu, D. Shen, J. T. Sun, Q. Yang and Z. Chen. Context-Aware Online Commercial Intention Detection. LNCS(5829), 2009, pp. 135--149. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. D. Jeffrey and S. Ghemawat. MapReduce: Simplified Data Processing on Large Clusters. In Operating Systems Design and Implementation(OSDI), 2004, pp. 137--150.Google ScholarGoogle Scholar
  18. T. Joachims. SVM-light Support Vector Machine. http://svmlight.joachims.org/Google ScholarGoogle Scholar
  19. I. H. Kang, G. C. Kim. Query Type Classification for web document Retrieval. In SIGIR'03. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. J. Lafferty, A. McCallum and F. Pereira. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In ICML' 01. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Lewis and David. Naive (Bayes) at Forty: The Independence Assumption in Information Retrieval. In ECML'98, pp.4--15. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. X. Li, Y. Y. W., Alex Acero. Learning Query Intent from Regularized Click Graphs. In SIGIR'08. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. D. C. Liu and J. Nocedal. On the Limited Memory Method for Large Scale Optimization. Mathematical Programming, 1989. pp. 503--528. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. S. Martigny and T. Artieres. Neural conditional random fields. In AISTATS'10, pp.177--184.Google ScholarGoogle Scholar
  25. H. Nguyen. Capturing User Intent For Information Retrieval. In AAAI'04. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. N. Okazaki. CRFsuit. A fast implementation of Conditional Random Fields. http://www.chokkan.org/software/crfsuite/.Google ScholarGoogle Scholar
  27. J. Pearl. Probabilistic Reasoning in Intellignet Systems: Networks of Plausible Inference. Morgan Kaufmann, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. J. Peng, L. F. Bo and J. B. Xu. Conditional Neural Fields. In NIPS'09.Google ScholarGoogle Scholar
  29. L. M. Philippe, A. Quattoni and T. Darrell. Latent-Dynamic Discriminative Models for Continuous Gesture Recognition. In CVPR'07.Google ScholarGoogle Scholar
  30. Quattoni, M. Collins, and T. Darrell. Conditional random fields for object recognition. In NIPS'04.Google ScholarGoogle Scholar
  31. B. Y. Ricardo, C. Hurtado and M. Mendoza. Query Clustering for Boosting Web Page Ranking. LNCS(3034), 2004, pp.164--175.Google ScholarGoogle Scholar
  32. S. Sarawagi and William W. Cohen. Semi-Markov conditional random fields for information extraction. In NIPS'04.Google ScholarGoogle Scholar
  33. K. Seymore, A. McCallum and R. Rosenfeld. Learning Hidden Markov Model Structure for Information Extraction. In AAAI'99 Workshop on Machine Learning for Information Extraction, 2009.Google ScholarGoogle Scholar
  34. D. Shen, J. T. Sun, Q. Yang, and Z. Chen. Building Bridges for Web Query Classification. In SIGIR'06. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. D. Shen and R. Pan. Query Enrichment for Web-Query Classification. ACM Transactions on Information System 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. M. B. Steven, C. J. Eric, F. Ophir, D. L. David, C. Abdur, K. Aleksander. Improving Automatic Query Classification via Semi-Supervised Learning. In ICDM'05.Google ScholarGoogle Scholar
  37. M. B. Steven and C. J. Eric. Automatic Classification of Web Queries Using Very Large Unlabeled Query Logs. In TOIS'06, vol. 24, pp. 320--352.Google ScholarGoogle Scholar
  38. Sutton and A. McCallum. Collective Segmentation and Labeling of Distant Entities in Information Extraction. In ICML workshop on Statistical Relational Learning, 2004.Google ScholarGoogle Scholar
  39. Sutton, K. Rohanimanesh and A. McCallum. Dynamic Conditional Random Fields: Factorized Probabilistic Models for Labeling and Segmenting Sequence Data. In ICML'04. Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. Viterbi. Error bounds for convolutional codes and an asymptotically optimum decoding algorithm. IEEE Transactions on Information Theory, 2003, Vol.13, pp. 260--269. Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. J. Wang, A. de Vries and M. Reinders. A User-Item Relevance Model for Log-Based Collaborative Filtering. LNCS3936 (January 2006), pp. 37--48. Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. S. B. Wang, A. Quattini, L. P. Morency and D. Demirdjian. Hidden Conditional Random Fields for Gesture Recognition. In CVPR'06. Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. Yu, L. Deng, and S. Wang. Learning in the Deep-Structured Conditional Random Fields. In NIPS'09.Google ScholarGoogle Scholar
  44. X. H. Zhang. Building Maximum Entropy Text Classifier Using Semi-Supervised Learning. PhD thesis, NUS, 2004.Google ScholarGoogle Scholar
  45. Thomas G. Dietterich. Machine Learning for Sequential Data: A Review. LNCS(2396), 2002, pp. 15--30. Google ScholarGoogle ScholarDigital LibraryDigital Library
  46. H. Zou, T. Hastie and R. Tibshirani. Sparse Principal Component Analysis. Journal of Computational and Graphical Statistics, 2006.Google ScholarGoogle ScholarCross RefCross Ref
  47. J. Mairal, F. Bach, J. Ponce and G. Sapiro. Online dictionary learning for sparse coding. In ICML'09. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Sparse hidden-dynamics conditional random fields for user intent understanding

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      WWW '11: Proceedings of the 20th international conference on World wide web
      March 2011
      840 pages
      ISBN:9781450306324
      DOI:10.1145/1963405

      Copyright © 2011 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 28 March 2011

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate1,899of8,196submissions,23%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader