skip to main content
10.1145/1645953.1645961acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

Semi-supervised learning of semantic classes for query understanding: from the web and for the web

Published:02 November 2009Publication History

ABSTRACT

Understanding intents from search queries can improve a user's search experience and boost a site's advertising profits. Query tagging via statistical sequential labeling models has been shown to perform well, but annotating the training set for supervised learning requires substantial human effort. Domain-specific knowledge, such as semantic class lexicons, reduces the amount of needed manual annotations, but much human effort is still required to maintain these as search topics evolve over time.

This paper investigates semi-supervised learning algorithms that leverage structured data (HTML lists) from the Web to automatically generate semantic-class lexicons, which are used to improve query tagging performance - even with far less training data. We focus our study on understanding the correct objectives for the semi-supervised lexicon learning algorithms that are crucial for the success of query tagging. Prior work on lexicon acquisition has largely focused on the precision of the lexicons, but we show that precision is not important if the lexicons are used for query tagging. A more adequate criterion should emphasize a trade-off between maximizing the recall of semantic class instances in the data, and minimizing the confusability. This ensures that the similar levels of precision and recall are observed on both training and test set, hence prevents over-fitting the lexicon features. Experimental results on retail product queries show that enhancing a query tagger with lexicons learned with this objective reduces word level tagging errors by up to 25% compared to the baseline tagger that does not use any lexicon features. In contrast, lexicons obtained through a precision-centric learning algorithm even degrade the performance of a tagger compared to the baseline. Furthermore, the proposed method outperforms one in which semantic class lexicons have been extracted from a database.

References

  1. Textgraphs: Graph-based algorithms for natural language processing. http://www.textgraphs.org.Google ScholarGoogle Scholar
  2. E. Agichtein and L. Gravano. Snowball: extracting relations from large plain-text collections. In the Proceedings of the 5th ACM Conference on Digital Libraries, San Antonio, Texas, USA, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. M. J. Cafarella, A. Halevy, Z. D. Wang, E. Wu, and Y. Zhang. WebTables: Exploring the power of tables on the Web. In the Proceedings of VLDB, Auckland, New Zealand, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. E. Eiloff and R. Jones. Learning dictionaries for information extraction by multi-level bootstrapping. In the Proceedings of the 16th National Conference on Artificial Intelligence, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. O. Etzioni, M. Cafarella, D. Downey, A.-M. Popescu, T. Shaked, S. Soderl, D. S. Weld, and E. Yates. Methods for domain-independent information extraction from the web: An experimental comparison. In the Proceedings of AAAI, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. M. A. Hearst. Automatic acquisition of hyponyms from large text corpora. In the Proceedings of the 14th Conference on Computational Linguistics, 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. M. Komachi and H. Suzuki. Minimally supervised learning of semantic knowledge from query logs. In the Proceedings of IJCNLP, Hyderabad, India, 2008.Google ScholarGoogle Scholar
  8. J. Lafferty, A. McCallum, and F. Pereira. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In the Proceedings of ICML, pages 282--289, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. X. Li, Y.-Y. Wang, and A. Acero. Learning query intent from regularized click graphs. In the Proceedings of the 31st SIGIR Conference, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. X. Li, Y.-Y. Wang, and A. Acero. Extracting structured information from user queries with semi-supervised conditional random fields. In the Proceedings of the 32nd SIGIR Conference, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. D. Lin and P. Pantel. Concept discovery from text. In the Proceedings of the 19th International Conference on Computational linguistics (COLING-02), 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. A. McCallum and W. Li. Early results for named entity recognition with conditional random fields, feature induction and Web-enhanced lexicons. In the Proceedings of the 7th Conference on Natural Language Learning (CoNLL), Edmonton, Canada, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. L. Page, S. Brin, R. Motwani, and T. Winograd. The pagerank citation ranking: Bringing order to the Web.Technical report, Stanford InfoLab, 1999.Google ScholarGoogle Scholar
  14. P. Pantel and M. Pennacchiotti. Espresso: Leveraging generic patterns for automatically harvesting semantic relations. In the Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the ACL, Sydney, Australia, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. F. Peng and A. McCallum. Accurate information extraction from research papers using conditional random fields. In the Proceedings of Human Language Technology Conference and the Conference of North American Chapter of the Association for Computational Linguistics, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. S. Sarawagi and W. W. Cohen. Semi-Markov conditional random fields for information extraction. In the Proceedings of Advances in Neural Information Processing Systems, Vancouver, Canada, 2005.Google ScholarGoogle Scholar
  17. F. Sha and F. Pereira. Shallow parsing with conditional random fields. In the Proceedings of Human Language Technology Conference and the Conference of the North American Chapter of the Association for Computational Linguistics, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. P. P. Talukdar, T. Brants, M. Liberman, and F. Pereira. A context pattern induction method for named entity extraction. In the Proceedings of the 10th Conference on Computational Natural Language Learning (CoNLL-X), New York City, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. P. P. Talukdar, J. Reisinger, M. Pasça, D. Ravichandran, R. Bhagat, and F. Pereira. Weakly-supervised acquisition of labeled class instances using graph random walks. In the Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. R. C. Wang, N. Schlaefer, W. Cohen, and E. Nyberg. Automatic set expansion for list question answering. In the Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Y.-Y. Wang, A. Acero, C. Chelba, B. Frey, and L. Wong. Combination of statistical and rule-based approaches for spoken language understanding. In the Proceedings of the International Conference on Speech and Language Processing, Denver, Colorado, 2002.Google ScholarGoogle Scholar
  22. D. Zhou, O. Bousquet, T. N. Lal, J. Weston, and B. Schölkopf. Learning with local and global consistency. In Advances in Neural Information Processing Systems, volume 16, pages 321--328, 2004.Google ScholarGoogle Scholar
  23. D. Zhou, B. Schölkopf, and T. Hofmann. Semi-supervised learning on directed graphs. In Advances in Neural Information Processing Systems, 2005.Google ScholarGoogle Scholar
  24. X. Zhu. Semi-Supervised Learning with Graphs. PhD thesis, Carnegie Mellon University, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Semi-supervised learning of semantic classes for query understanding: from the web and for the web

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader