Skip to main content

Unsupervised Feature Adaptation for Cross-Domain NLP with an Application to Compositionality Grading

  • Conference paper
  • 2263 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7816))

Abstract

In this paper, we introduce feature adaptation, an unsupervised method for cross-domain natural language processing (NLP). Feature adaptation adapts a supervised NLP system to a new domain by recomputing feature values while retaining the model and the feature definitions used on the original domain. We demonstrate the effectiveness of feature adaptation through cross-domain experiments in compositionality grading and show that it rivals supervised target domain systems when moving from generic web text to a specialized physics text domain.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Daumé III, H., Marcu, D.: Domain adaptation for statistical classifiers. Journal of Artificial Intelligence Research (JAIR) 26 (2006)

    Google Scholar 

  2. Blitzer, J., McDonald, R., Pereira, F.: Domain adaptation with structural correspondence learning. In: EMNLP, pp. 120–128 (2006)

    Google Scholar 

  3. Biemann, C., Giesbrecht, E.: Distributional semantics and compositionality 2011: Shared task description and results. In: ACL 2011 Workshop on Distributional Semantics and Compositionality, pp. 21–28 (2011)

    Google Scholar 

  4. Sag, I.A., Baldwin, T., Bond, F., Copestake, A., Flickinger, D.: Multiword Expressions: A Pain in the Neck for NLP. In: Gelbukh, A. (ed.) CICLing 2002. LNCS, vol. 2276, pp. 1–15. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  5. Gildea, D.: Corpus variation and parser performance. In: EMNLP, pp. 167–202 (2001)

    Google Scholar 

  6. McClosky, D., Charniak, E., Johnson, M.: Reranking and self-training for parser adaptation. In: ACL/COLING, pp. 337–344 (2006)

    Google Scholar 

  7. Blitzer, J., Dredze, M., Pereira, F.: Biographies, bollywood, boom-boxes and blenders: Domain adaptation for sentiment classification. In: ACL, pp. 440–447 (2007)

    Google Scholar 

  8. Daume III, H., Jagarlamudi, J.: Domain adaptation for machine translation by mining unseen words. In: ACL/HLT, pp. 407–412 (2011)

    Google Scholar 

  9. Baroni, M., Bernardini, S., Ferraresi, A., Zanchetta, E.: The wacky wide web: a collection of very large linguistically processed web-crawled corpora. Language Resources and Evaluation 43(3) (2009)

    Google Scholar 

  10. Lykke, M., Larsen, B., Lund, H., Ingwersen, P.: Developing a Test Collection for the Evaluation of Integrated Search. In: Gurrin, C., He, Y., Kazai, G., Kruschwitz, U., Little, S., Roelleke, T., Rüger, S., van Rijsbergen, K. (eds.) ECIR 2010. LNCS, vol. 5993, pp. 627–630. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  11. Schmid, H.: Probabilistic part-of-speech tagging using decision trees. In: International Conference on New Methods in Language Processing, pp. 44–49 (1994)

    Google Scholar 

  12. Evert, S.: The Statistics of Word Cooccurrences: Word Pairs and Collocations. PhD thesis, Institut für maschinelle Sprachverarbeitung (IMS), Universität Stuttgart (2004)

    Google Scholar 

  13. Manning, C.D., Schütze, H.: Foundations of Statistical Natural Language Processing. MIT Press (1999)

    Google Scholar 

  14. Schone, P., Jurafsky, D.: Is knowledge-free induction of multiword unit dictionary headwords a solved problem? In: EMNLP, pp. 100–108 (2001)

    Google Scholar 

  15. Schütze, H.: Dimensions of meaning. In: 1992 ACM/IEEE Conference on Supercomputing, Supercomputing 1992, pp. 787–796. IEEE (1992)

    Google Scholar 

  16. Sahlgren, M.: The Word-Space Model: Using distributional analysis to represent syntagmatic and paradigmatic relations between words in high-dimensional vector spaces. PhD thesis, Swedish Institute of Computer Science (2006)

    Google Scholar 

  17. Turney, P.D., Pantel, P.: From frequency to meaning: Vector space models of semantics. Journal of Artificial Intelligence Research 37(1) (2010)

    Google Scholar 

  18. Harris, Z.: Distributional structure. Word (1954)

    Google Scholar 

  19. Mitchell, J., Lapata, M.: Composition in distributional models of semantics. Cognitive Science 34(8) (2010)

    Google Scholar 

  20. Baldwin, T., Bannard, C., Tanaka, T., Widdows, D.: An empirical model of multiword expression decomposability. In: ACL 2003 Workshop on Multiword Expressions, pp. 89–96 (2003)

    Google Scholar 

  21. Michelbacher, L., Kothari, A., Forst, M., Lioma, C., Schütze, H.: A cascaded classification approach to semantic head recognition. In: EMNLP, pp. 793–803 (2011)

    Google Scholar 

  22. Garrido, G., Peñas, A.: Detecting compositionality using semantic vector space models based on syntactic context. shared task system description. In: ACL 2011 Workshop on Distributional Semantics and Compositionality, pp. 43–47 (2011)

    Google Scholar 

  23. Guevara, E.: A regression model of adjective-noun compositionality in distributional semantics. In: 2010 Workshop on Geometrical Models of Natural Language Semantics, pp. 33–37 (2010)

    Google Scholar 

  24. Breiman, L.: Random forests. Machine Learning 45(1) (2001)

    Google Scholar 

  25. Reddy, S., McCarthy, D., Manandhar, S., Gella, S.: Exemplar-based word-space model for compositionality detection: Shared task system description. In: ACL 2011 Workshop on Distributional Semantics and Compositionality, pp. 54–60 (2011)

    Google Scholar 

  26. Johannsen, A., Martinez, H., Rishøj, C., Søgaard, A.: Shared task system description: Frustratingly hard compositionality prediction. In: ACL 2011 Workshop on Distributional Semantics and Compositionality, pp. 29–32 (2011)

    Google Scholar 

  27. Chelba, C., Acero, A.: Adaptation of maximum entropy capitalizer: Little data can help a lot. Computer Speech & Language 20(4) (2006)

    Google Scholar 

  28. Huang, F., Yates, A.: Distributional representations for handling sparsity in supervised sequence-labeling. In: ACL/IJCNLP, pp. 495–503 (2009)

    Google Scholar 

  29. Bertoldi, N., Federico, M.: Domain adaptation for statistical machine translation with monolingual resources. In: Fourth Workshop on Statistical Machine Translation, pp. 167–174 (2009)

    Google Scholar 

  30. Pan, S.J., Ni, X., Sun, J.T., Yang, Q., Chen, Z.: Cross-domain sentiment classification via spectral feature alignment. In: WWW, pp. 751–760 (2010)

    Google Scholar 

  31. Lin, D.: Automatic identification of non-compositional phrases. In: ACL, pp. 317–324 (1999)

    Google Scholar 

  32. Bannard, C., Baldwin, T., Lascarides, A.: A statistical approach to the semantics of verb-particles. In: ACL 2003 Workshop on Multiword Expressions, pp. 65–72 (2003)

    Google Scholar 

  33. McCarthy, D., Keller, B., Carroll, J.: Detecting a continuum of compositionality in phrasal verbs. In: ACL 2003 Workshop on Multiword Expressions, pp. 73–80 (2003)

    Google Scholar 

  34. Katz, G., Giesbrecht, E.: Automatic identification of non-compositional multi-word expressions using latent semantic analysis. In: ACL 2006 Workshop on Multiword Expressions, pp. 12–19 (2006)

    Google Scholar 

  35. Sporleder, C., Li, L.: Unsupervised recognition of literal and non-literal use of idiomatic expressions. In: EACL, pp. 754–762 (2009)

    Google Scholar 

  36. Turian, J., Ratinov, L., Bengio, Y.: Word representations: A simple and general method for semi-supervised learning. In: ACL (2010)

    Google Scholar 

  37. Gliozzo, A., Strapparava, C.: Cross language text categorization by acquiring multilingual domain models from comparable corpora. In: Proceedings of the ACL Workshop on Building and Using Parallel Texts, pp. 9–16. Association for Computational Linguistics (2005)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Michelbacher, L., Han, Q., Schütze, H. (2013). Unsupervised Feature Adaptation for Cross-Domain NLP with an Application to Compositionality Grading. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2013. Lecture Notes in Computer Science, vol 7816. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37247-6_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-37247-6_1

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-37246-9

  • Online ISBN: 978-3-642-37247-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics