Skip to main content

Methodology for Connecting Nouns to Their Modifying Adjectives

  • Conference paper
Book cover Computational Linguistics and Intelligent Text Processing (CICLing 2014)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 8403))

Abstract

Adjectives are words that describe or modify other elements in a sentence. As such, they are frequently used to convey facts and opinions about the nouns they modify. Connecting nouns to the corresponding adjectives becomes vital for intelligent tasks such as aspect-level sentiment analysis or interpretation of complex queries (e.g., "small hotel with large rooms") for fine-grained information retrieval. To respond to the need, we propose a methodology that identifies dependencies of nouns and adjectives by looking at syntactic clues related to part-of-speech sequences that help recognize such relationships. These sequences are generalized into patterns that are used to train a binary classifier using machine learning methods. The capabilities of the new method are demonstrated in two, syntactically different languages: English, the leading language of international discourse, and Hebrew, whose rich morphology poses additional challenges for parsing. In each language we compare our method with a designated, state-of-the-art parser and show that it performs similarly in terms of accuracy while: (a) our method uses a simple and relatively small training set; (b) it does not require a language specific adaptation, and (c) it is robust across a variety of writing styles.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Adler, M.: Hebrew Morphological Disambiguation: An Unsupervised Stochastic Word-based Approach. Ph.D. thesis, Ben-Gurion University of the Negev, Beer-Sheva, Israel (2007)

    Google Scholar 

  2. Adler, M., DahanNetzer, Y., Goldberg, Y., Gabay, D., Elhadad, M.: Tagging a Hebrew Corpus: The Case of Participles. In: LREC (2008)

    Google Scholar 

  3. Adler, M., Elhadad, M.: An Unsupervised Morpheme-Based HMM for Hebrew Morphological Disambiguation. In: Proceedings of COLING-ACL 2006 (2006)

    Google Scholar 

  4. Agichtein, E., Gravano, L.: Snowball: Extracting relations from large plain-text collections. In: Proceedings of the Fifth ACM Conference on Digital Libraries. ACM (2000)

    Google Scholar 

  5. Alfonseca, E., Filippova, K., Delort, J.-Y., Garrido, G.: Pattern learning for relation extraction with a hierarchical topic model. In: Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Short Papers, vol. 2, pp. 54–59. Association for Computational Linguistics (2012)

    Google Scholar 

  6. Banko, M., Etzioni, O., Center, T.: The Tradeoffs Between Open and Traditional Relation Extraction. In: ACL, vol. 8, pp. 28–36 (2008)

    Google Scholar 

  7. Barker, K., Szpakowicz, S.: Semi-automatic recognition of noun modifier relationships. In: Proceedings of the 17th International Conference on Computational Linguistics, vol. 1. Association for Computational Linguistics (1998)

    Google Scholar 

  8. Blair-Goldensohn, S., Hannan, K., McDonald, R., Neylon, T., Reis, G.A., Reynar, J.: Building a sentiment summarizer for local service reviews. In: WWW Workshop on NLP in the Information Explosion Era (NLPIX), New York, NY, USA. ACM (2008)

    Google Scholar 

  9. Buchholz, S., Marsi, E.: CoNLL-X Shared Task on Multilingual Dependency Parsing. In: Proceedings of the Tenth Conference on Computational Natural Language Learning (CoNLL-X), New York, NY (2006)

    Google Scholar 

  10. De Marneffe, M.-C., MacCartney, B., Manning, C.D.: Generating typed dependency parses from phrase structure parses. In: Proceedings of LREC, vol. 6 (2006)

    Google Scholar 

  11. Etzioni, O., Banko, M., Soderland, S., Weld, D.S.: Open information extraction from the web. Communications of the ACM 51(12), 68–74 (2008)

    Article  Google Scholar 

  12. Foster, J., Çetinoglu, Ö., Wagner, J., Le Roux, J., Hogan, S., Nivre, J., Hogan, D., Van Genabith, J.: # hard-to-parse: POS Tagging and Parsing the Twitterverse. In: Proceedings of the Workshop on Analyzing Microtext (AAAI 2011), pp. 20–25 (2011)

    Google Scholar 

  13. Goldberg, Y., Adler, M., Elhadad, M.: EM Can Find Pretty Good HMM POS-Taggers (When Given a Good Start). ACL (2008)

    Google Scholar 

  14. Yoav, G., Elhadad, M.: Hebrew Dependency Parsing: Initial Results. In: Proceedings of IWPT (2009)

    Google Scholar 

  15. Goldberg, Y., Elhadad, M.: Easy first dependency parsing of modern Hebrew. In: Proceedings of the NAACL HLT 2010 First Workshop on Statistical Parsing of Morphologically-Rich Languages. Association for Computational Linguistics (2010)

    Google Scholar 

  16. Guthmann, N., Krymolowski, Y., Milea, A., Winter, Y.: Automatic annotation of morpho-syntactic dependencies in a modern Hebrew treebank. In: Proceedings of TLT (2009)

    Google Scholar 

  17. Hatzivassiloglou, V., Wiebe, J.M.: Effects of adjective orientation and gradability on sentence subjectivity. In: Proceedings of the 18th Conference on Computational Linguistics, vol. 1. Association for Computational Linguistics (2000)

    Google Scholar 

  18. Kim, S.N., Nakov, P.: Large-scale noun compound interpretation using bootstrapping and the Web as a corpus. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics (2011)

    Google Scholar 

  19. Kübler, S.: The PaGe 2008 shared task on parsing German. In: Proceedings of the Workshop on Parsing German, pp. 55–63. Association for Computational Linguistics (2008)

    Google Scholar 

  20. Lin, D.: MINIPAR: a minimalist parser. Maryland linguistics colloquium (1999)

    Google Scholar 

  21. Marcus, M.P., Marcinkiewicz, M.A., Santorini, B.: Building a large annotated corpus of English: The Penn Treebank. Computational Linguistics 19(2), 313–330 (1993)

    Google Scholar 

  22. Minack, E., Demartini, G., Nejdl, W.: Current approaches to search result di-versification. In: First International Workshop on Living Web: Making Web Diversity a True Asset, Washington DC (2009)

    Google Scholar 

  23. Oommen, T., Baise, L.G., Vogel, R.M.: Sampling bias and class imbalance in maximum-likelihood logistic regression. Mathematical Geosciences 43(1), 99–120 (2011)

    Article  MATH  Google Scholar 

  24. Quinlan, J.R.: Induction of decision trees. Machine Learning 1, 81–106 (1986)

    Article  Google Scholar 

  25. Rigoutsos, I., Floratos, A.: Combinatorial pattern discovery in biological sequences: The TEIRESIAS algorithm. Bioinformatics 14(2), 229 (1998)

    Google Scholar 

  26. Rokach, Romano, Maimon: Negation recognition in medical narrative reports. Information Retrieval 11(6), 499–538 (2008)

    Article  Google Scholar 

  27. Swanson, B.: Exploring Syntactic Representations for Native Language Identification. In: NAACL/HLT 2013, p. 146 (2013)

    Google Scholar 

  28. Tsarfaty, R., Seddah, D., Goldberg, Y., Kübler, S., Candito, M., Foster, J., Versley, Y., Rehbein, I., Tounsi, L.: Statistical parsing of morphologically rich languages (SPMRL): what, how and whither. In: Proceedings of the NAACL HLT 2010 First Workshop on Statistical Parsing of Morphologically-Rich Languages, pp. 1–12. Association for Computational Linguistics (2010)

    Google Scholar 

  29. Witten, I.H., Frank, E.: Data Mining: Practical machine learning tools and techniques. Morgan Kaufmann (2005)

    Google Scholar 

  30. Xu, Y., Kim, M.-Y., Quinn, K., Goebel, R., Barbosa, D.: Open Information Extraction with Tree Kernels. In: Proceedings of NAACL-HLT, pp. 868–877 (2013)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ofek, N., Rokach, L., Mitra, P. (2014). Methodology for Connecting Nouns to Their Modifying Adjectives . In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2014. Lecture Notes in Computer Science, vol 8403. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-54906-9_22

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-54906-9_22

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-54905-2

  • Online ISBN: 978-3-642-54906-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics