Harmonizing WordNet and FrameNet

  • Conference paper
Advances in Natural Language Processing (NLP 2010)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 6233)

Abstract

Lexical semantic resources are a key component of many NLP systems, whose performance continues to be limited by the “lexical bottleneck”. Two large hand-constructed resources, WordNet and FrameNet, differ in their theoretical foundations and their approaches to the representation of word meaning. A core question that both resources address is, how can regularities in the lexicon be discovered and encoded in a way that allows both human annotators and machines to better discriminate and interpret word meanings?

WordNet organizes the bulk of the English lexicon into a network (an acyclic graph) of word form-meaning pairs that are interconnected via directed arcs that express paradigmatic semantic relations. This classification largely disregards syntagmatic properties such as argument selection for verbs. However, a comparison with a syntax-based approach like Levin (1993) reveals some overlap as well as systematic divergences that can be straightforwardly ascribed to the different classification principles. FrameNet’s units are cognitive schemas (Frames), each characterized by a set of lexemes from different parts of speech with Frame-specific meanings (lexical units) and roles (Frame Elements). FrameNet also encodes cross-frame relations that parallel the relations among WordNet’s synsets.

Given the somewhat complementary nature of the two resources, an alignment would have at least the following potential advantages: (1) both sense inventories are checked and corrected where necessary, and (2) FrameNet’s coverage (lexical units per Frame) can be increased by taking advantage of WordNet’s class-based organization. A number of automatic alignments have been attempted, with variations on a few intuitively plausible algorithms. Often, the result is limited, as implicit assumptions concerning the systematicity of WordNet’s encoding or the semantic correspondences across the resources are not fully warranted. Thus, not all members of a synonym set or a subsumption tree are necessarily Frame mates.
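
The coverage idea in (2), and the caveat that synonym-set members are not necessarily Frame mates, can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the synset and frame names, their members, and the function `propose_frame_mates` are invented toy data and do not reproduce entries from WordNet or FrameNet, nor any published alignment algorithm.

```python
from collections import defaultdict

# Hypothetical toy data, invented for illustration only.
# Each synset is a set of lemmas treated as synonyms.
SYNSETS = {
    "toy_syn_buy": {"buy", "purchase"},
    "toy_syn_order": {"order", "tell", "enjoin"},
}

# Each frame lists the lemmas it already covers as lexical units.
FRAMES = {
    "Toy_commerce": {"buy"},
    "Toy_request": {"order"},
}


def propose_frame_mates(synsets, frames):
    """Propose new lexical units for each frame.

    A lemma is proposed when it shares a synset with a lemma that is
    already a lexical unit of the frame. The output is a list of
    candidates only; it is not a verified alignment.
    """
    proposals = defaultdict(set)
    for frame, units in frames.items():
        for members in synsets.values():
            if members & units:  # the synset overlaps the frame
                proposals[frame] |= members - units
    return {frame: sorted(lemmas) for frame, lemmas in proposals.items()}


candidates = propose_frame_mates(SYNSETS, FRAMES)
for frame, lemmas in candidates.items():
    print(frame, "->", lemmas)
```

In this toy setting the procedure proposes every remaining synset member for the overlapping frame. Whether each proposal really evokes the frame is exactly what the naive procedure cannot decide, so its candidates would still need manual checking.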

We carry out a manual alignment of selected word forms against tokens in the American National Corpus that can serve as a basis for semi-automatic alignment. This work addresses a persistent, unresolved question, namely, to what extent can humans select, and agree on, the context-appropriate meaning of a word with respect to a lexical resource? We discuss representative cases, their challenges and solutions for alignment as well as initial steps for semi-automatic alignment.
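
One common way to quantify how far two annotators agree on sense choices is Cohen's kappa, which corrects raw agreement for chance. The abstract does not say which agreement measure is used, so the sketch below is only an assumed example; the function name `cohen_kappa` and the sense labels are hypothetical.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators labelling the same tokens."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("need two non-empty label lists of equal length")
    n = len(labels_a)
    # Proportion of tokens on which both annotators chose the same sense.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Agreement expected by chance, from each annotator's label distribution.
    counts_a = Counter(labels_a)
    counts_b = Counter(labels_b)
    expected = sum(counts_a[label] * counts_b[label] for label in counts_a) / (n * n)
    if expected == 1.0:
        # Both annotators used a single identical label throughout.
        return 1.0
    return (observed - expected) / (1 - expected)


# Hypothetical sense labels chosen by two annotators for four tokens.
annotator_1 = ["sense_1", "sense_1", "sense_2", "sense_2"]
annotator_2 = ["sense_1", "sense_2", "sense_2", "sense_2"]
kappa = cohen_kappa(annotator_1, annotator_2)
print(round(kappa, 3))
```

Here the annotators agree on three of four tokens (0.75 observed agreement) while chance agreement is 0.5, so kappa comes out at 0.5.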

(Joint work with Collin Baker and Nancy Ide)

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Fellbaum, C.D. (2010). Harmonizing WordNet and FrameNet. In: Loftsson, H., Rögnvaldsson, E., Helgadóttir, S. (eds) Advances in Natural Language Processing. NLP 2010. Lecture Notes in Computer Science, vol 6233. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-14770-8_2

  • DOI: https://doi.org/10.1007/978-3-642-14770-8_2

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-14769-2

  • Online ISBN: 978-3-642-14770-8

  • eBook Packages: Computer Science, Computer Science (R0)
