Skip to main content

Modeling Grammars with Knowledge Representation Methods: Subcategorization as a Test Case

  • Conference paper
  • First Online:
The Semantic Web: ESWC 2023 Satellite Events (ESWC 2023)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13998))

Included in the following conference series:

  • 506 Accesses

Abstract

An OWL ontology is used to model a grammar that accounts for subcategorization, showing that ontologies are able to generate (mildly) context-sensitive languages. Semantic Web knowledge representation methods offer a useful way to model the implicit knowledge that defines human linguistic abilities. When a grammar is modeled as a set of ontological constraints (i.e. classes with restrictions on their properties), ungrammatical sentences are defined as facts that lead to inconsistencies which can be discovered by a reasoner. Property chains are used to “pass on” the category of a syntactic complement as the value of a head’s subcategorization feature, modeling the concept of structure sharing that is central to constraint-based theories of syntax like HPSG. By treating utterances as instances and syntactic constraints as axioms, this approach offers points of contact with efforts to model grammars as Linguistic Linked Open Data in the Semantic Web.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 59.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 74.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    As instances, constituents need to be given unique identifiers, like “Mocked_Hook”, and not generic class names like VP.

  2. 2.

    This is similar to the use of syntactic frames in LexInfo [8].

  3. 3.

    https://github.com/RaulAranovich/OnSyDE/blob/main/OnSyDE.owl.

  4. 4.

    My treatment of individual utterances as instances is similar to efforts to serialize syntactically-annotated corpora as RDF documents, sharable as Linguistic Linked Data [11, 12].

References

  1. Joshi, A.K., Shanker, K.V., Weir, D.: The Convergence of Mildly Context-Sensitive Grammar Formalisms. University of Pennsylvania Department of Computer and Information Science Technical Report No. MS-CIS-90-01 (1990)

    Google Scholar 

  2. Hitzler, P.: A review of the semantic web field. Commun. ACM 64(2), 76–83 (2021)

    Article  Google Scholar 

  3. Cimiano, P., Unger, C., McCrae, J.: Ontology-Based Interpretation of Natural Language. Morgan & Claypool, San Rafael (2014)

    Google Scholar 

  4. Schalley, A.C.: Ontologies and ontological methods in linguistics. Lang. Linguist. Compass 13(11) (2019). https://doi.org/10.1111/lnc3.12356

  5. Pollard, C., Sag, I.: Head-Driven Phrase Structure Grammar. University of Chicago Press, Chicago (1994)

    Google Scholar 

  6. Copestake, A.: Implementing Typed Feature Structure Grammars. CSLI Publications, Stanford (2002)

    Google Scholar 

  7. Francez, N., Wintner, S.: Unification Grammars. Cambridge University Press, Cambridge (2012)

    Google Scholar 

  8. Cimiano, P., Buitelaar, P., McCrae, J., Sintek, M.: LexInfo: a declarative model for the lexicon-ontology interface. J. Web Semant. Sci. Serv. Agents World Wide Web 9(1), 29–51 (2011)

    Article  Google Scholar 

  9. Unger, C., Cimiano, P.: Pythia: compositional meaning construction for ontology-based question answering on the semantic web. In: Muñoz, R., Montoyo, A., Métais, E. (eds.) NLDB 2011. LNCS, vol. 6716, pp. 153–160. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-22327-3_15

    Chapter  Google Scholar 

  10. Farrar, S., Lewis, W.D.: The GOLD community of practice: an infrastructure for linguistic data on the web. Lang. Resour. Eval. 41(1), 45–60 (2007)

    Article  Google Scholar 

  11. Chiarcos, C., Glaser, L.: A tree extension for CoNLL-RDF. In: Proceedings of the 12th Conference on Language Resources and Evaluation, pp. 7161–7169. ELRA (2020)

    Google Scholar 

  12. Chiarcos, C.: POWLA: modeling linguistic corpora in OWL/DL. In: 9th Extended Semantic Web Conference, Heraklion, pp. 225–239 (2012)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Raúl Aranovich .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Aranovich, R. (2023). Modeling Grammars with Knowledge Representation Methods: Subcategorization as a Test Case. In: Pesquita, C., et al. The Semantic Web: ESWC 2023 Satellite Events. ESWC 2023. Lecture Notes in Computer Science, vol 13998. Springer, Cham. https://doi.org/10.1007/978-3-031-43458-7_21

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-43458-7_21

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-43457-0

  • Online ISBN: 978-3-031-43458-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics