An Approximate Perspective on Word Prediction in Context: Ontological Semantics Meets BERT

  • Conference paper
Fuzzy Information Processing 2020 (NAFIPS 2020)

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 1337)

Abstract

This paper presents an analysis of BERT, a large neural network model, by placing its capability for word prediction in context under the framework of Ontological Semantics. BERT has reportedly performed well in tasks that require semantic competence without any explicit semantic inductive bias. We posit that word prediction in context can be interpreted as the task of inferring the meaning of an unknown word, a practice employed by several papers that follow the Ontological Semantic Technology (OST) approach to Natural Language Understanding. Using this approach, we deconstruct BERT’s output for an example sentence and interpret it using OST’s fuzziness handling mechanisms, revealing the degree to which each output satisfies the sentence’s constraints.
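
As a rough illustration of what "word prediction in context" produces, the sketch below turns raw scores for a few candidate fillers of a masked slot into a probability distribution and ranks them. It is only a toy under stated assumptions: the example sentence, the candidate words, and the scores are all hypothetical and were not produced by BERT, and the ranking step is not the paper's OST-based interpretation.

```python
import math


def softmax(scores):
    """Convert a dict of raw scores into a probability distribution."""
    highest = max(scores.values())  # subtract the max for numerical stability
    exps = {word: math.exp(s - highest) for word, s in scores.items()}
    total = sum(exps.values())
    return {word: e / total for word, e in exps.items()}


# Hypothetical raw scores for candidate fillers of the masked slot in
# "She drank a cup of [MASK]." These numbers are invented for illustration;
# a real masked language model would score its entire vocabulary.
hypothetical_scores = {"coffee": 4.0, "tea": 3.5, "water": 2.0, "sand": -1.0}

probs = softmax(hypothetical_scores)

# Rank candidates from most to least probable under the toy distribution.
ranked = sorted(probs.items(), key=lambda kv: kv[1], reverse=True)
for word, p in ranked:
    print(f"{word}: {p:.3f}")
```

A ranked list like this is the kind of raw output that can then be inspected to see how well each candidate fits the constraints of the sentence.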

Notes

  1. Is-a relationship, for example: a dog is a mammal.

Author information

Corresponding author

Correspondence to Kanishka Misra.

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

Cite this paper

Misra, K., Rayz, J.T. (2022). An Approximate Perspective on Word Prediction in Context: Ontological Semantics Meets BERT. In: Bede, B., Ceberio, M., De Cock, M., Kreinovich, V. (eds) Fuzzy Information Processing 2020. NAFIPS 2020. Advances in Intelligent Systems and Computing, vol 1337. Springer, Cham. https://doi.org/10.1007/978-3-030-81561-5_14
