Skip to main content

Extracting Semantic Representations from Large Text Corpora

  • Conference paper

Part of the book series: Perspectives in Neural Computing ((PERSPECT.NEURAL))

Abstract

Many connectionist language processing models have now reached a level of detail at which more realistic representations of semantics are required. In this paper we discuss the extraction of semantic representations from the word co-occurrence statistics of large text corpora and present a preliminary investigation into the validation and optimisation of such representations. We find that there is significantly more variation across the extraction procedures and evaluation criteria than is commonly assumed.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Battig WF and Montague WE. Category norms for verbal items in 56 categories: A replication and extension of the Connecticut category norms. Journal of Experimental Psychology Monograph 1969; 80

    Google Scholar 

  2. Bullinaria JA. Modelling Reading, Spelling and Past Tense Learning with Artificial Neural Networks. Brain and Language 1997; in press

    Google Scholar 

  3. Bullinaria JA. Modelling Lexical Decision: Who needs a lexicon? In Keating JG. (Ed) Neural Computing Research and Applications III, 62–69. Maynooth, Ireland: St. Patrick’s College, 1995

    Google Scholar 

  4. Bullinaria JA. Connectionist Models of Reading: Incorporating Semantics. In Proceedings of the First European Workshop on Cognitive Modelling, 224–229, Berlin: Technische Universitat Berlin, 1996

    Google Scholar 

  5. Bullinaria JA and Huckle CC. Modelling Lexical Decision Using Corpus Derived Semantic Representations in a Connectionist Network. In Proceedings of the Fourth Neural Computational and Psychology Workshop 1997

    Google Scholar 

  6. Coltheart M, Curtis B, Atkins P and Haller M. Models of Reading Aloud: Dual-Route and Parallel-Distributed-Processing Approaches, Psychological Review 1993; 100: 589–608

    Article  Google Scholar 

  7. Hinton GE and Shallice T. Lesioning an Attractor Network: Investigations of Acquired Dyslexia. Psychological Review 1991; 98: 74–95

    Article  Google Scholar 

  8. Landauer TK and Dumais ST. A Solution to Plato’s Problem: The Latent Semantic Analysis Theory of Acquisition, Induction and Representation of Knowledge. Psychological Review 1997; 104: 211–240

    Article  Google Scholar 

  9. Leech G. 100 million words of English: the British National Corpus. Language Research 1992, 28:1–13

    Google Scholar 

  10. Levy JP, Bullinaria JA and Patel M. Evaluating the Use of Word Co-Occurrence Statistics as Semantic Representations, in preparation

    Google Scholar 

  11. Lund K, Burgess C and Atchley RA. Semantic and Associative Priming in High-dimensional Semantic Space. In Moore JD and Lehman JF (Eds), Proceedings of the Seventeenth Annual Meeting of the Cognitive Science Society, 660–665. Lawrence Erlbaum Associates, Pittsburgh PA 1995

    Google Scholar 

  12. Lund K and Burgess C. Producing High-dimensional Semantic Spaces from Lexical Co-occurrence. Behaviour Research Methods, Instruments and Computers 1996; 2: 203–208

    Article  Google Scholar 

  13. Miller GA and Fellbaume C. Semantic networks of English. Cognition 1991; 41: 197–229

    Article  Google Scholar 

  14. Moss HE, Ostrin RK, Tyler LK and Marslen-Wilson WD. Accessing Different Types of Lexical Semantic Information: Evidence From Priming. Journal of Experimental Psychology: Learning, Memory and Cognition 1995; 21: 863–883

    Article  Google Scholar 

  15. Patel M. Using Neural Nets to Investigate Lexical Analysis. PRICAI’96: Topics in Artificial Intelligence 1996; 241–252

    Google Scholar 

  16. Plaut DC. Semantic and Associative Priming in a Distributed Attractor Network. Proceedings of the Seventeenth Annual Conference of the Cognitive Science Society 1995; 37–42

    Google Scholar 

  17. Plaut DC and Shallice T. Deep Dyslexia: A case study of connectionist neuropsychology. Cognitive Neuropsychology 1993; 10: 377–500

    Article  Google Scholar 

  18. Plaut DC, McClelland JL, Seidenberg MS and Patterson KE. Understanding Normal and Impaired Word Reading: Computational Principles in Quasi-Regular Domains. Psychological Review 1996; 103: 56–115

    Article  Google Scholar 

  19. Schutze H. Word Space. In Hanson SJ, Cowan JD and Giles CL (Eds), Advances in Neural Information Processing Systems 5, 895–902. Morgan Kaufmann, San Mateo CA, 1993.

    Google Scholar 

  20. Seidenberg MS and McClelland JL. A Distributed, Developmental Model of Word Recognition and Naming. Psychological Review 1989; 96: 523–568

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1998 Springer-Verlag London Limited

About this paper

Cite this paper

Patel, M., Bullinaria, J.A., Levy, J.P. (1998). Extracting Semantic Representations from Large Text Corpora. In: Bullinaria, J.A., Glasspool, D.W., Houghton, G. (eds) 4th Neural Computation and Psychology Workshop, London, 9–11 April 1997. Perspectives in Neural Computing. Springer, London. https://doi.org/10.1007/978-1-4471-1546-5_16

Download citation

  • DOI: https://doi.org/10.1007/978-1-4471-1546-5_16

  • Publisher Name: Springer, London

  • Print ISBN: 978-3-540-76208-9

  • Online ISBN: 978-1-4471-1546-5

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics