Skip to main content

Evaluating SOM-based models in Text Classification Tasks for the Greek Language

  • Conference paper
Advances in Self-Organising Maps

Abstract

In the present paper, the Self-Organising Map (SOM) is applied to the problem of categorising a corpus of Modem Greek texts according to the style of their authors. A number of variants of the SOM model are used in a series of experiments, in order to compare and contrast their behaviour in the specific task. The experimental results indicate that the SOM possesses the ability to analyse such data, successfully uncovering the differences among authors.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Mosteller, F. & Wallace, D.L. Applied Bayesian and Classical Inference: The Case of the Federalist Papers, Springer, New York, 1964.

    Google Scholar 

  2. Karlgren, J. Stylistic Experiments in Information Retrieval. In T. Strzalkowski (ed.), Natural Language Information Retrieval, Dordrecht: Kluwer, 1999: 147-166.

    Google Scholar 

  3. Matthews, R. & Merriam, T. Neural Computation in Stylometry I: An Application to the Works of Shakespeare and Fletcher. Literary and Linguistic Computing, 1993; 8(4): 203-209.

    Article  Google Scholar 

  4. Tweedie, F. J., Singh, S. & Holmes, D.I.. Neural Networks in Stylometry: The Federalist Papers. Computers and the Humanities, 1996; 30: 1–10

    Article  Google Scholar 

  5. Kohonen, T. Self-Organising Maps (2nd ed.) Springer-Verlag, Berlin, 1997

    Google Scholar 

  6. Kaski, S., Lagus, K., Honkela, T. & Kohonen, T. Statistical Aspects of the WEBSOM system in Organising Document Collections. Computing Science and Statistics, (Scott, D. W., ed.), Interface Foundation of North America, Inc.: Fairfax Station, VA, 1998: 281–290

    Google Scholar 

  7. Biber, D., Conrad, S. & Reppen, R. Corpus Linguistics: Investigating Language Structure and Use. Cambridge University Press, 1998

    Google Scholar 

  8. Tambouratzis, G., Markantonatou, S., Hairetakis, N. & Carayannis, G. Automatic Style Categorization of Corpora in the Greek Language. Proceedings of the LREC-2000 Conference, Athens, Greece, 31 May - 2 June, 2000; 1: 135-140.

    Google Scholar 

  9. Tambouratzis, G., Markantonatou, S., Hairetakis, N., Vassiliou, M., Tambouratzis, D., & Carayannis, G. Discriminating the registers and styles in the Modern Greek Language. Proceedings of the Workshop on Comparing Corpora, held in conjunction with the 38th ACL Conference, 7 October, Hong Kong, 2000; 35–42

    Google Scholar 

  10. Miikkulainen, R. Script Recognition with Hierarchical Feature Maps. Connection Science, 1990; 2(1-2): 83–101

    Article  Google Scholar 

  11. Merkl, D. Document Classification with Self-Organising Maps. In Kohonen Maps, Oja, E. & Kaski, S., (eds.), Elsevier, Amsterdam, 1999

    Google Scholar 

  12. Vesanto, J. & Alhoniemi, E. Clustering of the Self-Organising Map. IEEE Transactions on Neural Networks, 2000; 11 (3): 586–600

    Article  Google Scholar 

  13. Clairis, C. & Babiniotis, G. Grammar of Modern Greek - II Verbs. Ellinika Grammata, Athens (in Greek), 1999

    Google Scholar 

  14. Vesanto, J., Himberg, J., Alhoniemi, E. & Parhankangas, J. SOM Toolbox for Matlab 5. Report A57. SOM Toolbox Team, Helsinki University of Technology, 2000

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2001 Springer-Verlag London Limited

About this paper

Cite this paper

Tambouratzis, G., Hairetakis, N., Markantonatou, S., Carayannis, G. (2001). Evaluating SOM-based models in Text Classification Tasks for the Greek Language. In: Advances in Self-Organising Maps. Springer, London. https://doi.org/10.1007/978-1-4471-0715-6_35

Download citation

  • DOI: https://doi.org/10.1007/978-1-4471-0715-6_35

  • Publisher Name: Springer, London

  • Print ISBN: 978-1-85233-511-3

  • Online ISBN: 978-1-4471-0715-6

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics