Choosing Feature Sets for Training and Testing Self-Organising Maps: A Case Study

Ahmad, Khurshid; Vrusias, Bogdan L.; Ledford, Anthony

doi:10.1007/s005210170018


Published: April 2001

Neural Computing & Applications, Volume 10, pages 56–66 (2001)



Statistical pattern recognition techniques, of which supervised and unsupervised classification are two good examples, rely on the computation of similarity and distance metrics. The distances are computed in a multi-dimensional space whose axes, in principle, relate to the features inherent in the input data. Usually, such features are chosen by neural network developers, thereby introducing a possible bias. A method of automatically generating feature sets is discussed, with specific reference to the categorisation of streams of free-text news items. The feature sets were generated by a procedure that automatically selects a group of keywords based on a lexico-semantic analysis. Three different types of text streams (headlines only, news summaries, and full news items including the body of the text) have been categorised using Self-Organising Feature Maps (SOFM). A method for assessing the discrimination ability of a SOFM, based on Fisher’s Linear Discriminant Rule, suggests that maps trained only on vectors related to summaries provide fairly accurate clustering when compared with maps trained on vectors related to full text. The use of summaries as document surrogates for document categorisation is suggested.
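The pipeline the abstract describes (keyword-based feature vectors, distance computation, a self-organising map, and a Fisher-style separation measure) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not the authors' implementation: the function names, the relative-frequency weighting, the 3×3 map, the linearly decaying learning rate and neighbourhood radius, and the use of the one-dimensional Fisher criterion in place of the paper's actual assessment procedure.

```python
import math
import random
import statistics


def keyword_vector(text, keywords):
    """Relative frequency of each keyword in the text (one axis per keyword)."""
    tokens = text.lower().split()
    total = len(tokens) or 1  # avoid division by zero on empty text
    return [tokens.count(k) / total for k in keywords]


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def best_matching_unit(weights, vec):
    """Grid position (row, col) of the node whose weights are closest to vec."""
    return min(weights, key=lambda rc: euclidean(weights[rc], vec))


def train_som(vectors, rows=3, cols=3, epochs=50, lr0=0.5, radius0=1.5, seed=0):
    """Train a small rectangular map; hyperparameters are illustrative only."""
    rng = random.Random(seed)
    dim = len(vectors[0])
    weights = {(r, c): [rng.random() for _ in range(dim)]
               for r in range(rows) for c in range(cols)}
    for epoch in range(epochs):
        frac = 1.0 - epoch / epochs          # decays linearly from 1 towards 0
        lr = lr0 * frac
        radius = max(radius0 * frac, 0.5)    # keep the neighbourhood non-degenerate
        for vec in vectors:
            br, bc = best_matching_unit(weights, vec)
            for (r, c), w in weights.items():
                grid_dist2 = (r - br) ** 2 + (c - bc) ** 2
                influence = math.exp(-grid_dist2 / (2 * radius ** 2))
                for i in range(dim):
                    # Pull the node towards the input, scaled by grid proximity.
                    w[i] += lr * influence * (vec[i] - w[i])
    return weights


def fisher_ratio(group_a, group_b):
    """One-dimensional Fisher criterion: squared mean gap over summed variances."""
    gap = (statistics.mean(group_a) - statistics.mean(group_b)) ** 2
    spread = statistics.variance(group_a) + statistics.variance(group_b)
    return math.inf if spread == 0 else gap / spread
```

A hypothetical usage would build one vector per headline, summary, or full text with `keyword_vector`, train a separate map on each stream with `train_som`, and compare how well known categories separate on each map, for example by applying `fisher_ratio` to a one-dimensional projection of the map positions of two categories. A larger ratio indicates better separation under this simplified measure.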


Authors and Affiliations

AI Group, Department of Computing, University of Surrey, Guildford, UK
Khurshid Ahmad & Bogdan L. Vrusias
Department of Mathematics and Statistics, University of Surrey, Guildford, UK
Anthony Ledford


About this article

Cite this article

Ahmad, K., Vrusias, B. & Ledford, A. Choosing Feature Sets for Training and Testing Self-Organising Maps: A Case Study. Neural Computing & Applications 10, 56–66 (2001). https://doi.org/10.1007/s005210170018



Keywords: Automatic classification; Kohonen map; Linear discriminant rule; SOFM; Text classification; Training NN; Weirdness coefficient
