Abstract:
Although much research on text classification relies on vector spaces built from word information across the whole text, humans can generally recognize the field of a text by finding specific words. This paper describes what a field-associated term is and how to discover such terms, which exist in any text. Words that can be directly related to field classification are called field association (FA) words. Five criteria for FA terms are defined for hierarchical fields. All FA terms are stored in a field tree, which is used to extract field-coherent passages for document classification. The presented approach is evaluated through simulation on text files from 140 fields within sports and is extended to 197 fields within civil engineering.
Published in: Proceedings of the 2005 International Conference on Active Media Technology, 2005. (AMT 2005).
Date of Conference: 19-21 May 2005
Date Added to IEEE Xplore: 12 September 2005
Print ISBN: 0-7803-9035-0