Classification of RSS feed news items using ontology | IEEE Conference Publication | IEEE Xplore

Classification of RSS feed news items using ontology


Abstract:

The explosive growth of data on the web demands techniques that enable users to access the information they want. Document classification is a prerequisite for information retrieval, and many classification techniques have been and remain in use. Term Frequency-Inverse Document Frequency (TF-IDF) represents documents by the frequency of the terms they contain. A limitation of this approach is the high dimensionality of the resulting data. Moreover, it does not consider the relations among terms, which leads to less precise, noisy results. Our approach uses weighted Concept Frequency-Inverse Document Frequency (CF-IDF), with a domain ontology as background knowledge, to classify RSS feed news items. Metadata of the news items is used to assign weights to the identified concepts. No trained classifier is required, because the ontology itself acts as the classifier. We designed the ontology based on news industry standards. The classification approach considers relations among concepts and properties, which reduces noise in the final output. It considers only the key concepts of a domain rather than all terms, which curbs the dimensionality problem. Evaluation of the experimental results shows that the proposed approach gives better classification results.
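
The abstract does not give the exact weighting scheme or scoring formula, so the following is only a minimal sketch of what a weighted CF-IDF classifier could look like. The toy `ONTOLOGY` mapping, the `FIELD_WEIGHTS` values, and the function names are all hypothetical, and the sketch uses the common frequency-times-log-inverse-document-frequency form rather than the authors' definition. It also ignores relations among concepts, which the described approach takes into account.

```python
import math
from collections import Counter

# Hypothetical toy "ontology": concept -> surface terms that signal it.
# A real domain ontology would also carry relations and properties.
ONTOLOGY = {
    "sport": {"match", "goal", "tournament"},
    "finance": {"market", "stock", "bank"},
}

# Hypothetical metadata weights: a concept found in the title counts more
# than one found in the description. The values are assumptions.
FIELD_WEIGHTS = {"title": 2.0, "description": 1.0}


def concept_counts(item):
    """Weighted concept occurrences for one news item (dict of field -> text)."""
    counts = Counter()
    for field, text in item.items():
        weight = FIELD_WEIGHTS.get(field, 1.0)
        for token in text.lower().split():
            for concept, terms in ONTOLOGY.items():
                if token in terms:
                    counts[concept] += weight
    return counts


def cf_idf(items):
    """Return one {concept: score} dict per item."""
    per_item = [concept_counts(item) for item in items]
    n = len(items)

    # Number of items in which each concept occurs at least once.
    doc_freq = Counter()
    for counts in per_item:
        doc_freq.update(counts.keys())

    scores = []
    for counts in per_item:
        total = sum(counts.values())
        scores.append({
            concept: (weight / total) * math.log(n / doc_freq[concept])
            for concept, weight in counts.items()
        })
    return scores


def classify(items):
    """Label each item with its top-scoring concept, or None if nothing matched."""
    return [max(s, key=s.get) if s else None for s in cf_idf(items)]
```

Only concepts named in the ontology are ever counted, so the feature space stays as small as the ontology itself. The ontology lookup stands in for a trained classifier, which mirrors the claim that no training step is needed.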
Date of Conference: 27-29 November 2012
Date Added to IEEE Xplore: 24 January 2013
Conference Location: Kochi, India
