Comparison of a SOM based sequence analysis system and naive Bayesian classifier for spam filtering | IEEE Conference Publication | IEEE Xplore

Comparison of a SOM based sequence analysis system and naive Bayesian classifier for spam filtering


Abstract:

The problem introduced by the unsolicited bulk emails, also known as "spam" generates a need for reliable anti-spam filters. In this paper, we design and compare the perf...Show More

Abstract:

The problem introduced by the unsolicited bulk emails, also known as "spam" generates a need for reliable anti-spam filters. In this paper, we design and compare the performance of a newly designed SOM based sequence analysis (SBSA) system for the spam filtering task. The system is based on a SOM based sequential data representation combined with a kNN classifier designed to make use of word sequence information. We compare this system with the traditional baseline method naive Bayesian filter. Three different cost scenarios and suitable cost-sensitive measurements are employed. The results show that the SBSA system is superior to the naive Bayesian filter, particularly when the misclassification cost for non-spam message is high.
Date of Conference: 31 July 2005 - 04 August 2005
Date Added to IEEE Xplore: 27 December 2005
Print ISBN:0-7803-9048-2

ISSN Information:

Conference Location: Montreal, QC, Canada

Contact IEEE to Subscribe

References

References is not available for this document.