Abstract
Bayesian filtering is one of the best-known anti-spam measures. However, there is no standard way of applying it to Japanese email. In this paper, we compare several conceivable approaches to tokenization and corpus separation for Japanese email. We also report experimental results and the findings drawn from them.
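To make the tokenization question concrete, the following is a minimal sketch of a naive Bayesian filter applied to Japanese text, which has no spaces between words. The abstract does not say which tokenizers are compared, so overlapping character bigrams are assumed here purely for illustration. The function names (`bigram_tokens`, `train`, `spam_probability`), the Laplace smoothing, and the toy messages are likewise hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def bigram_tokens(text):
    """Split text into overlapping character bigrams, skipping whitespace.

    Assumption: bigrams stand in for a real Japanese tokenizer.
    """
    chars = [c for c in text if not c.isspace()]
    return ["".join(chars[i:i + 2]) for i in range(len(chars) - 1)]


def train(messages):
    """Count, for each token, the number of messages that contain it."""
    counts = Counter()
    for msg in messages:
        counts.update(set(bigram_tokens(msg)))
    return counts


def spam_probability(text, spam_counts, ham_counts, n_spam, n_ham):
    """Naive Bayes posterior that `text` is spam.

    Uses only the tokens present in `text`, with Laplace smoothing
    (an illustrative choice, not the paper's stated method).
    """
    log_odds = math.log(n_spam / n_ham)  # prior from corpus sizes
    for tok in set(bigram_tokens(text)):
        p_spam = (spam_counts[tok] + 1) / (n_spam + 2)
        p_ham = (ham_counts[tok] + 1) / (n_ham + 2)
        log_odds += math.log(p_spam) - math.log(p_ham)
    # Numerically stable logistic function.
    if log_odds >= 0:
        return 1.0 / (1.0 + math.exp(-log_odds))
    e = math.exp(log_odds)
    return e / (1.0 + e)


# Toy corpora (hypothetical examples, not data from the paper).
spam = ["無料で今すぐ登録", "今すぐ無料プレゼント"]
ham = ["明日の会議の資料です", "会議は来週に延期します"]
spam_counts, ham_counts = train(spam), train(ham)

p_spam_like = spam_probability("無料プレゼント今すぐ", spam_counts, ham_counts, len(spam), len(ham))
p_ham_like = spam_probability("会議の資料", spam_counts, ham_counts, len(spam), len(ham))
```

Swapping `bigram_tokens` for a different tokenizer leaves the rest of the sketch unchanged, which is what makes tokenizer comparisons of the kind the abstract describes straightforward to set up.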
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Iwanaga, M., Tabata, T., Sakurai, K. (2005). Some Fitting of Naive Bayesian Spam Filtering for Japanese Environment. In: Lim, C.H., Yung, M. (eds) Information Security Applications. WISA 2004. Lecture Notes in Computer Science, vol 3325. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-31815-6_12
DOI: https://doi.org/10.1007/978-3-540-31815-6_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24015-0
Online ISBN: 978-3-540-31815-6
eBook Packages: Computer Science (R0)