
Some Fitting of Naive Bayesian Spam Filtering for Japanese Environment

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 3325)

Abstract

Bayesian filtering is one of the best-known anti-spam measures. However, there is no standard way for a Bayesian filter to handle Japanese email. In this paper, we compare several candidate approaches to tokenization and corpus separation for Japanese email. We also report experimental results and the observations drawn from them.
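As a rough illustration of why tokenization matters for Japanese text, which is written without spaces between words, the sketch below scores a message with a simplified naive Bayes rule over overlapping character bigrams. It is only a sketch under stated assumptions and not the method evaluated in the paper: the bigram tokenizer, the Laplace smoothing, the function names (`char_bigrams`, `train`, `spam_probability`), and the four toy messages are all hypothetical choices made for this example. It does not cover corpus separation.

```python
import math
from collections import Counter


def char_bigrams(text):
    """Tokenize by overlapping character bigrams, ignoring whitespace.

    Assumption: bigrams stand in for a real tokenizer. A morphological
    analyzer could be substituted here without changing the rest.
    """
    chars = [c for c in text if not c.isspace()]
    return ["".join(chars[i:i + 2]) for i in range(len(chars) - 1)]


def train(messages):
    """Count, for each token, the number of messages that contain it."""
    counts = Counter()
    for msg in messages:
        counts.update(set(char_bigrams(msg)))
    return counts


def spam_probability(text, spam_counts, ham_counts, n_spam, n_ham):
    """Simplified naive Bayes posterior P(spam | tokens present in text).

    Uses Laplace (add-one) smoothing and only the tokens that occur in
    the message; tokens are treated as conditionally independent.
    """
    log_odds = math.log(n_spam / n_ham)  # prior odds from corpus sizes
    for token in set(char_bigrams(text)):
        p_spam = (spam_counts[token] + 1) / (n_spam + 2)
        p_ham = (ham_counts[token] + 1) / (n_ham + 2)
        log_odds += math.log(p_spam / p_ham)
    return 1.0 / (1.0 + math.exp(-log_odds))


# Toy training data (hypothetical, for illustration only).
spam_msgs = ["無料で今すぐ登録", "今すぐ無料プレゼント"]
ham_msgs = ["明日の会議の資料です", "会議は午後三時からです"]

spam_counts = train(spam_msgs)
ham_counts = train(ham_msgs)

p_spammy = spam_probability("無料プレゼント", spam_counts, ham_counts,
                            len(spam_msgs), len(ham_msgs))
p_hammy = spam_probability("会議の資料", spam_counts, ham_counts,
                           len(spam_msgs), len(ham_msgs))
print(p_spammy > 0.5, p_hammy < 0.5)
```

Because the tokenizer is isolated in one function, different tokenization strategies can be swapped in and compared on the same training data while the scoring code stays fixed.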




Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Iwanaga, M., Tabata, T., Sakurai, K. (2005). Some Fitting of Naive Bayesian Spam Filtering for Japanese Environment. In: Lim, C.H., Yung, M. (eds) Information Security Applications. WISA 2004. Lecture Notes in Computer Science, vol 3325. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-31815-6_12


  • DOI: https://doi.org/10.1007/978-3-540-31815-6_12

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-24015-0

  • Online ISBN: 978-3-540-31815-6

  • eBook Packages: Computer Science, Computer Science (R0)
