Reducing Classification Times for Email Spam Using Incremental Multiple Instance Classifiers

  • Conference paper
Information Intelligence, Systems, Technology and Management (ICISTM 2011)

Abstract

Combating spam email is both costly and time-consuming. This paper presents a spam classification algorithm that combines majority voting with a multiple instance approach to determine the final classification. Because the classifier is built from multiple sub-classifiers, it can be updated by replacing an individual sub-classifier. Furthermore, each sub-classifier represents a small fraction of a typical classifier, so it can be trained in less time and with less data. The TREC 2007 spam corpus was used to conduct the experiments.
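The abstract gives no implementation details, so the following is only a minimal sketch of the general idea, not the authors' algorithm. It assumes toy keyword-based sub-classifiers, a strict-majority vote with ties resolved as "ham", and the standard multiple instance assumption that a bag (here, an email split into segments) is spam if any instance is spam. All names (`keyword_classifier`, `VotingEnsemble`, `replace_sub`, `bag_is_spam`) are hypothetical.

```python
from collections import Counter
from typing import Callable, Iterable, List

# Hypothetical type: a sub-classifier maps text to the label "spam" or "ham".
SubClassifier = Callable[[str], str]


def keyword_classifier(keywords: List[str]) -> SubClassifier:
    """Build a toy sub-classifier that flags text containing any keyword."""
    lowered = [k.lower() for k in keywords]

    def classify(text: str) -> str:
        body = text.lower()
        return "spam" if any(k in body for k in lowered) else "ham"

    return classify


class VotingEnsemble:
    """Combine sub-classifiers by majority vote (illustrative assumption)."""

    def __init__(self, subs: Iterable[SubClassifier]) -> None:
        self.subs = list(subs)

    def classify(self, text: str) -> str:
        votes = Counter(sub(text) for sub in self.subs)
        # Strict majority is required for "spam"; ties fall back to "ham".
        return "spam" if votes["spam"] > len(self.subs) / 2 else "ham"

    def replace_sub(self, index: int, new_sub: SubClassifier) -> None:
        """Update the ensemble by swapping out one sub-classifier only."""
        self.subs[index] = new_sub


def bag_is_spam(instances: Iterable[str], ensemble: VotingEnsemble) -> bool:
    """Standard multiple instance assumption: one spam instance marks the bag."""
    return any(ensemble.classify(seg) == "spam" for seg in instances)


ensemble = VotingEnsemble([
    keyword_classifier(["viagra"]),
    keyword_classifier(["lottery"]),
    keyword_classifier(["winner"]),
])
```

In this sketch, calling `ensemble.replace_sub(0, keyword_classifier(["prize"]))` changes the ensemble's behaviour without touching the other two sub-classifiers, which mirrors the incremental-update property the abstract describes.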

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Moh, TS., Lee, N. (2011). Reducing Classification Times for Email Spam Using Incremental Multiple Instance Classifiers. In: Dua, S., Sahni, S., Goyal, D.P. (eds) Information Intelligence, Systems, Technology and Management. ICISTM 2011. Communications in Computer and Information Science, vol 141. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19423-8_20

  • DOI: https://doi.org/10.1007/978-3-642-19423-8_20

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-19422-1

  • Online ISBN: 978-3-642-19423-8

  • eBook Packages: Computer Science, Computer Science (R0)