DOI: 10.1145/1401890.1401980
research-article

Asymmetric support vector machines: low false-positive learning under the user tolerance

Published: 24 August 2008

Abstract

Many practical applications of classification require the classifier to produce a very low false-positive rate. Although the Support Vector Machine (SVM) has been widely applied to these applications due to its superiority in handling high-dimensional data, there has been relatively little effort, other than setting a threshold or changing the costs of slacks, to ensure a low false-positive rate. In this paper, we propose the notion of the Asymmetric Support Vector Machine (ASVM), which takes into account the false positives and the user tolerance in its objective. This new objective formulation allows us to raise the confidence in predicting the positives and therefore obtain a lower chance of false positives. We study the effects of the parameters in the ASVM objective and address some implementation issues related to Sequential Minimal Optimization (SMO) to cope with large-scale data. An extensive simulation is conducted and shows that ASVM is able to yield either a noticeable improvement in performance or a reduction in training time compared to prior art.
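
The thresholding baseline that the abstract contrasts ASVM with can be illustrated with a minimal sketch. It is not the ASVM objective, and nothing in it comes from the paper: it assumes a classifier that outputs real-valued decision scores, a held-out set of scores for true negatives, and a rule that predicts positive only when a score is strictly above the threshold. The function names `threshold_for_tolerance` and `false_positive_rate` and the example scores are hypothetical.

```python
def threshold_for_tolerance(neg_scores, tolerance):
    """Pick a decision threshold from held-out negative scores.

    Returns the smallest threshold among the observed scores such that the
    fraction of negatives scoring strictly above it is at most `tolerance`.
    (Hypothetical helper for illustration; not the paper's method.)
    """
    if not neg_scores:
        raise ValueError("need at least one negative validation score")
    if not 0.0 <= tolerance <= 1.0:
        raise ValueError("tolerance must lie in [0, 1]")
    ranked = sorted(neg_scores, reverse=True)
    # Number of negatives we may allow above the threshold.
    allowed = int(tolerance * len(ranked))
    if allowed >= len(ranked):
        return float("-inf")  # every negative may be flagged
    # At most `allowed` scores are strictly greater than ranked[allowed].
    return ranked[allowed]


def false_positive_rate(neg_scores, threshold):
    """Fraction of negatives predicted positive (score > threshold)."""
    return sum(s > threshold for s in neg_scores) / len(neg_scores)


# Made-up decision scores for ten held-out negative examples.
neg_scores = [-2.1, -1.4, -0.9, -0.3, 0.2, 0.8, -1.1, -0.6, 0.1, 1.5]
t = threshold_for_tolerance(neg_scores, tolerance=0.1)
fpr = false_positive_rate(neg_scores, t)
```

With a tolerance of 0.1 and ten negatives, one negative is allowed above the threshold, so the sketch selects the second-highest negative score. The threshold is applied after training, which is the limitation the abstract points to: the user tolerance never enters the training objective.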


      Published In

      KDD '08: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
      August 2008
      1116 pages
      ISBN:9781605581934
      DOI:10.1145/1401890
      General Chair: Ying Li
      Program Chairs: Bing Liu, Sunita Sarawagi

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Author Tags

      1. classification
      2. low false-positive learning
      3. support vector machine (SVM)

      Qualifiers

      • Research-article

      Conference

      KDD08

      Acceptance Rates

      KDD '08 Paper Acceptance Rate 118 of 593 submissions, 20%;
      Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

      Cited By

      • (2023) On the Theories Behind Hard Negative Sampling for Recommendation. Proceedings of the ACM Web Conference 2023, pp. 812-822. DOI: 10.1145/3543507.3583223. Online publication date: 30-Apr-2023
      • (2023) Software defect prediction model based on improved twin support vector machines. Soft Computing, 27:21, pp. 16101-16110. DOI: 10.1007/s00500-023-07984-6. Online publication date: 1-Apr-2023
      • (2022) AUC Maximization in the Era of Big Data and AI: A Survey. ACM Computing Surveys, 55:8, pp. 1-37. DOI: 10.1145/3554729. Online publication date: 3-Aug-2022
      • (2022) One side class SVM training methods for malware detection. 2022 24th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing (SYNASC), pp. 359-364. DOI: 10.1109/SYNASC57785.2022.00065. Online publication date: Sep-2022
      • (2021) Training over-parameterized models with non-decomposable objectives. Proceedings of the 35th International Conference on Neural Information Processing Systems, pp. 18165-18181. DOI: 10.5555/3540261.3541651. Online publication date: 6-Dec-2021
      • (2021) Apportioned margin approach for cost sensitive large margin classifiers. Annals of Mathematics and Artificial Intelligence. DOI: 10.1007/s10472-021-09776-w. Online publication date: 8-Oct-2021
      • (2019) Big Data, Real-World Data, and Machine Learning. Statistical Methods in Biomarker and Early Clinical Development, pp. 167-195. DOI: 10.1007/978-3-030-31503-0_9. Online publication date: 27-Dec-2019
      • (2017) Learning-based abstractions for nonlinear constraint solving. Proceedings of the 26th International Joint Conference on Artificial Intelligence, pp. 592-599. DOI: 10.5555/3171642.3171728. Online publication date: 19-Aug-2017
      • (2017) Support vector algorithms for optimizing the partial area under the ROC curve. Neural Computation, 29:7, pp. 1919-1963. DOI: 10.1162/NECO_a_00972. Online publication date: 1-Jul-2017
      • (2017) Detecting In Situ Identity Fraud on Social Network Services: A Case Study With Facebook. IEEE Systems Journal, 11:4, pp. 2432-2443. DOI: 10.1109/JSYST.2015.2504102. Online publication date: Dec-2017
