Multi-class Bayesian support vector data description with anomalies

Turkoz, Mehmet; Kim, Sangahn

doi:10.1007/s10479-021-04364-x

Multi-class Bayesian support vector data description with anomalies

Original Research
Published: 10 November 2021

Volume 317, pages 287–312, (2022)
Cite this article

Annals of Operations Research Aims and scope Submit manuscript

Mehmet Turkoz¹ &
Sangahn Kim²

We’re sorry, something doesn't seem to be working properly.

Please try refreshing the page. If that doesn't work, please contact support so we can address the problem.

Abstract

Support vector data description (SVDD) procedure fits a spherically shaped boundary around the normal data by minimizing the volume of the description. However, the SVDD may not find an efficient boundary if the normal data consist of multiple classes. In addition to the multi-class normal data, some anomaly observations can be available. We propose a generalized SVDD procedure which finds multiple spheres around the multi-class data by incorporating the anomaly observations into the training procedure. Thus, descriptions for each class include as many as their corresponding class observations by keeping the other class and anomaly observations as far as possible. Moreover, we introduce a generalized Bayesian framework which utilizes the relationships among the classes by not only considering the prior information from normal classes but also the anomaly class. Experiments with various simulation studies and real-life applications demonstrate that the proposed approach can effectively identify the anomalies in multi-class data.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

One-Class Classification Criterion Robust to Anomalies in Training Dataset

A fast and noise resilient cluster-based anomaly detection

Article 20 May 2015

Unsupervised Parameter Estimation for One-Class Support Vector Machines

References

Aleskerov, E., Freisleben, B. & Rao, B. (1997). Cardwatch: A neural network-based database mining system for credit card fraud detection. In Proceedings of the IEEE/IAFE 1997 IEEE computational intelligence for financial Engineering (CIFEr (pp. 220–226).
Amer, M., Goldstein, M. & Abdennadher, S., (2013). Enhancing one-class support vector machines for unsupervised anomaly detection. In Proceedings of the ACM SIGKDD workshop on outlier detection and description ACM (pp. 8–15).
Azzalini, A., & Capitanio, A. (1999). Statistical applications of the multivariate skew normal distribution. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 61, 579–602.
Article Google Scholar
Azzalini, A., & Dalla Valle, A. (1996). The multivariate skew-normal distribution. Biometrika, 83, 715–726.
Article Google Scholar
Bovolo, F., Camps-Valls, G., & Bruzzone, L. (2010). A support vector domain method for change detection in multitemporal images. Pattern Recognition Letters, 31, 1148–1154.
Article Google Scholar
Chandola, V., Banerjee, A., & Kumar, V. (2009). Anomaly detection: A survey. ACM Computing Surveys (CSUR), 41, 15.
Article Google Scholar
Demšar, J. (2006). Statistical comparisons of classifiers over multiple data sets. The Journal of Machine Learning Research, 7, 1–30.
Google Scholar
Duin, R., Juszczak, P., Paclik, P., Pekalska, E., De Ridder, D., Tax, D., & Verzakov, S. (2000). A matlab toolbox for pattern recognition. Prtools Version, 3, 109–111.
Google Scholar
Erfani, S. M., Rajasegarar, S., Karunasekera, S., & Leckie, C. (2016). High-dimensional and large-scale anomaly detection using a linear one-class SVM with deep learning. Pattern Recognition, 58, 121–134.
Article Google Scholar
Ghasemi, A., Rabiee, H. R., Manzuri, M. T. & Rohban, M. H. (2016). A bayesian approach to the data description problem. arXiv preprint arXiv:1602.07507
Guo, S. M., Chen, L. C., & Tsai, J. S. H. (2009). A boundary method for outlier detection based on support vector domain description. Pattern Recognition, 42(1), 77–83.
Article Google Scholar
Hodge, V., & Austin, J. (2004). A survey of outlier detection methodologies. Artificial Intelligence Review, 22, 85–126.
Article Google Scholar
Huang, G., Chen, H., Zhou, Z., Yin, F., & Guo, K. (2011). Two-class support vector data description. Pattern Recognition, 44, 320–329.
Article Google Scholar
Kang, J. H., & Kim, S. B. (2013). A clustering algorithm-based control chart for inhomogeneously distributed TFT-LCD processes. International Journal of Production Research, 51(18), 5644–5657.
Article Google Scholar
Kang, P., & Cho, S. (2012). Support vector class description (SVCD): Classification in kernel space. Intelligent Data Analysis, 16, 351–364.
Article Google Scholar
Kumar, V. (2005). Parallel and distributed computing for cybersecurity. IEEE Distributed System Online, 6, 10.
Article Google Scholar
Lee, K., Kim, D.-W., Lee, D., & Lee, K. H. (2005). Improving support vector data description using local density degree. Pattern Recognition, 38, 1768–1771.
Article Google Scholar
Lee, K., Kim, D.-W., Lee, K. H., & Lee, D. (2007). Density-induced support vector data description. IEEE Transactions on Neural Networks, 18, 284–289.
Article Google Scholar
Lee, S.-W., Park, J., & Lee, S.-W. (2006). Low resolution face recognition based on support vector data description. Pattern Recognition, 39, 1809–1812.
Article Google Scholar
Li, K.-L., Huang, H.-K., Tian, S.-F. & Xu, W. (2003). Improving one-class SVM for anomaly detection. In 2003 International conference on machine learning and cybernetics. IEEE (pp. 3077–3081).
Moya. M., Koch M. & Hostetler L. (1993). One-class classifier networks for target recognition applications. In Proceedings of the world congresson neural networks, Portland (pp. 797–801).
Mu, T., & Nandi, A. K. (2009). Multiclass classification based on extended support vector data description. IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 39, 1206–1216.
Article Google Scholar
Ning, X., & Tsung, F. (2013). Improved design of kernel distance-based charts using support vector methods. IIE Transactions, 45, 464–476.
Article Google Scholar
Phaladiganon, P., Kim, S. B., & Chen, V. C. (2014). A density-focused support vector data description method. Quality and Reliability Engineering International, 30(6), 879–890.
Article Google Scholar
Schölkopf, B., Platt, J. C., Shawe-Taylor, J., Smola, A. J., & Williamson, R. C. (2001). Estimating the support of a high-dimensional distribution. Neural Computation, 13, 1443–1471.
Article Google Scholar
Sotiris, V. A., Peter, W. T., & Pecht, M. G. (2010). Anomaly detection through a bayesian support vector machine. IEEE Transactions on Reliability, 59, 277–286.
Article Google Scholar
Spence, C., Parra, L., & Sajda, P. (2001). Detection, synthesis and compression in mammographic image analysis with a hierarchical image probability model. In Proceedings of the IEEE workshop on mathematical methods in biomedical image analysis. IEEE Computer Society, 3.
Tax, D. M., & Duin, R. P. (1999). Support vector domain description. Pattern Recognition Letters, 20, 1191–1199.
Article Google Scholar
Tax, D. M., & Duin, R. P. (2004). Support vector data description. Machine Learning, 54, 45–66.
Article Google Scholar
Thornhill, N. F., Patwardhan, S. C., & Shah, S. L. (2008). A continuous stirred tank heater simulation model with applications. Journal of Process Control, 18, 347–360.
Article Google Scholar
Turkoz, M., Kim, S., Son, Y., Jeong, M. K. & Elsayed, E. A. (2020). Generalized support vector data description for anomaly detection. Pattern Recognition, 100, 107119.
Turkoz, M., Kim, S., Jeong, Y. S., Al-Khalifa, K. N., & Hamouda, A. M. (2016). Distribution-free adaptive step-down procedure for fault identification. Quality and Reliability Engineering International, 32(8), 2701–2716.
Article Google Scholar
Turkoz, M., Kim, S., Jeong, Y. S., Jeong, M. K., Elsayed, A. E., Al-Khalifa, K. N., & Hamouda, A. M. (2019). Bayesian framework for fault variable identification. Journal of Quality Technology, 51(4), 375–391.
Article Google Scholar
Vapnik, V. (1995). The nature of statistical learning theory. Springer.
Zhang, Y., Lu, H., Zhang, L., & Ruan, X. (2016). Combining motion and appearance cues for anomaly detection. Pattern Recognition, 51, 443–452.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Marketing, Management, and Professional Sales, William Paterson University, Wayne, NJ, USA
Mehmet Turkoz
Department of Business Analytics and Actuarial Science, Siena College, Loudonville, NY, USA
Sangahn Kim

Authors

Mehmet Turkoz
View author publications
You can also search for this author in PubMed Google Scholar
Sangahn Kim
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sangahn Kim.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Turkoz, M., Kim, S. Multi-class Bayesian support vector data description with anomalies. Ann Oper Res 317, 287–312 (2022). https://doi.org/10.1007/s10479-021-04364-x

Download citation

Accepted: 19 October 2021
Published: 10 November 2021
Issue Date: October 2022
DOI: https://doi.org/10.1007/s10479-021-04364-x

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Multi-class Bayesian support vector data description with anomalies

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

One-Class Classification Criterion Robust to Anomalies in Training Dataset

A fast and noise resilient cluster-based anomaly detection

Unsupervised Parameter Estimation for One-Class Support Vector Machines

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now