Benchmarking Support Vector Machines Implementation Using Multiple Techniques

Sukanya, M. V.; Sathyadevan, Shiju; Sreeveni, U. B. Unmesha

doi:10.1007/978-3-319-11218-3_22

M. V. Sukanya⁷,
Shiju Sathyadevan⁷ &
U. B. Unmesha Sreeveni⁷

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 320))

1887 Accesses
4 Citations

Abstract

Data management becomes a complex task when hundreds of petabytes of data are being gathered, stored and processed on a day to day basis. Efficient processing of the exponentially growing data is inevitable in this context. This paper discusses about the processing of a huge amount of data through Support Vector machine (SVM) algorithm using different techniques ranging from single node Linier implementation to parallel processing using the distributed processing frameworks like Hadoop. Map-Reduce component of Hadoop performs the parallelization process which is used to feed information to Support Vector Machines (SVMs), a machine learning algorithm applicable to classification and regression analysis. Paper also does a detailed anatomy of SVM algorithm and sets a roadmap for implementing the same in both linear and Map-Reduce fashion. The main objective is explain in detail the steps involved in developing an SVM algorithm from scratch using standard linear and Map-Reduce techniques and also conduct a performance analysis across linear implementation of SVM, SVM implementation in single node Hadoop, SVM implementation in Hadoop cluster and also against a proven tool like R, gauging them with respect to the accuracy achieved, their processing pace against varying data sizes, capability to handle huge data volume without breaking etc.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Freenor, M.: An implementation of SVM for botnet detection in Support Vector Machines and Hadoop: Theory vs. Practice
Google Scholar
Sun, Z.: Geoffrey Fox Study on Parallel SVM Based on MapReduce Key Laboratory for Computer Network of Shandong Province, Shandong Computer Science Center, Jinan, Shandong, 250014, China 2School of Informatics and Computing, Pervasive Technology Institute, Indiana University Bloomington, Bloomington, Indiana, 47408, USA
Google Scholar
Srinivas, R.: Managing Large Sets Using Support Vector Machines. University of Nebraska at Lincoln
Google Scholar
Yu, H., Yang, J., Han, J.: Classifying Large Data Sets Using SVMs with Hierarchical Clusters (Department of Computer Science University of Illinois Urbana-Champaign, IL 61801 USA)
Google Scholar
Pontil, M., Verri, A.: Properties of Support Vector Machines Massachusetts institute of technology artificial intelligence laboratory and center for biological and computational learning department of brain and cognitive sciences
Google Scholar
Chang, E.Y., Zhu, K., Wang, H., Bai, H., Li, J., Qiu, Z., Cui, H.: PSVM: Parallelizing Support Vector Machines on Distributed Computers Google Research, Beijing, China
Google Scholar
Support Vector Machine Tutorial, ung (Ph.D) Dept. of CSIE, CYUT
Google Scholar
Dean, J., Ghemawat, S.: MapReduce: Simplified Data Processing on Large Clusters Google, Inc.
Google Scholar
HDFS Under the Hood, Sanjay Radia Sradia Grid Computing, Hadoop
Google Scholar
Soman, K.P., Loganathan, R., Ajay, V.: Support Vector Machines and Other Kernel Methods by Centre for Excellence in Computational Engineering and Networking. Amrita Vishwa Vidyapeetham, Coimbatore
Google Scholar
Kiran, M., Kumar, A., Mukherjee, S., Prakash, R.: G Verification and Validation of MapReduce Program Model for Parallel Support Vector Machine Algorithm on Hadoop Cluster
Google Scholar
Pechyony, D., Shen, L., Jones, R.: Solving Large Scale Linear SVM with DistributedBlock Minimization
Google Scholar
Bhonde, M., Patil, P.: Efficient Text Classification Model Based on Improved Hyper- sphere Support Vector Machine with Map Reduce and Hadoop
Google Scholar
Pechyony, D., Shen, L., Jones, R.: Solving Large Scale Linear SVM with Distributed Block Minimization
Google Scholar
Yang, H.-C., Dasdan, A., Hsiao, R.-L., Parker, D.S.: Map-Reduce-Merge: Simplified Relational Data Processing on Large Clusters
Google Scholar
Dean, J., Ghemawat, S.: MapReduce: a flexible data processing tool
Google Scholar
Chu, C.-T., Kim, S.K., Lin, Y.A., Yu, Y.Y., Bradsky, G., Ng, A.Y., Olukotun, K.: Map-Reduce For Machine Learning on Multicore
Google Scholar

Download references

Author information

Authors and Affiliations

Amrita Center for Cyber Security, Amrita Vishwa Vidyapeetham, Kollam, India
M. V. Sukanya, Shiju Sathyadevan & U. B. Unmesha Sreeveni

Authors

M. V. Sukanya
View author publications
You can also search for this author in PubMed Google Scholar
Shiju Sathyadevan
View author publications
You can also search for this author in PubMed Google Scholar
U. B. Unmesha Sreeveni
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to M. V. Sukanya .

Editor information

Editors and Affiliations

King Fahd University of Petroleum and Minerals, Dhahran, Saudi Arabia
El-Sayed M. El-Alfy
Indian Institute of Information Technology and Management- - Kerala (IIITM-K), Trivandrum, Kerala, India
Sabu M. Thampi
Faculty of Design, Kyushu University, Fukuoka, Japan
Hideyuki Takagi
Department of Information Systems and Operations Management, University of Florida, Warrington College of Business, Florida, Florida, USA
Selwyn Piramuthu
University of Applied Sciences, Institute for Information Systems, Olten, Switzerland
Thomas Hanne

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sukanya, M.V., Sathyadevan, S., Sreeveni, U.B.U. (2015). Benchmarking Support Vector Machines Implementation Using Multiple Techniques. In: El-Alfy, ES., Thampi, S., Takagi, H., Piramuthu, S., Hanne, T. (eds) Advances in Intelligent Informatics. Advances in Intelligent Systems and Computing, vol 320. Springer, Cham. https://doi.org/10.1007/978-3-319-11218-3_22

Download citation

DOI: https://doi.org/10.1007/978-3-319-11218-3_22
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-11217-6
Online ISBN: 978-3-319-11218-3
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics