Abstract
These days a lot of raw data is generated from various common sources. This large amount of data, which would appear useless at first glance, is very important for companies and researchers as could provide a lot of helpful information. The data could be mined to get useful knowledge that could be used to make fruitful decisions. A lot of online tools and proprietary toolkits are available to the users and it becomes all the more cumbersome for them to know which is the best tool among these for the supervised learning algorithm and datasets they are applying. In order to aid this process, the paper progresses in this direction by doing a comparison of various data mining tools on the basis of their classification finesse. The various tools used in the paper are weka, knime and tanagra. Rigorous work on this has given the result that the performance of the tools is affected by the kind of datasets used and the way in which the supervised learning is done.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Baker, R.S., Yacef, K.: The state of educational data mining in 2009: a review and future visions. JEDM-J. Educ. Data Min. 1(1), 3–17 (2009)
Bengio, Y., Grandvalet, Y.: No unbiased estimator of the variance of k-fold cross-validation. J. Mach. Learn. Res. 5, 1089–1105 (2004)
bin Othman, M.F., Yau, T.M.S.: Comparison of different classification techniques using WEKA for breast cancer. In: 3rd Kuala Lumpur International Conference on Biomedical Engineering 2006, pp. 520–523. Springer, Berlin, Heidelberg (2007)
Chauhan, N., Gautam, N.: Parametric comparison of data mining tools (2015)
David, S.K., Saeb, A.T., Al Rubeaan, K.: Comparative analysis of data mining tools and classification techniques using weka in medical bioinformatics. Comput. Eng. Intell. Syst. 4(13), 28–38 (2013)
Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The WEKA data mining software: an update. ACM SIGKDD Explor. Newslett. 11(1), 10–18 (2009)
Iyer, A., Jeyalatha, S., Sumbaly, R.: Diagnosis of diabetes using classification mining techniques.arXiv preprint arXiv:1502.03774 (2015)
Jain, D.: A comparison of data mining tools using the implementation of C4.5 Algorithm. Int. J. Sci. Res. 3(8), 33–37 (2014)
Patil, P.H., Thube, S., Ratnaparkhi, B., Rajeswari, K.: Analysis of different data mining tools using classification, clustering and association rule mining. Int. J. Comput. Appl. 93(8), 35–39 (2014)
Solanki, H.: Comparative study of data mining tools and analysis with unified data mining theory. Int. J. Comput. Appl. 75(16) (2013)
Tolan, G.M., Soliman, O.S.: An experimental study of classification algorithms for terrorism prediction (2015)
Vaithiyanathan, V., Rajeswari, K., Tajane, K., Pitale, R.: Comparison of different classification techniques using different datasets. Int. J. Adv. Eng. Technol. 6(2), 764 (2013)
Wimmer, H., Powell, L.M.: A comparison of open source tools for data science. J. Inf. Syst. Appl. Res. 9(2), 4 (2016)
WEKA, the University of Waikato. http://www.cs.waikato.ac.nz/ml/weka/
Tanagra – a Free Data Mining Software for Teaching and Research. http://eric.univ-lyon2.fr/~ricco/tanagra/en/tanagra.html
KNIME (Konstanz Information Miner). http://www.knime.org/
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG
About this paper
Cite this paper
Goyal, A., Khandelwal, I., Anand, R., Srivastava, A., Swarnalatha, P. (2018). A Comparative Analysis of the Different Data Mining Tools by Using Supervised Learning Algorithms. In: Abraham, A., Cherukuri, A., Madureira, A., Muda, A. (eds) Proceedings of the Eighth International Conference on Soft Computing and Pattern Recognition (SoCPaR 2016). SoCPaR 2016. Advances in Intelligent Systems and Computing, vol 614. Springer, Cham. https://doi.org/10.1007/978-3-319-60618-7_11
Download citation
DOI: https://doi.org/10.1007/978-3-319-60618-7_11
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-60617-0
Online ISBN: 978-3-319-60618-7
eBook Packages: EngineeringEngineering (R0)