Abstract
In this paper, we present a three-stage expert system based on a hybrid support vector machines (SVM) approach to diagnose thyroid disease. Focusing on feature selection, the first stage aims at constructing diverse feature subsets with different discriminative capability. Switching from feature selection to model construction, in the second stage, the obtained feature subsets are fed into the designed SVM classifier for training an optimal predictor model whose parameters are optimized by particle swarm optimization (PSO). Finally, the obtained optimal SVM model proceeds to perform the thyroid disease diagnosis tasks using the most discriminative feature subset and the optimal parameters. The effectiveness of the proposed expert system (FS-PSO-SVM) has been rigorously evaluated against the thyroid disease dataset, which is commonly used among researchers who use machine learning methods for thyroid disease diagnosis. The proposed system has been compared with two other related methods including the SVM based on the Grid search technique (Grid-SVM) and the SVM based on Grid search and principle component analysis (PCA-Grid-SVM) in terms of their classification accuracy. Experimental results demonstrate that FS-PSO-SVM significantly outperforms the other ones. In addition, Compared to the existing methods in previous studies, the proposed system has achieved the highest classification accuracy reported so far by 10-fold cross-validation (CV) method, with the mean accuracy of 97.49% and with the maximum accuracy of 98.59%. Promisingly, the proposed FS-PSO-SVM expert system might serve as a new candidate of powerful tools for diagnosing thyroid disease with excellent performance.
Similar content being viewed by others
References
Ozyilmaz, L., and Yildirim T. Diagnosis of thyroid disease using artificial neural network methods. In Proceedings of ICONIP’02 nineth international conference on neural information processing, Orchid Country Club, Singapore (pp. 2033–2036), 2002.
Serpen, G., Jiang, H., and Allred, L. Performance analysis of probabilistic potential function neural network classifier. In Proceedings of artificial neural networks in engineering conference, St. Louis, MO, (Vol. 7, pp. 471–476), 1997.
Pasi, L. Similarity classifier applied to medical data sets, 2004, 10 sivua, Fuzziness in Finland’04. In International conference on soft computing, Helsinki, Finland & Gulf of Finland & Tallinn, Estonia, 2004.
Polat, K., Sahan, S., and Gunes, S., A novel hybrid method based on artificial immune recognition system (AIRS) with fuzzy weighted pre-processing for thyroid disease diagnosis. Expert Syst. Appl. 32(4):1141–1147, 2007.
Keles, A., and Keles, A., ESTDD: expert system for thyroid diseases diagnosis. Expert Syst. Appl. 34(1):242–246, 2008.
Temurtas, F., A comparative study on thyroid disease diagnosis using neural networks. Expert Syst. Appl. 36(1):944–949, 2009.
Dogantekin, E., Dogantekin, A., and Avci, D., An expert system based on Generalized Discriminant Analysis and Wavelet Support Vector Machine for diagnosis of thyroid diseases. Expert Syst. Appl. 38(1):146–150, 2011.
Boser, B. E., Guyon, I. M., and Vapnik, V. N. A training algorithm for optimal margin classifiers. In Fifth Annual Workshop on Computational Learning Theory, Pittsburgh, ACM, 1992.
Cortes, C., and Vapnik, V., Support-vector networks. Mach. learn. 20(3):273–297, 1995.
Osuna, E., Freund, R., and Girosit, F. Training support vector machines: Application to face detection. In Proceedings of computer vision and pattern recognition, Puerto Rico (pp. 130–136), 1997.
Joachims, T., Nedellec, C., and Rouveirol, C. Text categorization with support vector machines: learning with many relevant. Proceedings of the 10th European Conference on Machine Learning, pp. 137–142, 1998.
Sakar, C., and Kursun, O., Telediagnosis of Parkinson’s disease using measurements of dysphonia. J. Med. Syst. 34(4):591–599, 2010.
John, G. H., Kohavi, R., and Pfleger, K. Irrelevant features and the subset selection problem. 1994: Citeseer.
Frohlich, H., Chapelle, O., and Scholkopf, B. Feature selection for support vector machines by means of genetic algorithms: In: 15th IEEE international conference on tools with artificial intelligence. Sacramento, CA, USA. pp. 142–148, 2003.
Bishop, C., Neural networks for pattern recognition. Oxford University Press, New York, 1995.
Vapnik, V. N., The nature of statistical learning theory. Springer, New York, 1995.
Vapnik, V., Statistical learning theory. Wiley, New York, 1998.
Cristianini, N., and Shawe-Taylor, J. An introduction to support vector machines and other kernel based learning methods. Cambridge University Press, New York, 2000.
Eberhart, R. C., and Kennedy, J. A new optimizer using particle swarm theory. In: Proceedings of the 6th International Symposium on Micro Machine and Human Science, vol. 43, pp. 39–43,1995.
Kennedy, J., and Eberhart, R. C. Particle swarm optimization. In: Proceedings of the IEEE International Conference on Neural Network, vol. 4, pp. 1942–1948, 1995.
Shi, Y., and Eberhart, R. A modified particle swarm optimizer, in: Evolutionary Computation Proceedings. IEEE World Congress on Computational Intelligence, pp. 69–73, 1998.
Ratnaweera, A., Halgamuge, S., and Watson, H., Self-organizing hierarchical particle swarm optimizer with time-varying acceleration coefficients. IEEE Trans. Evol. Comput. 8(3):240–255, 2004.
Shi, Y., and Eberhart, R. C. Empirical study of particle swarm optimization. Congress on evolutionary computation, Washington D.C., USA, pp 1945–1949, 1999.
Salzberg, S. L., On comparing classifiers: pitfalls to avoid and a recommended approach. Data Min. Knowl. Discov. 1(3):317–328, 1997.
Ron, K. A study of cross-validation and bootstrap for accuracy estimation and model selection, in Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2, 1995.
Chang, C. C., and Lin, C. J. LIBSVM: a library for support vector machines. 2001, Software available at http://www.csie.ntu.edu.tw/cjlin/libsvm.
Hsu, C. W., Chang, C. C., and Lin, C. J. A practical guide to support vector classification. Technical report, Department of Computer Science and Information Engineering, National Taiwan University, Taipei, 2003. available at http://www.csie.ntu.edu.tw/cjlin/libsvm/.
Acknowledgements
This research is supported by the National Natural Science Foundation of China (NSFC) under Grant Nos. 60873149, 60973088, 60773099 and the National High-Tech Research and Development Plan of China under Grant Nos. 2006AA10Z245, 2006AA10A309. This work is also supported by the Open Projects of Shanghai Key Laboratory of Intelligent Information Processing in Fudan University under the Grand No. IIPL-09-007, the Open Project Program of the National Laboratory of Pattern Recognition (NLPR) and the basic scientific research fund of Chinese Ministry of Education.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Chen, HL., Yang, B., Wang, G. et al. A Three-Stage Expert System Based on Support Vector Machines for Thyroid Disease Diagnosis. J Med Syst 36, 1953–1963 (2012). https://doi.org/10.1007/s10916-011-9655-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10916-011-9655-8