Abstract
In this paper, we propose the random forest algorithm of biggest margin trees \(RF{-}BMT\) for the multi-class classification. The novel algorithm enhances the classification of chest X-ray images, specifically for distinguishing between normal, covid-19, edema, mass-nodule, and pneumothorax cases. Our approach combines contrastive learning with our proposed algorithm to improve performance and address the limitation of labeled data by leveraging a large amount of unlabeled data for learning features. We propose training the \(RF{-}BMT\) algorithm on the features extracted from the linear fine-tuned model of Momentum Contrast (MoCo), which is trained on Resnet50 architecture. The \(RF{-}BMT\) algorithm plays a role as a replacement for softmax in deep networks. Based on the empirical results, our proposed \(RF{-}BMT\) algorithm demonstrates substantial improvement compared to solely fine-tuning the linear layer both the ImageNet pretrained model and the MoCo pretrained model, reaching an impressive accuracy rate of 88.4%.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Abadi, M., et al.: Tensorflow: large-scale machine learning on heterogeneous distributed systems. arXiv preprint arXiv:1603.04467 (2016)
Afshar, P., Heidarian, S., Naderkhani, F., Oikonomou, A., Plataniotis, K.N., Mohammadi, A.: COVID-caps: a capsule network-based framework for identification of COVID-19 cases from x-ray images. Pattern Recogn. Lett. 138, 638–643 (2020)
Akkus, Z., Galimzianova, A., Hoogi, A., Rubin, D.L., Erickson, B.J.: Deep learning for brain MRI segmentation: state of the art and future directions. J. Digit. Imaging 30, 449–459 (2017)
Bhandary, A., et al.: Deep-learning framework to detect lung abnormality-a study with chest x-ray and lung CT scan images. Pattern Recogn. Lett. 129, 271–278 (2020)
Breiman, L.: Random forests. Mach. Learn. 45(1), 5–32 (2001)
Breiman, L., Friedman, J.H., Olshen, R.A., Stone, C.: Classification and Regression Trees. Wadsworth International (1984)
Castiglioni, I., et al.: Ai applications to medical images: from machine learning to deep learning. Physica Med. 83, 9–24 (2021)
Chakraborty, S., Murali, B., Mitra, A.K.: An efficient deep learning model to detect COVID-19 using chest x-ray images. Int. J. Environ. Res. Public Health 19(4), 2013 (2022)
Chen, T., Guestrin, C.: XGBoost: a scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 785–794 (2016)
Chen, T., Kornblith, S., Norouzi, M., Hinton, G.: A simple framework for contrastive learning of visual representations. arXiv preprint arXiv:2002.05709 (2020)
Chen, T., Kornblith, S., Swersky, K., Norouzi, M., Hinton, G.: Big self-supervised models are strong semi-supervised learners. arXiv preprint arXiv:2006.10029 (2020)
Chen, X., Fan, H., Girshick, R., He, K.: Improved baselines with momentum contrastive learning. arXiv preprint arXiv:2003.04297 (2020)
Chlap, P., Min, H., Vandenberg, N., Dowling, J., Holloway, L., Haworth, A.: A review of medical image data augmentation techniques for deep learning applications. J. Med. Imaging Radiat. Oncol. 65(5), 545–563 (2021)
Chouhan, V., et al.: A novel transfer learning based approach for pneumonia detection in chest x-ray images. Appl. Sci. 10(2), 559 (2020)
Cohen, J.P., Morrison, P., Dao, L., Roth, K., Duong, T.Q., Ghassemi, M.: COVID-19 image data collection: Prospective predictions are the future. arXiv preprint arXiv:2006.11988 (2020)
Do, T.N., Le, V.T., Doan, T.H.: SVM on top of deep networks for COVID-19 detection from chest x-ray images. J. Inf. Commun. Converg. Eng. (2022)
Do, T., Lenca, P., Lallich, S.: Classifying many-class high-dimensional fingerprint datasets using random forest of oblique decision trees. Vietnam J. Comput. Sci. 2(1), 3–12 (2015)
Dorogush, A.V., Ershov, V., Gulin, A.: CatBoost: gradient boosting with categorical features support. arXiv preprint arXiv:1810.11363 (2018)
Haghanifar, A., Majdabadi, M.M., Ko, S.: COVID-19 chest x-ray image repository (2021). https://figshare.com/articles/dataset/COVID-19/_Chest/_X-Ray/_Image/_Repository/12580328
He, K., Fan, H., Wu, Y., Xie, S., Girshick, R.: Momentum contrast for unsupervised visual representation learning. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9729–9738 (2020)
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
Hemdan, E.E.D., Shouman, M.A., Karar, M.E.: COVIDX-Net: a framework of deep learning classifiers to diagnose COVID-19 in x-ray images. arXiv preprint arXiv:2003.11055 (2020)
Irvin, J., et al.: CheXpert: a large chest radiograph dataset with uncertainty labels and expert comparison (2019)
Ke, G., et al.: LightGBM: a highly efficient gradient boosting decision tree. In: Advances in Neural Information Processing Systems, vol. 30 (2017)
Ker, J., Wang, L., Rao, J., Lim, T.: Deep learning applications in medical image analysis. IEEE Access 6, 9375–9389 (2017)
Kermany, D.S., et al.: Identifying medical diagnoses and treatable diseases by image-based deep learning. Cell 172(5), 1122–1131 (2018)
Kesim, E., Dokur, Z., Olmez, T.: X-ray chest image classification by a small-sized convolutional neural network. In: 2019 Scientific Meeting on Electrical-Electronics & Biomedical Engineering and Computer Science (EBBT), pp. 1–5. IEEE (2019)
Liang, C.H., Liu, Y.C., Wu, M.T., Garcia-Castro, F., Alberich-Bayarri, A., Wu, F.Z.: Identifying pulmonary nodules or masses on chest radiography using deep learning: external validation and strategies to improve clinical practice. Clin. Radiol. 75(1), 38–45 (2020)
Nguyen, H.Q., et al.: VinDR-CXR: an open dataset of chest x-rays with radiologist’s annotations. Sci. Data 9(1), 429 (2022)
Ozturk, T., Talo, M., Yildirim, E.A., Baloglu, U.B., Yildirim, O., Acharya, U.R.: Automated detection of COVID-19 cases using deep neural networks with x-ray images. Comput. Biol. Med. 121, 103792 (2020)
Pedregosa, F., et al.: Scikit-learn: machine learning in python. J. Mach. Learn. Res. 12, 2825–2830 (2011)
Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann, San Mateo, CA (1993)
Rajpurkar, P., et al.: Deep learning for chest radiograph diagnosis: a retrospective comparison of the cheXNeXt algorithm to practicing radiologists. PLoS Med. 15(11), e1002686 (2018)
Saltz, J., et al.: Stony brook university COVID-19 positive cases. Cancer Imaging Archive 4 (2021)
Shiraishi, J., et al.: Development of a digital image database for chest radiographs with and without a lung nodule: receiver operating characteristic analysis of radiologists’ detection of pulmonary nodules. Am. J. Roentgenol. 174(1), 71–74 (2000)
Sowrirajan, H., Yang, J., Ng, A.Y., Rajpurkar, P.: MoCo pretraining improves representation and transferability of chest x-ray models. In: Medical Imaging with Deep Learning, pp. 728–744. PMLR (2021)
Vapnik, V.: The Nature of Statistical Learning Theory, 2nd edn. Springer, Berlin (2000)
Vayá, M.D.L.I., et al.: BIMCV COVID-19+: a large annotated dataset of RX and CT images from COVID-19 patients. arXiv preprint arXiv:2006.01174 (2020)
Wang, X., Peng, Y., Lu, L., Lu, Z., Bagheri, M., Summers, R.M.: ChestX-ray8: hospital-scale chest x-ray database and benchmarks on weakly-supervised classification and localization of common thorax diseases. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2097–2106 (2017)
Winther, H., et al.: Dataset: COVID-19 Image Repository. Hannover Medical School, Hannover, Germany (2020)
Woźniak, M., Połap, D., Capizzi, G., Sciuto, G.L., Kośmider, L., Frankiewicz, K.: Small lung nodules detection based on local variance analysis and probabilistic neural network. Comput. Meth. Programs Biomed. 161, 173–180 (2018)
Yadav, S.S., Jadhav, S.M.: Deep convolutional neural network based medical image classification for disease diagnosis. J. Big Data 6(1), 1–18 (2019)
Zawacki, A., et al.: SIIM-ACR pneumothorax segmentation (2019). https://kaggle.com/competitions/siim-acr-pneumothorax-segmentation
Acknowledgements
This work has been funded by the STARWARS project (STormwAteR and WastewAteR networkS heterogeneous data AI-driven management). The authors would like to particularly thank for the STARWARS’ support.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Vo, TT., Do, TN. (2023). Biggest Margin Tree for the Multi-class Classification. In: Dang, T.K., Küng, J., Chung, T.M. (eds) Future Data and Security Engineering. Big Data, Security and Privacy, Smart City and Industry 4.0 Applications. FDSE 2023. Communications in Computer and Information Science, vol 1925. Springer, Singapore. https://doi.org/10.1007/978-981-99-8296-7_3
Download citation
DOI: https://doi.org/10.1007/978-981-99-8296-7_3
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-8295-0
Online ISBN: 978-981-99-8296-7
eBook Packages: Computer ScienceComputer Science (R0)