skip to main content
research-article

Medical Image Classification based on an Adaptive Size Deep Learning Model

Published: 26 October 2021 Publication History

Abstract

With the rapid development of Artificial Intelligence (AI), deep learning has increasingly become a research hotspot in various fields, such as medical image classification. Traditional deep learning models use Bilinear Interpolation when processing classification tasks of multi-size medical image dataset, which will cause the loss of information of the image, and then affect the classification effect. In response to this problem, this work proposes a solution for an adaptive size deep learning model. First, according to the characteristics of the multi-size medical image dataset, the optimal size set module is proposed in combination with the unpooling process. Next, an adaptive deep learning model module is proposed based on the existing deep learning model. Then, the model is fused with the size fine-tuning module used to process multi-size medical images to obtain a solution of the adaptive size deep learning model. Finally, the proposed solution model is applied to the pneumonia CT medical image dataset. Through experiments, it can be seen that the model has strong robustness, and the classification effect is improved by about 4% compared with traditional algorithms.

References

[1]
Siam University, Bangkok, Bangkok, TH. 2017. Artificial intelligence, machine learning and deep learning. 2017 15th International Conference on ICT and Knowledge Engineering (ICT&KE), 1–6. DOI:
[2]
Zhengchao Zhang, Meng Li, Xi Lin, Yinhai Wang, Fang He. 2019. Multistep speed prediction on traffic networks: A deep learning approach considering spatio-temporal dependencies. Transportation Research Part C-emerging Technologies 105:297–322. DOI:
[3]
Samaneh Mahdavifar, Ali A. Ghorbani. 2019. Application of deep learning to cybersecurity: A survey. Neurocomputing 347 (2019), 149–176. DOI:
[4]
Yu Li, Chao Huang, Lizhong Ding, Zhongxiao Li, Yijie Pan, and Xin Gao. 2019. Deep learning in bioinformatics: Introduction, application, and perspective in the big data era. Methods 166, 4–21. DOI:
[5]
Zahra Amini and Hossein Rabbani. 2016. Classification of medical image modeling methods: A review. Current Medical Imaging Reviews 12, 2 (2016), 130–148. DOI:
[6]
Yu-Dong Zhang, Zhengchao Dong, Shui-Hua Wang, Xiang Yu, Xujing Yao, Qinghua Zhou, Hua Hu, Min Li, Carmen Jiménez-Mesa, Javier Ramirez, Francisco J. Martinez, and Juan Manuel Gorriz. 2020. Advances in multimodal data fusion in Neuroimaging: Overview, challenges, and novel orientation, information fusion 64 (2020), 149–187.
[7]
Xiaosong Wang, Yifan Peng, Le Lu, Zhiyong Lu, Mohammadhadi Bagheri, and Ronald M. Summers. 2017. ChestX-Ray8: Hospital-Scale Chest X-ray database and benchmarks on weakly-supervised classification and localization of common thorax diseases. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 3462–3471. DOI:
[8]
Olga Russakovsky, Jia Deng, Hao Su, Jonathan Krause, Sanjeev Satheesh, Sean Ma, Zhiheng Huang, Andrej Karpathy, Aditya Khosla, Michael Bernstein, Alexander C. Berg, and Li Fei-Fei. 2015. ImageNet large scale visual recognition challenge. International Journal of Computer Vision 115, 3 (2015), 211–252. DOI: https://doi.org/10.1007/s11263-015-0816-y
[9]
Geert Litjens, Thijs Kooi, Babak Ehteshami Bejnordi, Arnaud Arindra Adiyoso Setio, Francesco Ciompi, Mohsen Ghafoorian, Jeroen A. W. M. van der Laak, Bram van Ginneken, and Clara I. Sánchez. 2017. A survey on deep learning in medical image analysis. Medical Image Analysis 42 (2017), 60–88. DOI:
[10]
A. Kaehler and G. Bradski. 2016. Learning OpenCV 3: Computer Vision in C++ with the OpenCV Library. O'Reilly Media, Inc.
[11]
Hui Li, Xiao-Jun Wu, and Josef Kittler. 2018. Infrared and visible image fusion using a deep learning framework. 2018 24th International Conference on Pattern Recognition (ICPR), 2705–2710. DOI:
[12]
Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton. 2017. ImageNet classification with deep convolutional neural networks. Communications of The ACM 60, 6 (2017), 84–90. DOI:https://doi.org/10.1145/3065386
[13]
K. Simonyan and A. Zisserman. 2015. Very deep convolutional networks for large-scale image recognition. Computer Vision and Pattern Recognition. Retrieved from https://arxiv.org/abs/1409.1556.
[14]
Christian Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, and Andrew Rabinovich. 2015. Going deeper with convolutions. 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 1–9. DOI:
[15]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 770–778. DOI:
[16]
Mingxing Tan and Quoc V. Le. 2019. EfficientNet: Rethinking model scaling for convolutional neural networks. In International Conference on Machine Learning, 6105–6114. Retrieved from.
[17]
Siyuan Lu, Zhihai Lu, and Yu-Dong Zhang. 2019. Pathological brain detection based on AlexNet and transfer learning. Journal of Computational Science 30:41–47.
[18]
Shui-Hua Wang, Vishnu Varthanan Govindaraj, Juan Manuel Górriz, Xin Zhang, and Yu-Dong Zhang. 2021. Covid-19 classification by FGCNet with deep feature fusion from graph convolutional network and convolutional neural network. Information Fusion 67 (2021), 208–229.
[19]
Shui-Hua Wang, Deepak Ranjan Nayak, David S Guttery, Xin Zhang, and Yu-Dong Zhang. 2021. COVID-19 classification by CCSHNet with deep fusion using transfer learning and discriminant correlation analysis. Information Fusion 68 (2021), 131–148.
[20]
Shui-Hua Wang, Shipeng Xie, Xianqing Chen, David S. Guttery, Chaosheng Tang, Junding Sun, and Yu-Dong Zhang. 2019. Alcoholism identification based on an AlexNet transfer learning model. Frontiers in Psychiatry 10 (2019), 205. DOI:
[21]
Khalid M. Hosny, Mohamed A. Kassem, and Mohamed M. Foaud. 2019. Classification of skin lesions using transfer learning and augmentation with Alex-net. Foaud 14, 5. DOI:
[22]
Meiyu Li, Hailiang Tang, Michael D. Chan, Xiaobo Zhou, and Xiaohua Qian. 2020. DC-AL GAN: Pseudo progression and true tumor progression of glioblastoma multiform image classification based on DCGAN and AlexNet. Medical Physics 47, 3 (2020), 1139–1150. DOI:
[23]
Lei Geng, Siqi Zhang, Jun Tong, and Zhitao Xiao. 2019. Lung segmentation method with dilated convolution based on VGG-16 network. Computer-assisted surgery (Abingdon, England) 24, 27–33. DOI:
[24]
Taranjit Kaur and Tapan Kumar Gandhi. 2019. Automated brain image classification based on VGG-16 and transfer learning. 2019 International Conference on Information Technology (ICIT), 94–98. DOI:
[25]
JoonYul Choi, Tae Keun Yoo, Jeong Gi Seo, Jiyong Kwak, Terry Taewoong Um, and Tyler Hyungtaek Rim. 2017. Multi-categorical deep learning neural network to classify retinal images: A pilot study employing small database. PLOS ONE 12, 11 (2017). DOI:
[26]
Kalyanakumar Jayapriya and Israel Jeena Jacob. 2020. Hybrid fully convolutional networks-based skin lesion segmentation and melanoma detection using the deep feature. International Journal of Imaging Systems and Technology 30, 2 (2020), 348–357. DOI:
[27]
Jie Bai, Huiyan Jiang, Siqi Li, and Xiaoqi Ma. 2019. NHL pathological image classification based on hierarchical local information and GoogLeNet-Based representations. BioMed Research International 2019: 1065652. DOI:
[28]
Jianning Chi, Ekta Walia, Paul Babyn, Jimmy Wang, Gary Groot, and Mark Eramian. 2017. Thyroid nodule classification in ultrasound images by fine-tuning deep convolutional neural network. Journal of Digital Imaging 30, 4 (2017), 477–486. DOI:
[29]
Xiaoyu Cui, Ran Wei, Lixin Gong, Ruiqun Qi, Zeyin Zhao, Hongduo Chen, Kaixin Song, Amer A. A. Abdulrahman, Yining Wang, John Z. S. Chen, Shuo Chen, Yue Zhao, and Xinghua Gao. 2018. Assessing the effectiveness of artificial intelligence methods for melanoma: A retrospective review. Journal of The American Academy of Dermatology 81, 5 (2018), 1176–1180. DOI:
[30]
Ginji Hirano, Mitsutaka Nemoto, Yuichi Kimura, Yoshio Kiyohara, Hiroshi Koga, Naoya Yamazaki, Gustav Christensen, Christian Ingvar, Kari Nielsen, Atsushi Nakamura, Takayuki Sota, and Takashi Nagaoka. 2020. Automatic diagnosis of melanoma using hyperspectral data and GoogLeNet. Skin Research and Technology 26, 6 (2020), 891–897. DOI:
[31]
Qingchen Zhang, Changchuan Bai, Zhuo Liu, Laurence T. Yang, Hang Yu, Jingyuan Zhao, and Hong Yuan. 2020. A GPU-based residual network for medical image classification in smart medicine. Information Sciences 536 (2020), 91–100. DOI:
[32]
Zhenyu Lu, Yanzhong Bai, Yi Chen, Chunqiu Su, Shanshan Lu, Tianming Zhan, Xunning Hong, and Shuihua Wang. 2020. The classification of gliomas based on a Pyramid dilated convolution resnet model. Pattern Recognition Letters 133 (2020), 173–179. DOI:
[33]
Li Ma, Renjun Shuai, Xuming Ran, Wenjia Liu, and Chao Ye. 2020. Combining DC-GAN with ResNet for blood cell image classification. Medical & Biological Engineering & Computing 58, 6 (2020), 1251–1264. DOI:
[34]
Daniel Kermany, Kang Zhang, and Michael Goldbaum. 2018. Large dataset of labeled optical coherence tomography (OCT) and Chest X-Ray Images, Mendeley Data, V3. DOI:
[35]
W. Li, Q. Huang and G. Srivastava. 2021. Contour feature extraction of medical image based on multi-threshold optimization. Mobile Networks and Applications 26, 1 (2021 Feb), 381–389.
[36]
D. Połap, G. Srivastava, and K. Yu. 2021. Agent architecture of an intelligent medical system based on federated learning and blockchain technology. Journal of Information Security and Applications 1, 58 (2021 May), 102748.

Cited By

View all
  • (2025)Charger Placement With Wave InterferenceIEEE Transactions on Mobile Computing10.1109/TMC.2024.346040324:1(261-275)Online publication date: 1-Jan-2025
  • (2025)Long-tailed medical diagnosis with relation-aware representation learning and iterative classifier calibrationComputers in Biology and Medicine10.1016/j.compbiomed.2025.109772188(109772)Online publication date: Apr-2025
  • (2024)Toward Long Form Audio-Visual Video UnderstandingACM Transactions on Multimedia Computing, Communications, and Applications10.1145/367207920:9(1-26)Online publication date: 7-Jun-2024
  • Show More Cited By

Index Terms

  1. Medical Image Classification based on an Adaptive Size Deep Learning Model

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Multimedia Computing, Communications, and Applications
    ACM Transactions on Multimedia Computing, Communications, and Applications  Volume 17, Issue 3s
    October 2021
    324 pages
    ISSN:1551-6857
    EISSN:1551-6865
    DOI:10.1145/3492435
    Issue’s Table of Contents

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 26 October 2021
    Accepted: 01 May 2021
    Revised: 01 April 2021
    Received: 01 November 2020
    Published in TOMM Volume 17, Issue 3s

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Deep learning
    2. medical images
    3. adaptive size
    4. classification
    5. multi-size

    Qualifiers

    • Research-article
    • Refereed

    Funding Sources

    • Natural Science Foundation of Hunan Province
    • Key Scientific Research Projects of Department of Education of Hunan Province
    • Hunan Provincial Science & Technology Project Foundation
    • National Natural Science Foundation of China
    • Scientific Research Fund of Hunan Provincial Education Department

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)200
    • Downloads (Last 6 weeks)17
    Reflects downloads up to 03 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2025)Charger Placement With Wave InterferenceIEEE Transactions on Mobile Computing10.1109/TMC.2024.346040324:1(261-275)Online publication date: 1-Jan-2025
    • (2025)Long-tailed medical diagnosis with relation-aware representation learning and iterative classifier calibrationComputers in Biology and Medicine10.1016/j.compbiomed.2025.109772188(109772)Online publication date: Apr-2025
    • (2024)Toward Long Form Audio-Visual Video UnderstandingACM Transactions on Multimedia Computing, Communications, and Applications10.1145/367207920:9(1-26)Online publication date: 7-Jun-2024
    • (2024)Backdoor Two-Stream Video Models on Federated LearningACM Transactions on Multimedia Computing, Communications, and Applications10.1145/365130720:11(1-20)Online publication date: 12-Sep-2024
    • (2024)Learning Nighttime Semantic Segmentation the Hard WayACM Transactions on Multimedia Computing, Communications, and Applications10.1145/365003220:7(1-23)Online publication date: 16-May-2024
    • (2024)Multimodal Visual-Semantic Representations Learning for Scene Text RecognitionACM Transactions on Multimedia Computing, Communications, and Applications10.1145/364655120:7(1-18)Online publication date: 27-Mar-2024
    • (2024)SWRM: Similarity Window Reweighting and Margin for Long-Tailed RecognitionACM Transactions on Multimedia Computing, Communications, and Applications10.1145/364381620:6(1-18)Online publication date: 8-Mar-2024
    • (2024)Nonlocal Hybrid Network for Long-tailed Image ClassificationACM Transactions on Multimedia Computing, Communications, and Applications10.1145/363025620:4(1-22)Online publication date: 11-Jan-2024
    • (2024)Intelligent Trajectory Design and Charging Scheduling in Wireless Rechargeable Sensor Networks With ObstaclesIEEE Transactions on Mobile Computing10.1109/TMC.2024.335007523:9(8664-8679)Online publication date: 1-Sep-2024
    • (2024)Market Sentiment Analysis Based on Image Processing With Put-Call Volatility Gap SurfaceIEEE Transactions on Computational Social Systems10.1109/TCSS.2022.322405411:1(267-281)Online publication date: Feb-2024
    • Show More Cited By

    View Options

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    Full Text

    HTML Format

    View this article in HTML Format.

    HTML Format

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media