DOI: 10.1145/3638884.3638887
research-article

Statistical Characteristics-Based Multi-Scale Image Feature Extraction

Published: 23 April 2024

ABSTRACT

Convolutional Neural Networks (CNNs) have improved image feature extraction at multiple scales. However, they do not effectively match input features of differing importance or complexity to appropriate feature extraction branches. To solve this problem, we propose a novel Statistical Characteristics-based Multi-Scale Image Feature Extraction (SC-MSIFE) scheme for CNNs, which adaptively matches batch image feature subsets to appropriate feature extraction branches. We calculate and aggregate the gray distribution statistics of features to characterize the complexity, importance, and interdependencies of batch image feature subsets. We then reorder the batch image feature subsets according to this information. Finally, we route complex and significant batch image feature subsets to multi-scale feature extraction branches, and route subsets with simple and unimportant features to fewer-scale feature extraction branches. Extensive simulation results demonstrate that the proposed approach improves classification accuracy compared with the baselines.
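The abstract does not specify which statistics are used or how the subsets are formed, so the following is only a minimal sketch of the score, reorder, and route idea under stated assumptions. It assumes that a "batch image feature subset" is a group of channels of an (N, C, H, W) feature tensor, and that variance plus histogram entropy can stand in for the gray distribution statistics. The names `subset_scores`, `route_subsets`, and `multi_scale_fraction` are hypothetical and do not come from the paper.

```python
import numpy as np


def subset_scores(features, num_subsets, bins=16):
    """Split channels into subsets and score each one.

    Assumption: variance plus histogram entropy is used as a stand-in
    for the paper's gray distribution statistics.
    """
    subsets = np.array_split(features, num_subsets, axis=1)
    scores = []
    for subset in subsets:
        hist, _ = np.histogram(subset, bins=bins)
        p = hist / hist.sum()
        p = p[p > 0]  # drop empty bins so log2 is defined
        entropy = -np.sum(p * np.log2(p))
        scores.append(subset.var() + entropy)
    return subsets, np.array(scores)


def route_subsets(features, num_subsets=4, multi_scale_fraction=0.5):
    """Reorder subsets by score and split them between two branch groups.

    Assumption: a fixed fraction of the highest-scoring subsets goes to
    the multi-scale branches; the rest go to fewer-scale branches.
    """
    subsets, scores = subset_scores(features, num_subsets)
    order = np.argsort(-scores)  # highest score first
    k = max(1, int(round(multi_scale_fraction * num_subsets)))
    multi_scale = [subsets[i] for i in order[:k]]
    fewer_scale = [subsets[i] for i in order[k:]]
    return order, multi_scale, fewer_scale


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = rng.normal(size=(2, 8, 4, 4))
    x[:, 0:2] = 0.0  # make the first subset trivially simple
    order, multi_scale, fewer_scale = route_subsets(x)
    print("subset order (most complex first):", order)
```

In this sketch the constant subset receives the lowest score and is routed to the fewer-scale group. An actual implementation would replace the scoring function with the statistics defined in the paper and feed each group into its corresponding convolutional branches.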

  • Published in

    ICCIP '23: Proceedings of the 2023 9th International Conference on Communication and Information Processing
    December 2023
    648 pages
    ISBN: 9798400708909
    DOI: 10.1145/3638884

    Copyright © 2023 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Qualifiers

    • research-article
    • Research
    • Refereed limited

    Acceptance Rates

    Overall Acceptance Rate: 61 of 301 submissions, 20%