skip to main content
10.1145/3503161.3548288acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Searching Lightweight Neural Network for Image Signal Processing

Published:10 October 2022Publication History

ABSTRACT

Recently, it has been shown that the traditional Image Signal Processing (ISP) can be replaced by deep neural networks due to their superior performance. However, most of these networks require heavy computation burden and thus are far from sufficient to be deployed on resource-limited platforms, including but not limited to mobile devices and FPGA. To tackle this challenge, we propose an automated search framework that derives ISP models with high image quality while satisfying the low-computation requirement. To reduce the search cost, we adopt the weight-sharing strategy by introducing a supernet and decouple the architecture search into two stages, supernet training and hard-aware evolutionary search. With the proposed framework, we can train the ISP model once and quickly find high-performance but low-computation models on multiple devices. Experiments demonstrate that the searched ISP models have an excellent trade-off between image quality and model complexity, i.e., achieve compelling reconstruction quality with more than 90% reduction in FLOPs as compared to the state-of-the-art networks.

Skip Supplemental Material Section

Supplemental Material

References

  1. Michael Ashikhmin. 2002. A Tone Mapping Algorithm for High Contrast Images. In Proceedings of Eurographics Workshop on Rendering Techniques. 145--156.Google ScholarGoogle Scholar
  2. Tim Brooks, Ben Mildenhall, Tianfan Xue, Jiawen Chen, Dillon Sharlet, and Jonathan T. Barron. 2019. Unprocessing Images for Learned Raw Denoising. In CVPR. 11036--11045.Google ScholarGoogle Scholar
  3. Han Cai, Chuang Gan, Tianzhe Wang, Zhekai Zhang, and Song Han. 2020. Once-for-All: Train One Network and Specialize it for Efficient Deployment. In ICLR.Google ScholarGoogle Scholar
  4. Han Cai, Ligeng Zhu, and Song Han. 2019. ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware. In ICLR.Google ScholarGoogle Scholar
  5. Liang-Chieh Chen, George Papandreou, Iasonas Kokkinos, Kevin Murphy, and Alan L. Yuille. 2018. DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs. IEEE Trans. Pattern Anal. Mach. Intell., Vol. 40, 4 (2018), 834--848.Google ScholarGoogle ScholarCross RefCross Ref
  6. Xiangxiang Chu, Bo Zhang, and Ruijun Xu. 2021. FairNAS: Rethinking Evaluation Fairness of Weight Sharing Neural Architecture Search. In ICCV. IEEE, 12219--12228.Google ScholarGoogle Scholar
  7. Linhui Dai, Xiaohong Liu, Chengqi Li, and Jun Chen. 2020. AWNet: Attentive Wavelet Network for Image ISP. In ECCV Workshops, Vol. 12537. Springer, 185--201.Google ScholarGoogle Scholar
  8. Thomas Elsken, Jan Hendrik Metzen, and Frank Hutter. 2018. Neural Architecture Search: A Survey. CoRR, Vol. abs/1808.05377 (2018). showeprint[arXiv]1808.05377Google ScholarGoogle Scholar
  9. Michaë l Gharbi, Gaurav Chaurasia, Sylvain Paris, and Fré do Durand. 2016. Deep joint demosaicking and denoising. ACM Trans. Graph., Vol. 35, 6 (2016), 191:1--191:12.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Yuchao Gu, Shang-Hua Gao, Xu-Sheng Cao, Peng Du, Shao-Ping Lu, and Ming-Ming Cheng. 2021. iNAS: Integral NAS for Device-Aware Salient Object Detection. In ICCV. IEEE, 4914--4924.Google ScholarGoogle Scholar
  11. Zichao Guo, Xiangyu Zhang, Haoyuan Mu, Wen Heng, Zechun Liu, Yichen Wei, and Jian Sun. 2020. Single Path One-Shot Neural Architecture Search with Uniform Sampling. In ECCV, Vol. 12361. 544--560.Google ScholarGoogle Scholar
  12. Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep Residual Learning for Image Recognition. In CVPR. IEEE, 770--778.Google ScholarGoogle Scholar
  13. Ming-Chun Hsyu, Chih-Wei Liu, Chao-Hung Chen, Chao-Wei Chen, and Wen-Chia Tsai. 2021. CSAnet: High Speed Channel Spatial Attention Network for Mobile ISP. In CVPR Workshops. 2486--2493.Google ScholarGoogle Scholar
  14. Jie Hu, Li Shen, and Gang Sun. 2018. Squeeze-and-Excitation Networks. In CVPR. IEEE, 7132--7141.Google ScholarGoogle Scholar
  15. Gao Huang, Zhuang Liu, Laurens van der Maaten, and Kilian Q. Weinberger. 2017. Densely Connected Convolutional Networks. In CVPR. IEEE, 2261--2269.Google ScholarGoogle Scholar
  16. Andrey Ignatov, Cheng-Ming Chiang, Hsien-Kai Kuo, Anastasia Sycheva, and Radu Timofte. 2021. Learned Smartphone ISP on Mobile NPUs With Deep Learning, Mobile AI 2021 Challenge: Report. In CVPR Workshops. IEEE, 2503--2514.Google ScholarGoogle Scholar
  17. Andrey Ignatov, Luc Van Gool, and Radu Timofte. 2020. Replacing Mobile Camera ISP with a Single Deep Learning Model. In CVPR Workshops. IEEE, 2275--2285.Google ScholarGoogle ScholarCross RefCross Ref
  18. Andrey Ignatov and Radu Timofte. 2020. AIM 2020 Challenge on Learned Image Signal Processing Pipeline. In ECCV Workshops, Vol. 12537. 152--170.Google ScholarGoogle Scholar
  19. Justin Johnson, Alexandre Alahi, and Li Fei-Fei. 2016. Perceptual Losses for Real-Time Style Transfer and Super-Resolution. In ECCV, Vol. 9906. Springer, 694--711.Google ScholarGoogle ScholarCross RefCross Ref
  20. Ngai Ming Kwok, Haiyan Shi, Quang Phuc Ha, Gu Fang, Shengyong Chen, and Xiuping Jia. 2013. Simultaneous Image Color Correction and Enhancement Using Particle Swarm Optimization. Eng. Appl. Artif. Intell., Vol. 26, 10 (2013), 2356--2371.Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Hong-Kwai Lam, Oscar C. Au, and Chi-Wah Wong. 2004. Automatic white balancing using luminance component and standard deviation of RGB components [image preprocessing]. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). IEEE, 493--496.Google ScholarGoogle Scholar
  22. Xin Li. 2005. Demosaicing by Successive Approximation. IEEE Transactions on Signal Processing, Vol. 14, 3 (2005), 370--379.Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Zhetong Liang, Jianrui Cai, Zisheng Cao, and Lei Zhang. 2021. CameraNet: A Two-Stage Framework for Effective Camera ISP Learning. IEEE Trans. Image Process., Vol. 30 (2021), 2248--2262.Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Chenxi Liu, Liang-Chieh Chen, Florian Schroff, Hartwig Adam, Wei Hua, Alan L. Yuille, and Li Fei-Fei. 2019a. Auto-DeepLab: Hierarchical Neural Architecture Search for Semantic Image Segmentation. In CVPR. IEEE, 82--92.Google ScholarGoogle Scholar
  25. Hanxiao Liu, Karen Simonyan, and Yiming Yang. 2019b. DARTS: Differentiable Architecture Search. In ICLR. 1--13.Google ScholarGoogle Scholar
  26. Pengju Liu, Hongzhi Zhang, Kai Zhang, Liang Lin, and Wangmeng Zuo. 2018. Multi-Level Wavelet-CNN for Image Restoration. In CVPR Workshops. 773--782.Google ScholarGoogle Scholar
  27. Shanto Rahman, Mostafijur Rahman, Mohammad Abdullah-Al-Wadud, Golam Dastegir Al-Quaderi, and Mohammad Shoyaib. 2016. An Adaptive Gamma Correction for Image Enhancement. EURASIP J. Image Video Process., Vol. 2016 (2016), 35.Google ScholarGoogle ScholarCross RefCross Ref
  28. Sivalogeswaran Ratnasingam. 2019. Deep Camera: A Fully Convolutional Neural Network for Image Signal Processing. In ICCV Workshops. IEEE, 3868--3878.Google ScholarGoogle Scholar
  29. Esteban Real, Alok Aggarwal, Yanping Huang, and Quoc V. Le. 2019. Regularized Evolution for Image Classifier Architecture Search. In AAAI. 4780--4789.Google ScholarGoogle Scholar
  30. Shaoqing Ren, Kaiming He, Ross B. Girshick, and Jian Sun. 2017. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Trans. Pattern Anal. Mach. Intell., Vol. 39, 6 (2017), 1137--1149.Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Eli Schwartz, Raja Giryes, and Alexander M. Bronstein. 2019. DeepISP: Toward Learning an End-to-End Image Processing Pipeline. IEEE Trans. Image Process., Vol. 28, 2 (2019), 912--923.Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Wenzhe Shi, Jose Caballero, Ferenc Huszar, Johannes Totz, Andrew P. Aitken, Rob Bishop, Daniel Rueckert, and Zehan Wang. 2016. Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network. In CVPR. IEEE, 1874--1883.Google ScholarGoogle Scholar
  33. Xin Tao, Hongyun Gao, Xiaoyong Shen, Jue Wang, and Jiaya Jia. 2018. Scale-Recurrent Network for Deep Image Deblurring. In CVPR. IEEE, 8174--8182.Google ScholarGoogle Scholar
  34. Xintao Wang, Ke Yu, Shixiang Wu, Jinjin Gu, Yihao Liu, Chao Dong, Yu Qiao, and Chen Change Loy. 2018. ESRGAN: Enhanced Super-Resolution Generative Adversarial Networks. In ECCV Workshops, Vol. 11133. Springer, 63--79.Google ScholarGoogle Scholar
  35. Z. Wang, E.P. Simoncelli, and A.C. Bovik. 2003. Multiscale structural similarity for image quality assessment. In The Thrity-Seventh Asilomar Conference on Signals, Systems Computers, 2003, Vol. 2. 1398--1402 Vol.2.Google ScholarGoogle ScholarCross RefCross Ref
  36. H. Xu, L. Yao, Z. Li, X. Liang, and W. Zhang. 2019. Auto-FPN: Automatic Network Architecture Adaptation for Object Detection Beyond Classification. In ICCV. 6648--6657.Google ScholarGoogle Scholar
  37. Yuhui Xu, Lingxi Xie, Xiaopeng Zhang, Xin Chen, Guo-Jun Qi, Qi Tian, and Hongkai Xiong. 2020. PC-DARTS: Partial Channel Connections for Memory-Efficient Architecture Search. In ICLR.Google ScholarGoogle Scholar
  38. Kai Zhang, Wangmeng Zuo, Yunjin Chen, Deyu Meng, and Lei Zhang. 2017. Beyond a Gaussian Denoiser: Residual Learning of Deep CNN for Image Denoising. IEEE Trans. Image Process., Vol. 26, 7 (2017), 3142--3155.Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. Yulun Zhang, Kunpeng Li, Kai Li, Lichen Wang, Bineng Zhong, and Yun Fu. 2018a. Image Super-Resolution Using Very Deep Residual Channel Attention Networks. In ECCV, Vol. 11211. Springer, 294--310.Google ScholarGoogle ScholarCross RefCross Ref
  40. Yulun Zhang, Yapeng Tian, Yu Kong, Bineng Zhong, and Yun Fu. 2018b. Residual Dense Network for Image Super-Resolution. In CVPR. IEEE, 2472--2481.Google ScholarGoogle Scholar
  41. Xiawu Zheng, Rongrong Ji, Qiang Wang, Qixiang Ye, Zhenguo Li, Yonghong Tian, and Qi Tian. 2020. Rethinking Performance Estimation in Neural Architecture Search. In CVPR. IEEE, 11353--11362.Google ScholarGoogle Scholar
  42. Yu Zhu, Zhenyu Guo, Tian Liang, Xiangyu He, Chenghua Li, Cong Leng, Bo Jiang, Yifan Zhang, and Jian Cheng. 2020. EEDNet: Enhanced Encoder-Decoder Network for AutoISP. In ECCV Workshops, Vol. 12537. 171--184.Google ScholarGoogle Scholar
  43. Barret Zoph and Quoc V. Le. 2017. Neural Architecture Search with Reinforcement Learning. In ICLR.Google ScholarGoogle Scholar
  44. Barret Zoph, Vijay Vasudevan, Jonathon Shlens, and Quoc V. Le. 2018. Learning Transferable Architectures for Scalable Image Recognition. In CVPR. IEEE, 8697--8710.Google ScholarGoogle Scholar

Index Terms

  1. Searching Lightweight Neural Network for Image Signal Processing

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      MM '22: Proceedings of the 30th ACM International Conference on Multimedia
      October 2022
      7537 pages
      ISBN:9781450392037
      DOI:10.1145/3503161

      Copyright © 2022 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 10 October 2022

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate995of4,171submissions,24%

      Upcoming Conference

      MM '24
      MM '24: The 32nd ACM International Conference on Multimedia
      October 28 - November 1, 2024
      Melbourne , VIC , Australia
    • Article Metrics

      • Downloads (Last 12 months)93
      • Downloads (Last 6 weeks)7

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader