ABSTRACT
Super-resolution techniques have recently made great strides, especially in the context of deep learning. In spite of this, not much research has been conducted on the explicit application of these techniques to biodiversity related problems such as species identification. We took a state-of-the-art super-resolution model (enhanced deep super-resolution network, i.e. EDSR), and enhanced it further with perceptual and texture losses, and a test-time-augmentation solution. Furthermore, we designed a qualitative assessment framework and studied its relationship with automated performance metrics. Our results show that our proposed modifications to EDSR improve the recovery of details, and that current automated metrics (e.g. Peak Signal-to-Noise Ratio) are inadequate in the context of super-resolution for species identification.
- Marco Bevilacqua, Aline Roumy, Christine Guillemot, and Marie Line Alberi-Morel. 2012. Low-complexity single-image super-resolution based on nonnegative neighbor embedding. (2012).Google Scholar
- Hong Chang, Dit-Yan Yeung, and Yimin Xiong. 2004. Super-resolution through neighbor embedding. In Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004., Vol. 1. IEEE, I–I.Google ScholarCross Ref
- Liang Chen and Cheng-min Gao. 2007. Fast discrete bilinear interpolation algorithm. Computer Engineering and Design 28, 15 (2007), 3787–3790.Google Scholar
- Qiang Dai, Xi Cheng, Yan Qiao, and Youhua Zhang. 2020. Agricultural pest super-resolution and identification with attention enhanced residual and dense fusion generative and adversarial network. IEEE Access 8(2020), 81943–81959.Google ScholarCross Ref
- Chao Dong, Chen Change Loy, Kaiming He, and Xiaoou Tang. 2014. Learning a deep convolutional network for image super-resolution. In European conference on computer vision. Springer, 184–199.Google ScholarCross Ref
- Chao Dong, Chen Change Loy, and Xiaoou Tang. 2016. Accelerating the super-resolution convolutional neural network. In European conference on computer vision. Springer, 391–407.Google ScholarCross Ref
- Alexey Dosovitskiy and Thomas Brox. 2016. Generating images with perceptual similarity metrics based on deep networks. Advances in neural information processing systems 29 (2016), 658–666.Google Scholar
- Claude E Duchon. 1979. Lanczos filtering in one and two dimensions. Journal of Applied Meteorology and Climatology 18, 8(1979), 1016–1022.Google ScholarCross Ref
- Frederick N Fritsch and Ralph E Carlson. 1980. Monotone piecewise cubic interpolation. SIAM J. Numer. Anal. 17, 2 (1980), 238–246.Google ScholarDigital Library
- Xinbo Gao, Kaibing Zhang, Dacheng Tao, and Xuelong Li. 2012. Image super-resolution with sparse neighbor embedding. IEEE Transactions on Image Processing 21, 7 (2012), 3194–3205.Google ScholarDigital Library
- Leon A Gatys, Alexander S Ecker, and Matthias Bethge. 2016. Image style transfer using convolutional neural networks. In Proceedings of the IEEE conference on computer vision and pattern recognition. 2414–2423.Google ScholarCross Ref
- Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition. 770–778.Google ScholarCross Ref
- Justin Johnson, Alexandre Alahi, and Li Fei-Fei. 2016. Perceptual losses for real-time style transfer and super-resolution. In European conference on computer vision. Springer, 694–711.Google ScholarCross Ref
- Chieh-Chi Kao, Yuxiang Wang, Jonathan Waltman, and Pradeep Sen. 2019. Patch-Based Image Hallucination for Super Resolution With Detail Reconstruction From Similar Sample Images. IEEE Transactions on Multimedia 22, 5 (2019), 1139–1152.Google ScholarCross Ref
- Jiwon Kim, Jung Kwon Lee, and Kyoung Mu Lee. 2016. Accurate image super-resolution using very deep convolutional networks. In Proceedings of the IEEE conference on computer vision and pattern recognition. 1646–1654.Google ScholarCross Ref
- Wei-Sheng Lai, Jia-Bin Huang, Narendra Ahuja, and Ming-Hsuan Yang. 2017. Deep laplacian pyramid networks for fast and accurate super-resolution. In Proceedings of the IEEE conference on computer vision and pattern recognition. 624–632.Google ScholarCross Ref
- Christian Ledig, Lucas Theis, Ferenc Huszár, Jose Caballero, Andrew Cunningham, Alejandro Acosta, Andrew Aitken, Alykhan Tejani, Johannes Totz, Zehan Wang, 2017. Photo-realistic single image super-resolution using a generative adversarial network. In Proceedings of the IEEE conference on computer vision and pattern recognition. 4681–4690.Google ScholarCross Ref
- Bee Lim, Sanghyun Son, Heewon Kim, Seungjun Nah, and Kyoung Mu Lee. 2017. Enhanced deep residual networks for single image super-resolution. In Proceedings of the IEEE conference on computer vision and pattern recognition workshops. 136–144.Google ScholarCross Ref
- Alec Radford, Luke Metz, and Soumith Chintala. 2015. Unsupervised representation learning with deep convolutional generative adversarial networks. arXiv preprint arXiv:1511.06434(2015).Google Scholar
- Mehdi SM Sajjadi, Bernhard Scholkopf, and Michael Hirsch. 2017. Enhancenet: Single image super-resolution through automated texture synthesis. In Proceedings of the IEEE International Conference on Computer Vision. 4491–4500.Google ScholarCross Ref
- Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556(2014).Google Scholar
- Henry Stark and Peyma Oskoui. 1989. High-resolution image recovery from image-plane arrays, using convex projections. JOSA A 6, 11 (1989), 1715–1726.Google ScholarCross Ref
- Ying Tai, Jian Yang, and Xiaoming Liu. 2017. Image super-resolution via deep recursive residual network. In Proceedings of the IEEE conference on computer vision and pattern recognition. 3147–3155.Google ScholarCross Ref
- Ying Tai, Jian Yang, Xiaoming Liu, and Chunyan Xu. 2017. Memnet: A persistent memory network for image restoration. In Proceedings of the IEEE international conference on computer vision. 4539–4547.Google ScholarCross Ref
- Radu Timofte, Rasmus Rothe, and Luc Van Gool. 2016. Seven ways to improve example-based single image super resolution. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1865–1873.Google ScholarCross Ref
- Josiah Wang, Katja Markert, Mark Everingham, 2009. Learning Models for Object Recognition from Natural Language Descriptions.. In BMVC, Vol. 1. 2.Google Scholar
- Xintao Wang, Ke Yu, Shixiang Wu, Jinjin Gu, Yihao Liu, Chao Dong, Yu Qiao, and Chen Change Loy. 2018. Esrgan: Enhanced super-resolution generative adversarial networks. In Proceedings of the European conference on computer vision (ECCV) workshops. 0–0.Google Scholar
- Chih-Yuan Yang and Ming-Hsuan Yang. 2013. Fast direct super-resolution by simple functions. In Proceedings of the IEEE international conference on computer vision. 561–568.Google ScholarDigital Library
- Jianchao Yang, Zhaowen Wang, Zhe Lin, Scott Cohen, and Thomas Huang. 2012. Coupled dictionary training for image super-resolution. IEEE transactions on image processing 21, 8 (2012), 3467–3478.Google Scholar
- Jianchao Yang, John Wright, Thomas S Huang, and Yi Ma. 2010. Image super-resolution via sparse representation. IEEE transactions on image processing 19, 11 (2010), 2861–2873.Google ScholarDigital Library
- Ke Yu, Chao Dong, Liang Lin, and Chen Change Loy. 2018. Crafting a toolchain for image restoration by deep reinforcement learning. In Proceedings of the IEEE conference on computer vision and pattern recognition. 2443–2452.Google ScholarCross Ref
- Yulun Zhang, Kunpeng Li, Kai Li, Lichen Wang, Bineng Zhong, and Yun Fu. 2018. Image super-resolution using very deep residual channel attention networks. In Proceedings of the European conference on computer vision (ECCV). 286–301.Google ScholarDigital Library
- Yulun Zhang, Yapeng Tian, Yu Kong, Bineng Zhong, and Yun Fu. 2018. Residual dense network for image super-resolution. In Proceedings of the IEEE conference on computer vision and pattern recognition. 2472–2481.Google ScholarCross Ref
- Hang Zhao, Orazio Gallo, Iuri Frosio, and Jan Kautz. 2015. Loss functions for neural networks for image processing. arXiv preprint arXiv:1511.08861(2015).Google Scholar
Recommendations
Dilated projection correction network based on autoencoder for hyperspectral image super-resolution
AbstractThis paper focuses on improving the spatial resolution of the hyperspectral image (HSI) by taking the prior information into consideration. In recent years, single HSI super-resolution methods based on deep learning have achieved good ...
Highlights- We transform the single HSI SR problem into the abundance domain.
- We deploy a ...
Super-Resolution Image Reconstruction Using Wavelet Based Patch and Discrete Wavelet Transform
This paper proposes a novel method that combines the discrete wavelet transform (DWT) and example-based technique to reconstruct a high-resolution from a low-resolution image. Although previous interpolation- and example-based methods consider the ...
Image Super-Resolution Using Deep Belief Networks
ICIMCS '14: Proceedings of International Conference on Internet Multimedia Computing and ServiceIn this paper, we aim at using Deep Belief Networks (DBNs) to solve the problem of image super-resolution (SR). We exploit the hierarchical structure of the DBNs to capture the non-linear mapping from low-resolution (LR) patches to their high-resolution ...
Comments