Deep semantic segmentation-based multiple description coding

Li, Xue; Meng, Lili; Tan, Yanyan; Zhang, Jia; Wan, Wenbo; Zhang, Huaxiang

doi:10.1007/s11042-020-09283-w

Deep semantic segmentation-based multiple description coding

Published: 19 November 2020

Volume 80, pages 10323–10337, (2021)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Xue Li^1,2,
Lili Meng ORCID: orcid.org/0000-0002-8024-1669^1,2,
Yanyan Tan^1,2,
Jia Zhang^1,2,
Wenbo Wan^1,2 &
…
Huaxiang Zhang^1,2

313 Accesses
6 Citations
Explore all metrics

Abstract

In this paper, we propose a deep semantic segmentation-based multiple description coding (DSSMDC). In the proposed scheme, the input image is divided into two different subsets and getting two descriptions, which is named multiple description pre-processing (MDP). Then, two descriptions are encoded and decoded respectively by utilizing the deep semantic segmentation codec, in which the semantic segmentation label is as side information for improving image reconstruction quality. We can get one side reconstruction, when only one description is received at the decoder. If both descriptions are received at the decoder, we can get the central reconstruction. Experimental results show that the proposed scheme achieves better performance than other existing compression methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Multiple description coding network based on semantic segmentation

Article 01 April 2022

Adaptive reconstruction based multiple description coding with randomly offset quantizations

Article 15 March 2018

An End-to-End Mutual Enhancement Network Toward Image Compression and Semantic Segmentation

References

Agustsson E, Mentzer F, Tschannen M, Cavigelli L, Timofte R, Benini L, Gool LV (2017) Soft-to-hard vector quantization for end-to-end learning compressible representations. In: Advances in Neural Information Processing Systems, pp 1141–1151
Agustsson E, Tschannen M, Mentzer F, timofte R, Gool LV (2019) Generative adversarial networks for extreme learned image compression. arXiv:1804.02958v3
Baccaglini, Tillo T, Olmo G (2007) A flexible r-d-based multiple description scheme for jpeg 2000. IEEE Signal Process Lett 14(3):197–200
Article Google Scholar
Ballé J, Laparra V, Simoncelli EP (2017) End-to-end optimized image compression. arXiv:1611.01704v3
Feng J, Wen T, Liu S, Jie R, Xun G, Zhao D (2017) An end-to-end compression framework based on convolutional neural networks IEEE Transactions on Circuits & Systems for Video Technology, PP(99) 1–1
Geng Y, Liang R-Z, Li W, Wang J, Liang G, Xu C, Wang J-Y (2016) Learning convolutional neural network to maximize pos@ top performance measure. arXiv:1609.08417
Geng Y, Zhang G, Li W, Gu Y, Liang R-Z, Liang G, Wang J, Wu Y, Patil N, Wang JY (2017) A novel image tag completion method based on convolutional neural transformation. In: International conference on artificial neural networks. Springer, New York, pp 539–546
Goyal VK (2001) Multiple description coding: compression meets the network. Signal Process Magazine IEEE 18(5):74–93
Article Google Scholar
Guoqian S, Upul S, Jie L, Chao T, Chengjie NT, Tran TD (2009) Multiple description coding with prediction compensation. IEEE Trans Image Process 18(5):1037–1047
Article MathSciNet Google Scholar
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778
Johnston N, Vincent D, Minnen D, Covell M, Singh S, Chinen T, Hwang SJ, Shor J, Toderici G (2018) Improved lossy image compression with priming and spatially adaptive bit rates for recurrent networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 4385–4393
Lili M, Jie L, Upul S, Yao Z, Huihui B, Kaup A (2014) Multiple description coding with randomly and uniformly offset quantizers. IEEE Trans Image Process 23(2):582–595
Article MathSciNet Google Scholar
Lin C, Tillo T, Zhao Y, Jeon B (2011) Multiple description coding for h. 264/avc with redundancy allocation at macro block level. IEEE Trans Circ Syst Video Technol 21(5):589–600
Article Google Scholar
Liu M, Zhu C (2009) Enhancing two-stage multiple description scalar quantization. IEEE Signal Process Lett 16(4):253–256
Article MathSciNet Google Scholar
Luo S, Yang Y, Yin Y, Shen C, Zhao Y, Song M (2018) Deepsic: Deep semantic image compression. In: International Conference on Neural Information Processing. Springer, New York, pp 96–106
Memos VA, Psannis KE (2016) Encryption algorithm for efficient transmission of hevc media. J Real-Time Image Proc 12(2):473–482
Article Google Scholar
Memos VA, Psannis KE, Ishibashi Y, Kim B-G, Gupta BB (2017) An efficient algorithm for media-based surveillance system (eamsus) in iot smart city framework. Future Gener Comput Syst 83(JUN):619–628
Google Scholar
Mohammad Akbari M, Liang J, Han J (2018) Dsslic: Deep semantic segmentation-based layered image compression. arXiv:1806.03348
Psannis KE, Stergiou C, Gupta BB (2018) Advanced media-based smart big data on intelligent cloud systems. IEEE Trans Sustain Comput 4(1):77–87
Article Google Scholar
Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. Computer Science
Stergiou C, Psannis KE, Kim BG, Gupta B (2018) Secure integration of iot and cloud computing. Future Gener Comput Syst 78(PT.3):964–975
Article Google Scholar
Stergiou C, Psannis KE, Plageras AP, Ishibashi Y, Kim B-G (2018) Algorithms for efficient digital media transmission over iot and cloud networking
Tammam T, Marco G, Gabriella O (2007) Multiple description image coding based on lagrangian rate allocation. IEEE Trans Image Process 16 (3):673–683
Article MathSciNet Google Scholar
Theis L, Shi W, Cunningham A, huszár F (2017) Lossy image compression with compressive autoencoders. arXiv:1703.00395
Tillo T, Grangetto M, Olmo G (2007) Multiple description image coding based on lagrangian rate allocation. IEEE Trans Image Process 16(3):673–683
Article MathSciNet Google Scholar
Tillo T, Olmo G (2004) A novel multiple description coding scheme compatible with the jpeg2000 decoder. IEEE Signal Process Lett 11(11):908–911
Article Google Scholar
Toderici G, O’Malley SM, Hwang SJ, Vincent D, Minnen D, Baluja S, Covell M, Sukthankar R (2015) Variable rate image compression with recurrent neural networks. Computer Science
Toderici G, Vincent D, Johnston N, Hwang SJ, Minnen D, Shor J, Covell M (2017) Full resolution image compression with recurrent neural networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 5306–5314
Vaishampayan VA (1993) Design of multiple description scalar quantizers. IEEE Trans Inform Theor 39(3):821–834
Article MathSciNet Google Scholar
Wang T-C, Liu M-Y, Zhu J-Y, Tao A, Kautz J, Catanzaro B (2018) High-resolution image synthesis and semantic manipulation with conditional gans. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 8798–8807
Xu Y, Zhu C (2013) End-to-end rate-distortion optimized description generation for h. 264 multiple description video coding. IEEE Trans Circ Syst Video Technol 23(9):1523–1536
Article Google Scholar
Zhang G, Liang G, Li W, Fang J, Wang J, Geng Y, Wang J-Y (2017) Learning convolutional ranking-score function by query preference regularization. In: International conference on intelligent data engineering and automated learning. Springer, New York, pp 1–8
Zhang G, Liang G, Su F, Qu F, Wang JY (2018) Cross-domain attribute representation based on convolutional neural network. In: International Conference on Intelligent Computing. Springer, New York, pp 134–142
Zhao L, Bai H, Wang A, Zhao Y (2018) Multiple description convolutional neural networks for image compression. IEEE Trans Circ Syst Video Technol 1–1
Zhao H, Gallo O, Frosio I, Kautz J (2017) Loss functions for image restoration with neural networks. IEEE Trans Comput Imaging 3(1):47–57
Article Google Scholar
Zhao L, Huihui B, Wang A, Zhao Y (2017) Learning a virtual codec based on deep convolutional neural network to compress image. arXiv:1712.05969
Zhao H, Shi J, Qi X, Wang X, Jia J (2016) Pyramid scene parsing network
Zhou B, Hang Z, Puig X, Fidler S, Barriuso A, Torralba A (2017) Scene parsing through ade20k dataset. In: Computer Vision & Pattern Recognition
Zhu C, Liu M (2009) Multiple description video coding based on hierarchical b pictures. IEEE Trans Circ Syst Video Technol 19(4):511–521
Article MathSciNet Google Scholar

Download references

Author information

Authors and Affiliations

School of Information Science and Engineering, Shandong Normal University, Jinan, 250014, China
Xue Li, Lili Meng, Yanyan Tan, Jia Zhang, Wenbo Wan & Huaxiang Zhang
Institute of Data Science and Technology, Shandong Normal University, Jinan, 250014, China
Xue Li, Lili Meng, Yanyan Tan, Jia Zhang, Wenbo Wan & Huaxiang Zhang

Authors

Xue Li
View author publications
You can also search for this author in PubMed Google Scholar
Lili Meng
View author publications
You can also search for this author in PubMed Google Scholar
Yanyan Tan
View author publications
You can also search for this author in PubMed Google Scholar
Jia Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Wenbo Wan
View author publications
You can also search for this author in PubMed Google Scholar
Huaxiang Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Lili Meng.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Li, X., Meng, L., Tan, Y. et al. Deep semantic segmentation-based multiple description coding. Multimed Tools Appl 80, 10323–10337 (2021). https://doi.org/10.1007/s11042-020-09283-w

Download citation

Received: 18 May 2019
Revised: 07 May 2020
Accepted: 29 June 2020
Published: 19 November 2020
Issue Date: March 2021
DOI: https://doi.org/10.1007/s11042-020-09283-w

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Deep semantic segmentation-based multiple description coding

Abstract

Access this article

Similar content being viewed by others

Multiple description coding network based on semantic segmentation

Adaptive reconstruction based multiple description coding with randomly offset quantizations

An End-to-End Mutual Enhancement Network Toward Image Compression and Semantic Segmentation

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Deep semantic segmentation-based multiple description coding

Abstract

Access this article

Similar content being viewed by others

Multiple description coding network based on semantic segmentation

Adaptive reconstruction based multiple description coding with randomly offset quantizations

An End-to-End Mutual Enhancement Network Toward Image Compression and Semantic Segmentation

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation