Skip to main content
Log in

Deep semantic segmentation-based multiple description coding

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

In this paper, we propose a deep semantic segmentation-based multiple description coding (DSSMDC). In the proposed scheme, the input image is divided into two different subsets and getting two descriptions, which is named multiple description pre-processing (MDP). Then, two descriptions are encoded and decoded respectively by utilizing the deep semantic segmentation codec, in which the semantic segmentation label is as side information for improving image reconstruction quality. We can get one side reconstruction, when only one description is received at the decoder. If both descriptions are received at the decoder, we can get the central reconstruction. Experimental results show that the proposed scheme achieves better performance than other existing compression methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

References

  1. Agustsson E, Mentzer F, Tschannen M, Cavigelli L, Timofte R, Benini L, Gool LV (2017) Soft-to-hard vector quantization for end-to-end learning compressible representations. In: Advances in Neural Information Processing Systems, pp 1141–1151

  2. Agustsson E, Tschannen M, Mentzer F, timofte R, Gool LV (2019) Generative adversarial networks for extreme learned image compression. arXiv:1804.02958v3

  3. Baccaglini, Tillo T, Olmo G (2007) A flexible r-d-based multiple description scheme for jpeg 2000. IEEE Signal Process Lett 14(3):197–200

    Article  Google Scholar 

  4. Ballé J, Laparra V, Simoncelli EP (2017) End-to-end optimized image compression. arXiv:1611.01704v3

  5. Feng J, Wen T, Liu S, Jie R, Xun G, Zhao D (2017) An end-to-end compression framework based on convolutional neural networks IEEE Transactions on Circuits & Systems for Video Technology, PP(99) 1–1

  6. Geng Y, Liang R-Z, Li W, Wang J, Liang G, Xu C, Wang J-Y (2016) Learning convolutional neural network to maximize pos@ top performance measure. arXiv:1609.08417

  7. Geng Y, Zhang G, Li W, Gu Y, Liang R-Z, Liang G, Wang J, Wu Y, Patil N, Wang JY (2017) A novel image tag completion method based on convolutional neural transformation. In: International conference on artificial neural networks. Springer, New York, pp 539–546

  8. Goyal VK (2001) Multiple description coding: compression meets the network. Signal Process Magazine IEEE 18(5):74–93

    Article  Google Scholar 

  9. Guoqian S, Upul S, Jie L, Chao T, Chengjie NT, Tran TD (2009) Multiple description coding with prediction compensation. IEEE Trans Image Process 18(5):1037–1047

    Article  MathSciNet  Google Scholar 

  10. He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778

  11. Johnston N, Vincent D, Minnen D, Covell M, Singh S, Chinen T, Hwang SJ, Shor J, Toderici G (2018) Improved lossy image compression with priming and spatially adaptive bit rates for recurrent networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 4385–4393

  12. Lili M, Jie L, Upul S, Yao Z, Huihui B, Kaup A (2014) Multiple description coding with randomly and uniformly offset quantizers. IEEE Trans Image Process 23(2):582–595

    Article  MathSciNet  Google Scholar 

  13. Lin C, Tillo T, Zhao Y, Jeon B (2011) Multiple description coding for h. 264/avc with redundancy allocation at macro block level. IEEE Trans Circ Syst Video Technol 21(5):589–600

    Article  Google Scholar 

  14. Liu M, Zhu C (2009) Enhancing two-stage multiple description scalar quantization. IEEE Signal Process Lett 16(4):253–256

    Article  MathSciNet  Google Scholar 

  15. Luo S, Yang Y, Yin Y, Shen C, Zhao Y, Song M (2018) Deepsic: Deep semantic image compression. In: International Conference on Neural Information Processing. Springer, New York, pp 96–106

  16. Memos VA, Psannis KE (2016) Encryption algorithm for efficient transmission of hevc media. J Real-Time Image Proc 12(2):473–482

    Article  Google Scholar 

  17. Memos VA, Psannis KE, Ishibashi Y, Kim B-G, Gupta BB (2017) An efficient algorithm for media-based surveillance system (eamsus) in iot smart city framework. Future Gener Comput Syst 83(JUN):619–628

    Google Scholar 

  18. Mohammad Akbari M, Liang J, Han J (2018) Dsslic: Deep semantic segmentation-based layered image compression. arXiv:1806.03348

  19. Psannis KE, Stergiou C, Gupta BB (2018) Advanced media-based smart big data on intelligent cloud systems. IEEE Trans Sustain Comput 4(1):77–87

    Article  Google Scholar 

  20. Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. Computer Science

  21. Stergiou C, Psannis KE, Kim BG, Gupta B (2018) Secure integration of iot and cloud computing. Future Gener Comput Syst 78(PT.3):964–975

    Article  Google Scholar 

  22. Stergiou C, Psannis KE, Plageras AP, Ishibashi Y, Kim B-G (2018) Algorithms for efficient digital media transmission over iot and cloud networking

  23. Tammam T, Marco G, Gabriella O (2007) Multiple description image coding based on lagrangian rate allocation. IEEE Trans Image Process 16 (3):673–683

    Article  MathSciNet  Google Scholar 

  24. Theis L, Shi W, Cunningham A, huszár F (2017) Lossy image compression with compressive autoencoders. arXiv:1703.00395

  25. Tillo T, Grangetto M, Olmo G (2007) Multiple description image coding based on lagrangian rate allocation. IEEE Trans Image Process 16(3):673–683

    Article  MathSciNet  Google Scholar 

  26. Tillo T, Olmo G (2004) A novel multiple description coding scheme compatible with the jpeg2000 decoder. IEEE Signal Process Lett 11(11):908–911

    Article  Google Scholar 

  27. Toderici G, O’Malley SM, Hwang SJ, Vincent D, Minnen D, Baluja S, Covell M, Sukthankar R (2015) Variable rate image compression with recurrent neural networks. Computer Science

  28. Toderici G, Vincent D, Johnston N, Hwang SJ, Minnen D, Shor J, Covell M (2017) Full resolution image compression with recurrent neural networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 5306–5314

  29. Vaishampayan VA (1993) Design of multiple description scalar quantizers. IEEE Trans Inform Theor 39(3):821–834

    Article  MathSciNet  Google Scholar 

  30. Wang T-C, Liu M-Y, Zhu J-Y, Tao A, Kautz J, Catanzaro B (2018) High-resolution image synthesis and semantic manipulation with conditional gans. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 8798–8807

  31. Xu Y, Zhu C (2013) End-to-end rate-distortion optimized description generation for h. 264 multiple description video coding. IEEE Trans Circ Syst Video Technol 23(9):1523–1536

    Article  Google Scholar 

  32. Zhang G, Liang G, Li W, Fang J, Wang J, Geng Y, Wang J-Y (2017) Learning convolutional ranking-score function by query preference regularization. In: International conference on intelligent data engineering and automated learning. Springer, New York, pp 1–8

  33. Zhang G, Liang G, Su F, Qu F, Wang JY (2018) Cross-domain attribute representation based on convolutional neural network. In: International Conference on Intelligent Computing. Springer, New York, pp 134–142

  34. Zhao L, Bai H, Wang A, Zhao Y (2018) Multiple description convolutional neural networks for image compression. IEEE Trans Circ Syst Video Technol 1–1

  35. Zhao H, Gallo O, Frosio I, Kautz J (2017) Loss functions for image restoration with neural networks. IEEE Trans Comput Imaging 3(1):47–57

    Article  Google Scholar 

  36. Zhao L, Huihui B, Wang A, Zhao Y (2017) Learning a virtual codec based on deep convolutional neural network to compress image. arXiv:1712.05969

  37. Zhao H, Shi J, Qi X, Wang X, Jia J (2016) Pyramid scene parsing network

  38. Zhou B, Hang Z, Puig X, Fidler S, Barriuso A, Torralba A (2017) Scene parsing through ade20k dataset. In: Computer Vision & Pattern Recognition

  39. Zhu C, Liu M (2009) Multiple description video coding based on hierarchical b pictures. IEEE Trans Circ Syst Video Technol 19(4):511–521

    Article  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Lili Meng.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Li, X., Meng, L., Tan, Y. et al. Deep semantic segmentation-based multiple description coding. Multimed Tools Appl 80, 10323–10337 (2021). https://doi.org/10.1007/s11042-020-09283-w

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-020-09283-w

Keywords

Navigation