Localized layout analysis for retargeting of heterogeneous images

Multimedia Tools and Applications

Abstract

Heterogeneous paper documents, such as newspapers and magazines, are common in daily life. They are usually scanned and stored as images. Reading such images on a mobile device is awkward: to keep the text readable, only part of the image can be displayed at a time, so the user must frequently switch among different portions of the image. It would be helpful if the system could automatically determine an appropriate reading area around the user’s click position and retarget that area to the whole screen. In this paper, we propose a localized layout analysis method for retargeting heterogeneous images. When the user clicks on a fully displayed heterogeneous image, our method automatically extracts an appropriate rectangular region and scales it to fill the screen for reading. The region is semantically meaningful, and its content is guaranteed to be clear enough when displayed full screen. Experimental results show that our method effectively avoids the tedious scaling and translation operations otherwise needed when reading heterogeneous images, and thus greatly improves the user experience.
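
The final step described in the abstract, scaling a chosen rectangular region to the screen while keeping its content legible, can be illustrated with a minimal sketch. This is not the paper's layout analysis: the `Rect` type, the function names, and the 12-pixel legibility threshold are all hypothetical, chosen only to show how a candidate region might be checked against a readability constraint.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rect:
    """Axis-aligned rectangle in pixels (hypothetical helper type)."""
    x: float
    y: float
    w: float
    h: float


def fit_scale(region: Rect, screen_w: float, screen_h: float) -> float:
    """Largest uniform scale at which the region still fits on the screen."""
    return min(screen_w / region.w, screen_h / region.h)


def is_readable(region: Rect, screen_w: float, screen_h: float,
                text_height_px: float, min_text_px: float = 12.0) -> bool:
    """Check that text in the region stays legible after scaling.

    text_height_px is the text height in the source image; min_text_px is
    an assumed legibility threshold, not a value taken from the paper.
    """
    scale = fit_scale(region, screen_w, screen_h)
    return scale * text_height_px >= min_text_px


def retarget(region: Rect, screen_w: float, screen_h: float) -> Rect:
    """Return the on-screen rectangle of the scaled, centered region."""
    scale = fit_scale(region, screen_w, screen_h)
    w, h = region.w * scale, region.h * scale
    return Rect((screen_w - w) / 2, (screen_h - h) / 2, w, h)
```

Under these assumptions, a candidate region that fails `is_readable` is too large for the screen and would need to be shrunk before being shown; how the region is actually selected around the click position is the subject of the paper itself.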

Author information

Corresponding author

Correspondence to Juncong Lin.

Electronic supplementary material

Below is the link to the electronic supplementary material.

(MP4 15.7 MB)

About this article

Cite this article

Gao, X., Zhang, G., Lin, J. et al. Localized layout analysis for retargeting of heterogeneous images. Multimed Tools Appl 77, 21163–21184 (2018). https://doi.org/10.1007/s11042-017-5405-3

  • DOI: https://doi.org/10.1007/s11042-017-5405-3
