skip to main content
10.1145/1835449.1835533acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article

Visual summarization of web pages

Published: 19 July 2010 Publication History

Abstract

Visual summarization is a attractive new scheme to summarize web pages, which can help achieve a more friendly user experience in search and re-finding tasks by allowing users quickly get the idea of what the web page is about and helping users recall the visited web page. In this paper, we perform a careful study on the recently proposed visual summarization approaches, including the thumbnail of the web page snapshot, the internal image in the web page which is representative of the content in the page, and the visual snippet which is a synthesized image based on the internal image, the title, and the logo found in the web page. Moreover, since the internal image based summarization approach hardly works when the representative internal images are unavailable, we propose a new strategy, which retrieves the representative image from the external to summarize the web page. The experimental results suggest that the various summarization approaches have respective advantages on different types of web pages. While internal images and thumbnails can provide a reliable summarization on web pages with dominant images and web pages with simple structure respectively, the external images are regarded as a useful information to complement the internal images and are demonstrated very useful in helping users understanding new web pages . The visual snippet performs well on the re-finding tasks since it incorporates the title and logo which are advantageous on identifying the visited web pages.

References

[1]
Bing API. http://www.bing.com/developers.
[2]
M. Chen, J.-T. Sun, H.-J. Zeng, and K.-Y. Lam. A practical system of keyphrase extraction for web pages. In CIKM '05: Proceedings of the 14th ACM international conference on Information and knowledge management, pages 277--278, New York, NY, USA, 2005. ACM.
[3]
A. Cockburn, S. Greenberg, B. McKenzie, M. Jasonsmith, and S. Kaasten. Webview: A graphical aid for revisiting web pages, 1999.
[4]
A. Cockburn and B. Mckenzie. What do web users do? an empirical analysis of web use. International Journal of Human-Computer Studies, 54:903--922, 2000.
[5]
S. Dziadosz and R. Chandrasekar. Do thumbnail previews help users make better relevance decisions about web search results? In SIGIR '02: Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, pages 365--366, New York, NY, USA, 2002. ACM.
[6]
FastDial. https://addons.mozilla.org/en-us/firefox/addon/5721.
[7]
Google. http://www.google.com.
[8]
Y. Jing and S. Baluja. Visualrank: Applying pagerank to large-scale image search. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 30(11):1877--1890, May 2008.
[9]
T. Joachims. Optimizing search engines using clickthrough data. In KDD '02: Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, pages 133--142, New York, NY, USA, 2002. ACM.
[10]
S. Kaasten, S. Greenberg, and C. Edwards. How people recognize previously seen web pages from titles, urls and thumbnails. In Proceedings of Human Computer Interaction, pages 247--265, 2001.
[11]
T. Kopetzky and M. Muhlhauser. Visual preview for link traversal on the world wide web. In WWW '99: Proceedings of the eighth international conference on World Wide Web, pages 1525--1532, New York, NY, USA, 1999.
[12]
Z. Li and L. Zhang. Improving relevance judgment of web search results with image excerpts. In WWW '08: Proceedings of the 17th international conference on World Wide Web, pages 21--30, April 2008.
[13]
Y.-F. Ma and H.-J. Zhang. Contrast-based image attention analysis by using fuzzy growing. In MULTIMEDIA '03: Proceedings of the eleventh ACM international conference on Multimedia, pages 374--381, New York, NY, USA, 2003. ACM.
[14]
T. Maekawa, T. Hara, and S. Nishio. Image classification for mobile web browsing. In WWW '06: Proceedings of the 15th international conference on World Wide Web, pages 43--52, New York, NY, USA, 2006. ACM.
[15]
Safari4. http://www.apple.com/safari.
[16]
G. Salton and C. Buckley. Term-weighting approaches in automatic text retrieval. Inf. Process. Manage., 24(5):513--523, 1988.
[17]
J. Teevan, E. Cutrell, D. Fisher, S. M. Drucker, G. Ramos, P. Andr2e, and C. Hu. Visual snippets: summarizing web pages for search and revisitation. In CHI '09: Proceedings of the 27th international conference on Human factors in computing systems, pages 2023--2032, New York, NY, USA, 2009. ACM.
[18]
Viewzi. http://www.viewzi.com/search/webscreenshot.
[19]
A. Woodruff, A. Faulring, R. Rosenholtz, J. Morrsion, and P. Pirolli. Using thumbnails to search the web. In CHI '01: Proceedings of the 19th international conference on Human factors in computing systems, pages 198--205, New York, NY, USA, 2001. ACM.
[20]
A. Woodruff, R. Rosenholtz, J. B. Morrison, A. Faulring, and P. Pirolli. A comparison of the use of text summaries, plain thumbnails, and enhanced thumbnails for web search tasks. J. Am. Soc. Inf. Sci. Technol., 53(2):172--185, 2002.
[21]
Q. Yu, S. Shi, Z. Li, J.-R. Wen, and W.-Y. Ma. Improve ranking by using image information. In ECIR'07: Proceedings of the 29th European conference on IR research, pages 645--652, 2007.

Cited By

View all
  • (2023)Summarizing Web Archive Corpora via Social Media Storytelling by Automatically Selecting and Visualizing ExemplarsACM Transactions on the Web10.1145/360603018:1(1-48)Online publication date: 11-Oct-2023
  • (2021)Automatically Selecting Striking Images for Social CardsProceedings of the 13th ACM Web Science Conference 202110.1145/3447535.3462505(36-45)Online publication date: 21-Jun-2021
  • (2020)Predicting Visual Importance Across Graphic Design TypesProceedings of the 33rd Annual ACM Symposium on User Interface Software and Technology10.1145/3379337.3415825(249-260)Online publication date: 20-Oct-2020
  • Show More Cited By

Index Terms

  1. Visual summarization of web pages

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      SIGIR '10: Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
      July 2010
      944 pages
      ISBN:9781450301534
      DOI:10.1145/1835449
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 19 July 2010

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. visual summarization
      2. web page summarization

      Qualifiers

      • Research-article

      Conference

      SIGIR '10
      Sponsor:

      Acceptance Rates

      SIGIR '10 Paper Acceptance Rate 87 of 520 submissions, 17%;
      Overall Acceptance Rate 792 of 3,983 submissions, 20%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)7
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 17 Jan 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2023)Summarizing Web Archive Corpora via Social Media Storytelling by Automatically Selecting and Visualizing ExemplarsACM Transactions on the Web10.1145/360603018:1(1-48)Online publication date: 11-Oct-2023
      • (2021)Automatically Selecting Striking Images for Social CardsProceedings of the 13th ACM Web Science Conference 202110.1145/3447535.3462505(36-45)Online publication date: 21-Jun-2021
      • (2020)Predicting Visual Importance Across Graphic Design TypesProceedings of the 33rd Annual ACM Symposium on User Interface Software and Technology10.1145/3379337.3415825(249-260)Online publication date: 20-Oct-2020
      • (2019)The usefulness of multimedia surrogates for making relevance judgments about digital video objectsInformation Processing and Management: an International Journal10.1016/j.ipm.2019.10209156:6Online publication date: 1-Nov-2019
      • (2017)Learning Visual Importance for Graphic Designs and Data VisualizationsProceedings of the 30th Annual ACM Symposium on User Interface Software and Technology10.1145/3126594.3126653(57-69)Online publication date: 20-Oct-2017
      • (2017)MyWebSteps: Aiding Revisiting with a Visual Web HistoryInteracting with Computers10.1093/iwc/iww038Online publication date: 13-Jan-2017
      • (2015)Landmark Summarization With Diverse ViewpointsIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2014.236973125:11(1857-1869)Online publication date: 1-Nov-2015
      • (2014)Similarity Preserving Snippet-Based Visualization of Web Search ResultsIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2013.24220:3(457-470)Online publication date: 1-Mar-2014
      • (2014)Visual and textual summarization of webpages2014 International Conference on Data Mining and Intelligent Computing (ICDMIC)10.1109/ICDMIC.2014.6954267(1-5)Online publication date: Sep-2014
      • (2013)Augmenting web search surrogates with imagesProceedings of the 22nd ACM international conference on Information & Knowledge Management10.1145/2505515.2505714(399-408)Online publication date: 27-Oct-2013
      • Show More Cited By

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media