skip to main content
10.1145/1012807.1012829acmconferencesArticle/Chapter ViewAbstractPublication PageshtConference Proceedingsconference-collections
Article

The site browser: catalyzing improvements in hypertext organization

Published:09 August 2004Publication History

ABSTRACT

The Site Browser endeavors to build an overview browsing system for the entire Web. Overview browsing represents an alternative to the search-based view of information work, and does so by providing a consistent set of summary views which can be browsed interactively. The views partition and linearize the corpus for ready understanding and exploration. They show a web site's relation to other sites, the broad nature of the information it contains and how it is structured, and how it has changed over time. The design challenge is to generate useful summary information in a process which is fast enough to be updated daily. Our current system maintains a continuously updated archive of 46 million sites representing 2.3 billion web pages.

References

  1. Alexa. http://www.alexa.com.Google ScholarGoogle Scholar
  2. E. Amitay, D. Carmel, A. Darlow, R. Lempel, and A. Soffer. The Connectivity Sonar: detecting site functionality by structural patterns. In Proceedings of ACM Hypertext '03, pages 38--47. ACM Press, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Z. Bar-Yossef and S. Rajagopalan. Template detection via data mining and its applications. In Proceedings of the 11th International World Wide Web Conference (WWW 2002), 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. T. Berners-Lee, J. Hendler, and O. Lassila. The Semantic Web. Scientific American, May 2001.Google ScholarGoogle ScholarCross RefCross Ref
  5. L. Y. Bing~Liu, Kaidi~Zhao. Visualizing web site comparisons. In Proceedings of the 11th International World Wide Web Conference (WWW 2002), pages 693--703, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. V. Boyapati, K. Chevrier, A. Finkel, N. Glance, T. Pierce, R. Stockton, and C. Whitmer. ChangeDetector{tm}: a site-level monitoring tool for the WWW. In Proceedings of the 11th International World Wide Web Conference (WWW 2002), pages 570--579, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. S. Brin, R. Motwani, L. Page, and T. Winograd. What can you do with a web in your pocket? Data Engineering Bulletin, 21(2):37--47, 1998.Google ScholarGoogle Scholar
  8. V. Bush. As we may think. The Atlantic Monthly, July 1945.Google ScholarGoogle Scholar
  9. J. Cho and S. Roy. Impact of web search engines on page popularity. In Proceedings of the 13th International World Wide Web Conference (WWW2004), 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. P. Dave, U. P. Karadkar, R. Furuta, L. Francisco-Revilla, F. Shipman, S. Dash, and Z. Dalal. Browsing intricately interconnected paths. In Proceedings of ACM Hypertext '03, pages 95--103. ACM Press, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. S. Dill, N. Eiron, D. Gibson, D. Gruhl, R. Guha, A. Jhingran, T. Kanungo, S. Rajagopalan, A. Tomkins, J. A. Tomlin, and J. Y. Zien. Semtag and seeker: Bootstrapping the semantic web via automated semantic annotation. In Proceedings of the 12th International World Wide Web Conference (WWW2003), May 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. S. Dill, N. Eiron, D. Gibson, D. Gruhl, A. Jhingran, T. Kanungo, K. S. McCurley, S. Rajagopalan, A. Tomkins, J. A. Tomlin, and J. Y. Zien. Seeker: An architecture for web-scale text analytics. Technical Report RJ 10233 (95107), IBM Research, February 2002.Google ScholarGoogle Scholar
  13. N. Eiron and K. S. McCurley. Untangling compound documents on the web. In Proceedings of ACM Hypertext '03, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. T. Haveliwala. Efficient encodings for document ranking vectors. In International Conference on Internet Computing, 2003.Google ScholarGoogle Scholar
  15. M. Hearst. User interfaces and visualization. In R. Baeza-Yates and B. Ribeiro-Neto (Eds.) Modern information retrieval. NY: ACM Press., 1999.Google ScholarGoogle Scholar
  16. Y. Maarek and I. Shaul. Webcutter: A system for dynamic and tailorable site mapping. In Proceedings of the 6th International World Wide Web Conference, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. G. Marchionini and B. Brunk. Toward a general relation browser: A GUI for information architects. In Journal of Digital Information, volume 4, 2003.Google ScholarGoogle Scholar
  18. K. S. McCurley. Geospatial mapping and navigation of the web. In Proceedings of the 10th International World Wide Web Conference (WWW2001), pages 221--229, Hong Kong, China, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. D. Nation, C. Plaisant, G. Marchionini, and A. Komlodi. Visualizing websites using a hierarchical table of contents browser: WebTOC. In Designing for the Web: Practices and Reflections, 1997.Google ScholarGoogle Scholar
  20. D. Quan and D. Karger. How to make a semantic web browser. In Proceedings of the 13th International World Wide Web Conference (WWW2004), 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. A. J. Sellen, R. Murphy, and K. L. Shaw. How knowledge workers use the web. In Proceedings of the SIGCHI conference on Human factors in computing systems, pages 227--234. ACM Press, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. J. Teevan, C. Alvarado, M. S. Ackerman, and D. R. Karger. The perfect search engine is not enough: A study of orienteering behavior in directed search. In Proceedings of the SIGCHI conference on Human factors in computing systems. ACM Press, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. K.-P. Yee, K. Swearingen, K. Li, and M. Hearst. Faceted metadata for image search and browsing. In Proceedings of the SIGCHI conference on Human factors in computing systems, pages 401--408. ACM Press, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. The site browser: catalyzing improvements in hypertext organization

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          HYPERTEXT '04: Proceedings of the fifteenth ACM conference on Hypertext and hypermedia
          August 2004
          284 pages
          ISBN:1581138482
          DOI:10.1145/1012807

          Copyright © 2004 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 9 August 2004

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • Article

          Acceptance Rates

          Overall Acceptance Rate378of1,158submissions,33%

          Upcoming Conference

          HT '24
          35th ACM Conference on Hypertext and Social Media
          September 10 - 13, 2024
          Poznan , Poland

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader