Article

Dynamic structuring of web information for access visualization

Authors:
Jess Y. S. Mak, Hong Va Leong, Alvin T. S. Chan
Hong Kong Polytechnic University, Hong Kong
SAC '02: Proceedings of the 2002 ACM symposium on Applied computing, March 2002, Pages 778–784, https://doi.org/10.1145/508791.508942

Published: 11 March 2002

ABSTRACT

The Internet has led to the formation of a global information infrastructure. When exploring a web site, a site map is a useful shortcut that lets a user locate target information in a structured and efficient manner, rather than drilling into the site by following hyperlinks and reading possibly irrelevant information. Irrelevant information is especially costly in a mobile web environment, where mobile clients are connected only through unreliable wireless channels of limited bandwidth. Structured organization of web pages at the web server proxy is therefore an important issue to resolve in order to provide an efficient browsing experience for web clients while minimizing the browsing of unrelated pages or sites. In this paper, we adopt the Document Information Extraction mechanism to construct a document cluster dynamically and intelligently with respect to a requested root web page. The document cluster works like a dynamic site map that spans several web sites. The clusters are generated and stored in XML format at the proxy server so that they can potentially benefit a large number of mobile clients. Clients process the XML clusters and transform them for visualization through VRML or DOM. For VRML, a transformer is built at the client side to support a three-dimensional modeling view. For DOM, JavaScript is used to access the parsed XML data and produce a two-dimensional tree output.
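The idea of turning an XML cluster into a two-dimensional tree can be illustrated with a minimal sketch. The abstract does not give the cluster schema, so the nested `<page>` elements, the `url` and `title` attributes, the sample URLs, and the `render_tree` helper below are all assumptions made for illustration. The sketch is written in Python rather than the client-side JavaScript the paper describes.

```python
import xml.etree.ElementTree as ET
from typing import List

# Hypothetical cluster document: the real schema is not given in the abstract.
# Nesting of <page> elements stands in for the cluster hierarchy, which may
# span several web sites (note the example.net entry).
SAMPLE_CLUSTER = """\
<cluster root="http://example.org/">
  <page url="http://example.org/" title="Home">
    <page url="http://example.org/research" title="Research">
      <page url="http://example.net/paper" title="Paper"/>
    </page>
    <page url="http://example.org/contact" title="Contact"/>
  </page>
</cluster>
"""


def render_tree(xml_text: str) -> List[str]:
    """Return one indented text line per page, in document order."""
    root = ET.fromstring(xml_text)
    lines: List[str] = []

    def walk(node: ET.Element, depth: int) -> None:
        # Visit only direct <page> children so depth mirrors the nesting.
        for child in node.findall("page"):
            indent = "  " * depth
            lines.append(f"{indent}{child.get('title')} ({child.get('url')})")
            walk(child, depth + 1)

    walk(root, 0)
    return lines


if __name__ == "__main__":
    print("\n".join(render_tree(SAMPLE_CLUSTER)))
```

Keeping the cluster as nested XML means the same document could, in principle, feed either a tree view like this one or a three-dimensional renderer, since both only need to walk the hierarchy.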


Published in
SAC '02: Proceedings of the 2002 ACM symposium on Applied computing
March 2002
1200 pages
ISBN:1581134452
DOI:10.1145/508791
Conference Chair: Gary B. Lamont (Air Force Institute of Technology, USA)
Program Chairs: Hisham Haddad (Kennesaw State Univ., USA), George Papadopoulos (Univ. of Cyprus, Cyprus)
Publications Chair: Brajendra Panda (University of Arkansas, USA)
Copyright © 2002 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
DOM
VRML
XML
visualization
web document structure
Qualifiers
- Article
Acceptance Rates
Overall Acceptance Rate: 1,650 of 6,669 submissions, 25%
Article Metrics
- Total Citations: 2
- Total Downloads: 564
- Downloads (Last 12 months): 0
- Downloads (Last 6 weeks): 0