ABSTRACT
The Internet has led to the formation of a global information infrastructure. To explore a web site, a site map would be useful as a short cut for a user to locate for the target information in a structured and efficient manner, rather than drilling into the web site following hyperlinks, reading possibly irrelevant information. Useless information impacts a mobile web environment, where mobile clients are only connected with unreliable wireless channels of limited bandwidth. Structured web page organization at the web server proxy is an important issue to resolve to provide efficient browsing experience for web clients, while minimizing the browsing of unrelated pages or sites. In this paper, we adopt the Document Information Extraction mechanism to construct a document cluster dynamically and intelligently with respect to a requested root web page. The document cluster works like a dynamic site map, spanning across several web sites. The clusters are generated and stored in XML format at proxy server so that it can potentially benefit a large number of mobile clients. Clients process the XML clusters and transform them to be visualized through VRML or DOM. For VRML, a transformer is built at the client-side to support a three-dimensional modeling view. For DOM, JavaScript is used for accessing the parsed XML data to produce a two-dimensional tree output.
- N. Belkin and W. B. Croft. Information filtering and information retrieval: two sides of the same coin? Communications of the ACM,35(2):29-38, 1992. Google ScholarDigital Library
- H. Bharadvaj, A. Joshi and S. Auephanwiriyakul. An active transcoding proxy to support mobile web access. In Proceedings of International Conference on Reliable Distributed Systems, pp. 118-123, 1998. Google ScholarDigital Library
- C. Brooks, M. S. Murray, S. Meeks and J. Miller. Application-specific proxy servers as HTTP stream transducers. In Proceedings of the 4th WWW Conference, 1996.Google Scholar
- S. Brin and L. Page. The anatomy of a large-scale hypertextual search engine. In Proceedings of the 7th WWW Conference, pp. 107-117, 1998. Google ScholarDigital Library
- S. Chakrabarti, M. Dom, D. Gibson, J. Kleinerg and S. Rajagopalan. Automatic resource compilation by analyzing hyperlink structure and associated text. In Proceedings of 7th WWW Conference, pp. 65-74, 1998. Google ScholarDigital Library
- E. Chi. Web analysis visualization spreadsheet. In Proceedings of ACM Digital Libraries Workshop on Organizing Web Space, pp. 24-31, 1999.Google Scholar
- S. Geffner, D. Agrawal, A. El Abbadi, T. Smith and M. Larsgaard. Smart indexes for efficient browsing of library collections. In Proceedings of IEEE Advances in Digital Libraries Conference, pp. 107-116, 1998. Google ScholarDigital Library
- D. Gibson, J. Kleinberg and P. Raghavan. Structural analysis of the World Wide Web. In Proceedings of W3C Web Characterization Workshop, November 1998.Google Scholar
- B. Housel, G. Samaras and D. B. Lindquist. WebExpress: a client/intercept based system for optimizing web browsing in a wireless environment. Mobile Networks and Applications,3(4):419-431, 1998. Google ScholarDigital Library
- R. E. Kent and C. Neuss. Creating a web analysis and visualization environment. Computer Networks and ISDN Systems,28:109-117, 1995. Google ScholarDigital Library
- J. Kleinberg. Authoritative sources in a hyperlinked environment. In Proceedings of ACM-SIAM Symposium on Discrete Algorithm, 1998. Google ScholarDigital Library
- H. V. Leong. Browsing document clusters on mobile web. In Proceedings of ACM Digital Libraries Workshop on Organizing Web Space, pp. 76-90, 1999.Google Scholar
- H. V. Leong, D. McLeod, A. Si and S. M. T. Yau. On supporting weakly-connected browsing in a mobile web environment. In Proceedings of International Conference on Distributed Computing Systems, pp. 538-546, 2000. Google ScholarDigital Library
- H. V. Leong and A. Si. On adaptive caching in mobile databases. In Proceedings of ACM Symposium on Applied Computing, pp. 302-309, 1997. Google ScholarDigital Library
- W. S. Li, K. S. Candan, Q. Vu and D. Agrawal. Retieving and organizing web pages by "information unit". In Proceedings of the 10th WWW Conference, pp. 230-244, 2001. Google ScholarDigital Library
- W. S. Li, Q. Vu, D. Agrawal, Y. Hara and H. Takano. PowerBookmarks: a system for personalizable web information organization, sharing and management. In Proceedings of the 8th WWW Conference, 1999. Google ScholarDigital Library
- P. Pirolli, J. Pitkow and R. Rao. Silk from a sow's ear: extracting usable structures from the web. In Proceedings of Conference on Human Factors in Computing Systems, CHI'96, pp. 118-125, 1996. Google ScholarDigital Library
- E. Spertus. ParaSite: Mining structural information on the web. In Proceedings of the 6th WWW Conference, pp. 206-212, 1997. Also in Computer Networks,29:1205-1215. Google ScholarDigital Library
Index Terms
- Dynamic structuring of web information for access visualization
Recommendations
XQuery in the browser
WWW '09: Proceedings of the 18th international conference on World wide webSince the invention of the Web, the browser has become more and more powerful. By now, it is a programming and execution environment in itself. The predominant language to program applications in the browser today is JavaScript. With browsers becoming ...
DOM tree browsing of a very large XML document: Design and implementation
Browsing the DOM tree of an XML document is an act of following the links among the nodes of the DOM tree to find some desired nodes without any knowledge for search. When the structure of the XML document is not known to a user, browsing is the basic ...
Swi-prolog and the web
Prolog is an excellent tool for representing and manipulating data written in formal languages as well as natural language. Its safe semantics and automatic memory management make it a prime candidate for programming robust Web services. Although Prolog ...
Comments