Abstract
This work proposes a method for searching for information in hypertext systems that represent WWW sites. The method is based on building a two-level index. The first level covers information located entirely inside individual nodes. The second level covers information that is not confined to one node but spans a set of related nodes. This second level is built on the context hierarchy, a hierarchical organization of the main themes addressed by the information in the site, which gives each page a notion of context. That notion makes it possible to add a new operator, named context:, to the query language, allowing users to express their information needs more precisely.
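The two-level idea can be illustrated with a small Python sketch. It is not the authors' implementation. The class name TwoLevelIndex, its methods, the hand-written context hierarchy, and the AND semantics between query terms are all assumptions made for illustration. The abstract does not say how the context hierarchy is discovered, so here it is supplied manually.

```python
from collections import defaultdict


class TwoLevelIndex:
    """Toy two-level index (hypothetical names, illustrative only)."""

    def __init__(self):
        self.page_terms = defaultdict(set)     # level 1: term -> pages containing it
        self.context_terms = defaultdict(set)  # level 2: theme term -> pages in that context
        self.parent = {}                       # context node -> parent node (None at the root)
        self.theme = {}                        # context node -> list of theme words

    def add_context(self, node, theme, parent=None):
        """Register a node of the context hierarchy; add parents before children."""
        self.theme[node] = theme.lower().split()
        self.parent[node] = parent

    def add_page(self, page, text, node):
        """Index a page's own words, then the themes of its context and all ancestors."""
        for term in text.lower().split():
            self.page_terms[term].add(page)
        while node is not None:
            for term in self.theme[node]:
                self.context_terms[term].add(page)
            node = self.parent[node]

    def search(self, query):
        """AND together all tokens; 'context:x' tokens consult the second level."""
        result = None
        for token in query.lower().split():
            if token.startswith("context:"):
                hits = self.context_terms.get(token[len("context:"):], set())
            else:
                hits = self.page_terms.get(token, set())
            result = set(hits) if result is None else result & hits
        return sorted(result or [])


# Assumed example site: two pages with identical text in different contexts.
index = TwoLevelIndex()
index.add_context("research", "research")
index.add_context("ir", "information retrieval", parent="research")
index.add_context("teaching", "teaching")
index.add_page("/research/ir/members.html", "members list", "ir")
index.add_page("/teaching/members.html", "members list", "teaching")

print(index.search("members"))
print(index.search("members context:retrieval"))
```

In this sketch the plain query matches both pages because their text is the same, while adding context:retrieval keeps only the page whose position in the hierarchy falls under the "information retrieval" theme. Pages inherit the themes of ancestor nodes, which is one possible reading of "a set of related nodes".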
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Aguiar, F., Beigbeder, M. (2001). Discovering the Context of WWW Pages to Improve the Effectiveness of Local Search Engines. In: Larsen, H.L., Andreasen, T., Christiansen, H., Kacprzyk, J., Zadrożny, S. (eds) Flexible Query Answering Systems. Advances in Soft Computing, vol 7. Physica, Heidelberg. https://doi.org/10.1007/978-3-7908-1834-5_48
DOI: https://doi.org/10.1007/978-3-7908-1834-5_48
Publisher Name: Physica, Heidelberg
Print ISBN: 978-3-7908-1347-0
Online ISBN: 978-3-7908-1834-5
eBook Packages: Springer Book Archive