Abstract
The paper describes the capabilities that should be expected of intelligent Web search tools in order to respond properly to user’s information retrieval needs. The tools fall generally into three different categories: general purpose search engines, specialized search engines, and personal search tools, which differ in user expectations. Two features, however, seem to be of great importance for each tool: the capability to cluster the retrieved documents and the capability to draw user’s attention to next interesting documents (intelligent navigation). A number of clustering and intelligent navigation techniques are overviewed.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Aggarwal C.C., Al-Garawim F., Yu P.S. (2001): Intelligent crawling on the World Wide Web with arbitrary predicates. In: World Wide Web Conference 2001, pp. 96–105, url = citeseer.nj.nec.com/aggarwa101intelligent.html
Chakrabarti S., van den Berg M., Dom B (1999).. Focussed Crawling: A NewApproach to Topic Specific Resource Discovery. W W W Conference, 1999.
Cohn, D. and Hofmann, T. (2001). The missing link - a probabilistic model of document content and hypertext connectivity, in T. K. Leen, T. G. Dietterich and V. Tresp (eds), Advances in Neural Information Processing Systems, Vol. 10. http://citeseer.nj.nec.com/cohnOlmissing.html
Cohn D., Chang H. (2000): Learning to probabilistically identify authoritative documents. In Proceedings of the 17th International Conference on Machine Learning, 2000.
Deerwester S., Dumais S. T., Furnas G. W., Landauer T. K., Harshman R. (1990). Indexing by latent semantic analysis. J. of the American Society for Information Science, 41: 391–407, 1990.
Frank E., Paynter G.W., Witten I.H., Gutwin C., Nevill-Manning C. G.(1999): Domain-Specific Keyphrase Extraction. IJCAI,1999, 668–673 http:// citeseer.nj.nec.com/article/frank99domainspecific.html
Hofmann T. (1999):. Probabilistic latent semantic analysis. In Proceedings of the 15th Conference on Uncertainty in AI,pages 289–296.
Klopotek M.A., (2001). Inteligentne wyszukiwarki internetowe. (in Polish: Intelligent Search Engines). Akademicka Oficyna Wydawnicza EXIT. Warsaw. 304 pp.
Klopotek M.A. (1991): On the Phenomenon of Flattening ‘Flexible Prediction’ Concept Hierarchy. in: Ph. Jorrand, J. Kelemen, Eds.: Fundamentals of Artificial Intelligence Research. Lecture Notes in Artificial Intelligence 535,, pp. 99–111.
Lagus K. (2000): Text Mining with the WEBASOM. PhD Thesis, Helsinki University of Technology.
Marcu D., Gerber L. (2001): An inquiry into the nature of multidocument abstracts, extracts, and their evaluation. Proceedings of the NAACL-2001 Workshop on Automatic Summarization, Pittsburgh, PA, June 3, 2001. http://www.isi.edu/~marcuipapers/multidoceval01.pdf
Richardson M., Domingos P. (2001): The Intelligent Surfer: Probabilistic Combination of Link and Content Information in PageRank, url = citeseer.nj.nec.com/484739.htinl.
Schmitt J.C. (1991): Trigram-based method of language identification. October 1991. U.S. Patent number: 5062143
Zamir O., Etzioni O (1998).: Web-document clustering: a feasibility demonstration, in: Proceedings of the 19th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR’98), p.46–54
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Kłopotek, M. (2003). Intelligent Information Retrieval on the Web. In: Szczepaniak, P.S., Segovia, J., Kacprzyk, J., Zadeh, L.A. (eds) Intelligent Exploration of the Web. Studies in Fuzziness and Soft Computing, vol 111. Physica, Heidelberg. https://doi.org/10.1007/978-3-7908-1772-0_4
Download citation
DOI: https://doi.org/10.1007/978-3-7908-1772-0_4
Publisher Name: Physica, Heidelberg
Print ISBN: 978-3-7908-2519-0
Online ISBN: 978-3-7908-1772-0
eBook Packages: Springer Book Archive