Intelligent Information Retrieval on the Web

Kłopotek, Mieczysław

doi:10.1007/978-3-7908-1772-0_4

Intelligent Information Retrieval on the Web

Mieczysław Kłopotek⁶

Chapter

224 Accesses
3 Citations

Part of the book series: Studies in Fuzziness and Soft Computing ((STUDFUZZ,volume 111))

Abstract

The paper describes the capabilities that should be expected of intelligent Web search tools in order to respond properly to user’s information retrieval needs. The tools fall generally into three different categories: general purpose search engines, specialized search engines, and personal search tools, which differ in user expectations. Two features, however, seem to be of great importance for each tool: the capability to cluster the retrieved documents and the capability to draw user’s attention to next interesting documents (intelligent navigation). A number of clustering and intelligent navigation techniques are overviewed.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Aggarwal C.C., Al-Garawim F., Yu P.S. (2001): Intelligent crawling on the World Wide Web with arbitrary predicates. In: World Wide Web Conference 2001, pp. 96–105, url = citeseer.nj.nec.com/aggarwa101intelligent.html
Chapter Google Scholar
Chakrabarti S., van den Berg M., Dom B (1999).. Focussed Crawling: A NewApproach to Topic Specific Resource Discovery. W W W Conference, 1999.
Google Scholar
Cohn, D. and Hofmann, T. (2001). The missing link - a probabilistic model of document content and hypertext connectivity, in T. K. Leen, T. G. Dietterich and V. Tresp (eds), Advances in Neural Information Processing Systems, Vol. 10. http://citeseer.nj.nec.com/cohnOlmissing.html
Google Scholar
Cohn D., Chang H. (2000): Learning to probabilistically identify authoritative documents. In Proceedings of the 17th International Conference on Machine Learning, 2000.
Google Scholar
Deerwester S., Dumais S. T., Furnas G. W., Landauer T. K., Harshman R. (1990). Indexing by latent semantic analysis. J. of the American Society for Information Science, 41: 391–407, 1990.
Article Google Scholar
Frank E., Paynter G.W., Witten I.H., Gutwin C., Nevill-Manning C. G.(1999): Domain-Specific Keyphrase Extraction. IJCAI,1999, 668–673 http:// citeseer.nj.nec.com/article/frank99domainspecific.html
Google Scholar
Hofmann T. (1999):. Probabilistic latent semantic analysis. In Proceedings of the 15th Conference on Uncertainty in AI,pages 289–296.
Google Scholar
Klopotek M.A., (2001). Inteligentne wyszukiwarki internetowe. (in Polish: Intelligent Search Engines). Akademicka Oficyna Wydawnicza EXIT. Warsaw. 304 pp.
Google Scholar
Klopotek M.A. (1991): On the Phenomenon of Flattening ‘Flexible Prediction’ Concept Hierarchy. in: Ph. Jorrand, J. Kelemen, Eds.: Fundamentals of Artificial Intelligence Research. Lecture Notes in Artificial Intelligence 535,, pp. 99–111.
Google Scholar
Lagus K. (2000): Text Mining with the WEBASOM. PhD Thesis, Helsinki University of Technology.
Google Scholar
Marcu D., Gerber L. (2001): An inquiry into the nature of multidocument abstracts, extracts, and their evaluation. Proceedings of the NAACL-2001 Workshop on Automatic Summarization, Pittsburgh, PA, June 3, 2001. http://www.isi.edu/~marcuipapers/multidoceval01.pdf
Google Scholar
Richardson M., Domingos P. (2001): The Intelligent Surfer: Probabilistic Combination of Link and Content Information in PageRank, url = citeseer.nj.nec.com/484739.htinl.
Google Scholar
Schmitt J.C. (1991): Trigram-based method of language identification. October 1991. U.S. Patent number: 5062143
Google Scholar
Zamir O., Etzioni O (1998).: Web-document clustering: a feasibility demonstration, in: Proceedings of the 19th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR’98), p.46–54
Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Computer Science, Polish Academy of Sciences, ul. Ordona 21, 01-237, Warsaw, Poland
Mieczysław Kłopotek

Authors

Mieczysław Kłopotek
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institute of Computer Science, Technical University of Lodz, ul. Sterlinga 16/18, 90-217, Lodz, Poland
Piotr S. Szczepaniak
Systems Research Institute, Polish Academy of Sciences, ul. Newelska 6, 01-447, Warsaw, Poland
Piotr S. Szczepaniak & Janusz Kacprzyk &
Facultad de Informática, Universidad Politécnica de Madrid, Campus de Montegancedo, 28660, Madrid, Spain
Javier Segovia
Computer Science Division, Department of Electrical Engineering and Computer Sciences, University of California, 94720-1776, Berkeley, CA, USA
Lotfi A. Zadeh

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Kłopotek, M. (2003). Intelligent Information Retrieval on the Web. In: Szczepaniak, P.S., Segovia, J., Kacprzyk, J., Zadeh, L.A. (eds) Intelligent Exploration of the Web. Studies in Fuzziness and Soft Computing, vol 111. Physica, Heidelberg. https://doi.org/10.1007/978-3-7908-1772-0_4

Download citation

DOI: https://doi.org/10.1007/978-3-7908-1772-0_4
Publisher Name: Physica, Heidelberg
Print ISBN: 978-3-7908-2519-0
Online ISBN: 978-3-7908-1772-0
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics