Abstract
In this article we analyze the problem of searching the WWW, offering insights and models that help explain its complexity. We then survey the two main techniques currently used to search the WWW. Finally, we present recent results that can help to partially solve the challenges posed.
This work was supported by CYTED Project VII.13: AMYRI.
Copyright information
© 1998 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Baeza-Yates, R.A. (1998). Searching the World Wide Web: Challenges and Partial Solutions. In: Coelho, H. (eds) Progress in Artificial Intelligence — IBERAMIA 98. IBERAMIA 1998. Lecture Notes in Computer Science, vol 1484. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-49795-1_4
DOI: https://doi.org/10.1007/3-540-49795-1_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-64992-2
Online ISBN: 978-3-540-49795-0
eBook Packages: Springer Book Archive