Skip to main content

Searching the World Wide Web: Challenges and Partial Solutions

  • Conference paper
  • First Online:
Progress in Artificial Intelligence — IBERAMIA 98 (IBERAMIA 1998)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1484))

Included in the following conference series:

Abstract

In this article we analyze the problem of searching the WWW, giving some insight and models to understand its complexity. Then we survey the two main current techniques used to search the WWW. Finally, we present recent results that can help to partially solve the challenges posed.

This work was supported by CYTED Project VII.13: AMYRI.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Excite: Main page. http://www.excite.com, 1995. 43

  2. Yahoo!: Main page. http://www.yahoo.com, 1995. 43

  3. Hotbot: Main page. http://www.hotbot.com, 1996. 43

  4. Northern Light: Main page. http://www.northernlight.com, 1997. 43

  5. Search Engine Watch: Main page. http://www.searchenginewatch.com, 1997. 43

  6. O. Alonso and R. Baeza-Yates. A bookpile applet. http://www.dcc.uchile.cl/-.~.rbaeza/sem/visual/mio/Bucpil.html, 1997. 49

  7. O. Alonso and R. Baeza-Yates. Visualizations of answers in WWW retrieval. Technical report, Department of Computer Science, Univ. of Chile, 1997. http://nova1.cs.nvgc.vt.edu/alonso/viswww.html. 49

  8. M. Araújo, G. Navarro, and N. Ziviani. Large text searching allowing errors. In Proc. WSP’97, pages 2–20. Carleton University Press, 1997. 41, 42

    Google Scholar 

  9. R. Baeza-Yates. Modeling, browsing and querying large text databases. Technical Report DCC-94-2, Dept. of Computer Science, Univ. of Chile, 1994. 44

    Google Scholar 

  10. R. Baeza-Yates. Visualizing large answers in text databases. In Int. Workshop on Advanced User Interfaces (AVI’96), pages 101–107, Gubbio, Italy, May 1996. ACM Press. 47, 48, 49

    Google Scholar 

  11. R. Baeza-Yates and G. Navarro. Block-addressing indices for approximate text retrieval. In Proc. CIKM’97, pages 1–8, Las Vegas, USA, 1997. 41, 45

    Google Scholar 

  12. R. Baeza-Yates, G. Navarro, J. Vegas, and P. de la Fuente. A model and a visual query language for structured text. Santa Cruz, Bolivia, Sept 1998. IEEE CS Press. 46

    Google Scholar 

  13. R. Baeza-Yates and N. Ziviani. AMYRI: Main page. http://www.dcc.ufmg.br/-latin/amyri/, 1997. 40, 49

  14. R.A. Baeza-Yates and G. Navarro. Integrating contents and structure in text retrieval. ACM SIGMOD Record, 25(1):67–79, March 1996. 46

    Article  Google Scholar 

  15. C. Mic Bowman, Peter B. Danzig, Darren R. Hardy, Udi Manber, and Michael F. Schwartz. The harvest information discovery and access system. Computer Networks and ISDN Systems, 28:119–125, 1995. 43, 45

    Article  Google Scholar 

  16. T. Bray. Measuring the web. In Fifth International World Wide Web Conference, Paris, May 1996. http://www5conf.inria.fr/fich..html/papers/P9/-Overview.html. 41

  17. M. Chalmers and P. Chitson. BEAD: Exploration in information visualization. In ACM SIGIR’92, 1992. 48

    Google Scholar 

  18. Digital Equipment Corporation. Alta Vista: Main page. http://altavista.-digital.com, 1996. 43

  19. M. Crovella and A. Bestavros. Self-similarity in World Wide Web traffic: Evidence and possible causes. In ACM Sigmetrics Conference on Measurement and Modeling of Computer Systems, pages 160–169, May 1996. 41

    Google Scholar 

  20. Daniel Dreilinger. Savvysearch home page. 1996. http://guaraldi.cs.-colostate.edu:2000. 43

  21. Stephen Eick. Graphically displaying text. Journal of Computational and Graphical Statistics, 3(2):127–142, 1994. 47

    Article  Google Scholar 

  22. W. Frakes and R. Baeza-Yates, editors. Information Retrieval: Data Structures and Algorithms. Prentice-Hall, 1992. 45

    Google Scholar 

  23. G. Gonnet and R. Baeza-Yates. Handbook of Algorithms and Data Structures. Addison-Wesley, 2nd edition, 1991. 42

    Google Scholar 

  24. J. Heaps. Information Retrieval-Computational and Theoretical Aspects. Academic Press, NY, 1978. 41

    MATH  Google Scholar 

  25. M. Hearst. Tilebars: Visualization of term distribution information in full text information access. In ACM SIGCHI, Denver, CO, May 1995. 48

    Google Scholar 

  26. M. Hemmje, C. Kunkel, and A. Willet. Lyberworld-a visualization user interface supporting text retrieval. In 17th ACM SIGIR, Dublin, Jul 1994. 48

    Google Scholar 

  27. S. Lawrence and C.L. Giles. Searching the world wide web (in reports). Science, 280(5360):98, April 3 1998. 43

    Article  Google Scholar 

  28. U. Manber and P. Bigot. Search Broker: Main page. http://debussy.cs.-arizona.edu/sb/, 1997. 43

  29. U. Manber, M. Smith, and B. Gopal. Webglimpse: combining browsing and searching. In Proc. of USENIX Technical Conference, 1997. 43

    Google Scholar 

  30. U. Manber and S. Wu. GLIMPSE: A tool to search through entire file systems. In Proc. USENIX Technical Conference, pages 23–32. USENIX Association, Berkeley, CA, USA, Winter 1994. 45

    Google Scholar 

  31. G. Miller, E. Newman, and E. Friedman. Length-frequency statistics for written English. Information and Control, 1:370–380, 1958. 42

    Article  Google Scholar 

  32. E. Moura, G. Navarro, N. Ziviani, and R. Baeza-Yates. Fast searching on compressed text allowing errors. In Proc. SIGIR’98, Melbourne, Australia, August 1998. ACM Press. 46

    Google Scholar 

  33. D. Olsen. Bookmarks: An enhanced scroll bar. ACM Trans. on Computer Graphics, 11(3):291–295, 1992. 47

    Article  Google Scholar 

  34. K. Olsen, R. Korfhage, K. Sochats, M. Spring, and J. Williams. Visualization of a document collection: The VIBE system. Information Processing and Management, 29(1):69–81, 1993. 48

    Article  Google Scholar 

  35. V. Ribeiro and N. Ziviani. Meta Miner: Main page. http://canela.dcc.ufmg.-br:8080/metaminer.html, 1997. 43

  36. Erik Selberg and Oren Etzioni. Multi-service search and comparison using the MetaCrawler. In Proceedings of the Fourth International World Wide Web Conference, Boston, December 1995. http://www.w3.org/pub/Conferences/WWW4/-Papers/169. 43

  37. A. Spoerri. Infocrystal: A visual tool for information retrieval and management. In Information and Knowledge Management’93, Washington D.C., 1993. 48

    Google Scholar 

  38. G. Zipf. Human Behaviour and the Principle of Least Effort. Addison-Wesley, 1949. 42

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1998 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Baeza-Yates, R.A. (1998). Searching the World Wide Web: Challenges and Partial Solutions. In: Coelho, H. (eds) Progress in Artificial Intelligence — IBERAMIA 98. IBERAMIA 1998. Lecture Notes in Computer Science(), vol 1484. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-49795-1_4

Download citation

  • DOI: https://doi.org/10.1007/3-540-49795-1_4

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-64992-2

  • Online ISBN: 978-3-540-49795-0

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics