
Web Information Resource Discovery: Past, Present, and Future

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 2869)

Abstract

In a time span of twelve years, the World Wide Web has become the major information repository for the masses and the world. It is only a computer and an internet connection away from anybody anywhere, and it holds abundant and diverse information, some of it incorrect, redundant, spam, or simply bad. The web is becoming all things to all people, oblivious to national and continental boundaries, promising mostly free information to all, and quickly growing into a repository in all languages and all cultures. With large digital libraries and increasingly significant educational resources, the web is becoming an equalizer, a balancing force, and an opportunity for all, especially for underdeveloped and developing countries. The web is both exciting and overwhelming, changing the way the world communicates, from the way businesses are conducted to the way the masses are educated, and from the way research is performed to the way research results are disseminated. It is fair to say that the web will only get larger, more diverse, and more chaotic in the near future.




Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ozsoyoglu, G., Al-Hamdani, A. (2003). Web Information Resource Discovery: Past, Present, and Future. In: Yazıcı, A., Şener, C. (eds) Computer and Information Sciences - ISCIS 2003. ISCIS 2003. Lecture Notes in Computer Science, vol 2869. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39737-3_2


  • DOI: https://doi.org/10.1007/978-3-540-39737-3_2

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-20409-1

  • Online ISBN: 978-3-540-39737-3

  • eBook Packages: Springer Book Archive
