Abstract
In a time span of twelve years, the World Wide Web–only a computer and an internet connection away from anybody anywhere, and with abundant, diverse and sometimes incorrect, redundant, spam, and bad information–has become the major information repository for the masses and the world. The web is becoming all things to all people, totally oblivious to nation/country/continent boundaries, promising mostly free information to all, and quickly growing into a repository in all languages and all cultures. With large digital libraries and increasingly significant educational resources, the web is becoming an equalizer, a balancing force, and an opportunity for all, especially for underdeveloped/developing countries. The web is both exciting and overwhelming, changing the way the world communicates, from the way businesses are conducted to the way masses are educated, from the way research is performed to the way research results are disseminated. It is fair to say that the web will only get more diverse, larger and more chaotic in the near future.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Agichtein, E., Eskin, E., Gravano, L.: Combining Strategies for Extracting Relations from Text Collections. ACM SIGMOD (2000)
Agichtein, E., Gravano, L.: Snowball: Extracting relations from large plain-text collections. In: The 5th ACM International Conference on Digital Libraries (June 2000)
Agichtein, E., Gravano, L.: Querying Text Databases for Efficient Information Extraction. In: Proce. of the 19th IEEE Intl Conference on Data Engineering (ICDE) (2003)
Brickley, D., Guha, R.V.: Resource Description Framework Schema (RDFS). W3C Proposed Recommendation (1999), available at http://www.w3.org/TR/PR-rdf-schema
Bharat, K., Henzinger, M.R.: Improved algorithms for topic distillation in a hyperlinked environment. In: ACM SIGIR Conf. (1998)
Broekstra, J., Klein, M., Fensel, D., Horrocks, I.: Adding formal semantics to the Web: building on top of RDF Schema. In: Proc. of the ECDL (2000)
Berners-Lee, T.: Semantic Web Roadmap. W3C draft (January 2000), available at http://www.w3.org/DesignIssues/Semantic.html
De Bra, P.M.E., Post, R.D.J.: Searching for arbitrary information in the WWW: Making Client-based searching feasible. In: WWW Conf. (1994)
Brin, S., Page, L.: The anatomy of a large-scale hypertextual Web search engine. Computer Networks and ISDN Systems, Brisbane, Australia (1998)
Brin, S.: Extracting patterns and relations from the world wide web. In: Atzeni, P., Mendelzon, A.O., Mecca, G. (eds.) WebDB 1998. LNCS, vol. 1590, pp. 172–183. Springer, Heidelberg (1998), http://citeseer.nj.nec.com/brin98extracting.html
Chakrabarti, S., et al.: Mining the web’s link structure. IEEE Computer (August 1999)
Chakrabarti, S., van den Berg, M., Dom, B.: Focused crawling: A new approach to topicspecific web resource Discovery. In: Proceedings of WWW 8 Conf. (1999)
Cho, J., Garcia-Molina, H., Page, L.: Efficient crawling through URL ordering. In: Proceedings of the Seventh International World-Wide Web Conference (1998)
Chakrabarti, S.: Mining the Web: Discovering knowledge from hypertext data. Morgan- Kaufmann Publishers, San Francisco (2003)
Diligenti, M., Coetzee, F., Lawrence, S., Giles, C.L., Gori, M.: Focused Crawling using Context Graphs. In: VLDB 2000 (2000)
Eberhart, A.: Survey of RDF data on the web. In: Proc. of the 6th World Multiconference on Systemics, Cybernetics and Informatics (SCI) (2002)
Google History, at http://www.google.com/corporate/history.html
Gruber, T.: A translation approach to portable ontologies. Knowledge Acquisition (1993)
Guarino, N.: Formal Ontology and Information Systems. In: Guarino, N. (ed.) Formal Ontology in Information Systems, Proc. of the 1st International Conference (1998)
Grishman, R., Huttunen, S., Yangarber, R.: Real-Time Event Extraction for Infectious Disease Outbreaks. In: Proceedings of Human Language Technology Conference (2002)
Grishman, R.: Information extraction: Techniques and challenges. In: Pazienza, M.T. (ed.) SCIE 1997. LNCS(LNAI), vol. 1299. Springer, Heidelberg (1997)
Hersovici, M., et al.: The sharksearch algorithm—an application: Tailored web site mapping. In: WWW 7 Conf. (1998)
Horrocks et al.: The Ontology Inference Layer OIL. Technical report, Free University of Amsterdam (2000), http://www.ontoknowledge.org/oil/
Kleinberg, J.: Authoritative Sources in hyperlinked environments. In: The 9th ACM SIAM Symposium on Discrete Mathematics (1998)
Koivunen, M., Miller, E.: W3C Semantic Web Activity. In: The proceedings of the Semantic Web Kick-off Seminar in Finland, November 2 (2001)
Lempel, R., Moran, S.: SALSA: The stochastic approach for link-structure analysis. ACM TOIS (April 2001)
Lassila, O., Swick, R.: Resource Description Framework (RDF) Model and Syntax Specification. W3C Recommendation, February 22 (1999)
Manola, F., Miller, E.: RDF Primer. W3C Working Draft, January 23 (2003)
Menczer, F., Pant, G., Ruiz, M., Srinivasan, P.: Evaluating topic-driven Web crawlers. In: Proc. 24th Intl. ACM SIGIR Conf. (2001)
Najork, M., Weiner, J.: Breadth-First search crawling yields high-quality pages. In: WWW 1998 (1998)
Ng, A., Zheng, A., Jordan, M.: Stable algorithms for link analysis. In: ACM SIGIR (2001)
Page, L., Brin, S., Motwani, R., Winograd, T.: The PageRank citation ranking: Bringing order to the web. Stanford Digital Libraries Working Paper (1998)
Salton, G.: Automatic Text Processing. Addison-Wesley, Reading (1989)
International Directory of Search Engines. Search Engine Colossus (2003), available at http://www.searchenginecolossus.com
The Major Search Engines and Directories. Search Engine Watch Report, Danny Sullivan (2003), available at: searchenginewatch.com/links/article.php/2156221
The Semantic Web Community Portal, at http://www.semanticweb.org
Search Links, available at http://searchenginewatch.com/links/index.php
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ozsoyoglu, G., Al-Hamdani, A. (2003). Web Information Resource Discovery: Past, Present, and Future. In: Yazıcı, A., Şener, C. (eds) Computer and Information Sciences - ISCIS 2003. ISCIS 2003. Lecture Notes in Computer Science, vol 2869. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39737-3_2
Download citation
DOI: https://doi.org/10.1007/978-3-540-39737-3_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20409-1
Online ISBN: 978-3-540-39737-3
eBook Packages: Springer Book Archive