Web Information Resource Discovery: Past, Present, and Future

Ozsoyoglu, Gultekin; Al-Hamdani, Abdullah

doi:10.1007/978-3-540-39737-3_2

Conference paper

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 2869)

Abstract

In a time span of twelve years, the World Wide Web (only a computer and an internet connection away from anybody anywhere, and with abundant, diverse, and sometimes incorrect, redundant, spam, and bad information) has become the major information repository for the masses and the world. The web is becoming all things to all people, totally oblivious to national and continental boundaries, promising mostly free information to all, and quickly growing into a repository in all languages and all cultures. With large digital libraries and increasingly significant educational resources, the web is becoming an equalizer, a balancing force, and an opportunity for all, especially for underdeveloped and developing countries. The web is both exciting and overwhelming, changing the way the world communicates, from the way businesses are conducted to the way masses are educated, from the way research is performed to the way research results are disseminated. It is fair to say that the web will only get more diverse, larger, and more chaotic in the near future.

Author information

Authors and Affiliations

Dept of Electrical Engineering and Computer Science, Case Western Reserve University, Cleveland, Ohio, 44106
Gultekin Ozsoyoglu & Abdullah Al-Hamdani

Editor information

Editors and Affiliations

Dept. of Computer Engineering, Middle East Technical University, Ankara, Turkey
Adnan Yazıcı
Department of Computer Engineering, Middle East Technical University, 06531, Ankara, Turkey
Cevat Şener

Cite this paper

Ozsoyoglu, G., Al-Hamdani, A. (2003). Web Information Resource Discovery: Past, Present, and Future. In: Yazıcı, A., Şener, C. (eds) Computer and Information Sciences - ISCIS 2003. ISCIS 2003. Lecture Notes in Computer Science, vol 2869. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39737-3_2

DOI: https://doi.org/10.1007/978-3-540-39737-3_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20409-1
Online ISBN: 978-3-540-39737-3
eBook Packages: Springer Book Archive
