Skip to main content
Log in

Can the Web turn into a digital library?

  • Published:
International Journal on Digital Libraries Aims and scope Submit manuscript

Abstract

There is no doubt that the enormous amounts of information on the WWW are influencing how we work, live, learn and think. However, information on the WWW is in general too chaotic, not reliable enough and specific material often too difficult to locate that it cannot be considered a serious digital library. In this paper we concentrate on the question how we can retrieve reliable information from the Web, a task that is fraught with problems, but essential if the WWW is supposed to be used as serious digital library. It turns out that the use of search engines has many dangers. We will point out some of the possible ways how those dangers can be reduced and how dangerous traps can be avoided. Another approach to find useful information on the Web is to use “classical” resources of information like specialized dictionaries, lexica or encyclopaedias in electronic form, such as the Britannica. Although it seemed for a while that such resources might more or less disappear from the Web due to attempts such as Wikipedia, some to the classical encyclopaedias and specialized offerings have picked up steam again and should not be ignored. They do sometimes suffer from what we will call the “wishy-washy” syndrome explained in this paper. It is interesting to note that Wikipedia which is also larger than all other encyclopaedias (at least the English version) is less afflicted by this syndrome, yet has some other serious drawbacks. We discuss how those could be avoided and present a system that is halfway between prototype and production system that does take care of many of the aforementioned problems and hence may be a model for further undertakings in turning (part of) the Web into a useable digital library.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

References

  1. Andera, M., Stein, B., Lipka, N.: Towards Automatic Quality Assurance in Wikipedia. WWW 2011 Poster (2011)

  2. APA Picture Desk: http://www.picturedesk.com/bild-disp/apa/de/home.html (2011)

  3. Blekko: http://blekko.com. Accessed 23 Nov 2010

  4. Brockhaus.: Der elektronische Brockhaus. Mannheim, Germany (2006)

  5. Cangie, R.: http://robinoula.com/media-and-technology/information-consolidation-why-aol-huffpo-is-bad-for-democracy/ (2008)

  6. Citizendium: http://en.citizendium.org/ (2011)

  7. Encyclopaedia Britannica: Electronic Version. http://www.britannica.com/. Accessed 30 Nov 2010

  8. Encyclopedias: http://www.encyclopedia.com/. Accessed 25 Nov 2010

  9. Eissen, S., Stein, B.: Intrinsic plagiarism detection. In: Advances in Information Retrieval. Lecture Notes in Computer Science, vol. 3936, pp. 565–569. Springer, Berlin (2006)

  10. Europeana: http://www.europeana.eu/portal/ (2011)

  11. IMAGNO: www.imagno.com (2011)

  12. InfoMed Austria: Medizinische Informationen. Bohmann, Vienna (1999)

  13. JUCS: http://www.jucs.org (2011)

  14. Kappe, F., Maurer, H., Zaka, B.: Plagiarism: a survey. J. Universal Comput. Sci. 12, 8 http://www.jucs.org (2006)

  15. Keen, A.: The cult of the amateur. Double Day (2007)

  16. Korica-Pehserl, P.: Semi-automatic information retrieval and consolidation of pictures from large databases. JUCS 17 (2011, to appear)

  17. Knol: http://knol.google.com/k (2011)

  18. List of Encyclopaedias: http://en.wikipedia.org/wiki/Listof_onlineencyclopedias. Accessed 30 Nov 2010

  19. Maurer, H., Kulathuramaiyer, N.: Fighting plagiarism and IPR violation: why is it so important? Inf. Services Use 27(4), 185–191 (2007)

    Google Scholar 

  20. Meyer zu Eissen, S., Stein, B., Kulig, M.: Plagiarism detection without reference collections. In: Advances in Data Analysis. Studies in Classification, Data Analysis, and Knowledge Organization, pp. 359–366. Springer, Berlin (2007)

  21. Mihalcea, R., Corley, C., Strapparava, C.: Corpus-Based and Knowledge-Base Measures of Text Similarity. American Association for Artificial Intelligence (2006)

  22. Mueller, H., Maurer, H.: A new approach for e-books for teaching and learning. In: Proc. ED.MEDIA 2011, Lisbon, July 2011 (2010)

  23. Partners: http://www.austria-lexikon.at/af/Infos_zum_AF/Partner (2011)

  24. Saarbruecken: http://www.mmci.uni-saarland.de/en/start (2011)

  25. Scholarpedia: http://en.wikipedia.org/wiki/Scholarpedia (2011)

  26. Stvilia, B., Twidale, M.B., Smith, L.C., Gasser, L.: Information quality work organization in wikipedia. J. Am. Soc. Inf. Sci. Technol. 59(6), 983–1001 (2008). Online version at http://onlinelibrary.wiley.com/journal/10.1002/(ISSN)1532-2890

    Google Scholar 

  27. Surowiecki, J.: 2005. The Wisdom of the Crowds. Achor Books (2006)

  28. The Austrian Ecnyclopaedia: AEIOU. http://www.aeiou.at. Accessed 30 Nov 2010 (1995)

  29. Weinmann, J.J., Learned-Miller, E., Hanson, A.R.: Scene text recognition using similarity and a lexicon with sparse belief propagation. IEEE Trans. Pattern Anal. Mach. Intell. 31(10), 1733–1746 (2009). http://www.computer.org/portal/web/tpami (2009)

    Google Scholar 

  30. Wikipedia Foundation: http://wikimediafoundation.org/wiki/Home. Accessed 27 Nov (2010)

  31. Wurzinger, G.: Data consolidation in large bodies of information. J. Universal Comput. Sci. 16(21), 3314–3323 http://www.jucs.org (2010)

    Google Scholar 

  32. Zaka, B., Kulathuramayer, N., Maurer, H., Balke, W.-T.: Topic-centered aggregation of presentations for learning object repurposing. In: Proceedings of AACE E-Learn Vancouver 2008, pp. 3335–3342 (2008)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hermann Maurer.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Maurer, H., Mueller, H. Can the Web turn into a digital library?. Int J Digit Libr 13, 65–75 (2013). https://doi.org/10.1007/s00799-012-0097-9

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00799-012-0097-9

Keywords

Navigation