Abstract
Since the mid-1990s has emerged a new research field, webometrics, investigating the nature and properties of the Web drawing on modern informetric methodologies. The article attempts to point to selected areas of webometric research that demonstrate interesting progress and space for development as well as to some currently less promising areas. Recent investigations of search engine coverage and performance are reviewed as a frame for selected quality and content analyses. Problems with measuring Web Impact Factors (Web-IF) are discussed. Concluding the article, new directions of webometrics are outlined for performing knowledge discovery and issue tracking on the Web, partly based on bibliometric methodologies used in bibliographic and citation databases. In this framework graph theoretic approaches, including path analysis, transversal links, “weak ties” and “small-world” phenomena are integrated.
Similar content being viewed by others
References
E. S. Allen, J. M. Burke, M. E. Welch, L. H. Rieseberg (1999), How reliable is science information on the Web? Science, 402:722.
T. Almind, P. Ingwersen (1997), Informetric analyses on the World Wide Web: Methodological approaches to “Webometrics”, Journal of Documentation, 53:404-426.
J. Bar-Ilan (1998), The mathematician, Paul Erdos (1913-1996) in the eyes of the Internet, Scientometrics, 43:257-267.
J. Bar-Ilan (1999), Search engine results over time: A case study on search engine stability, Cybermetrics, 2/3, paper 1. ISSN: 1137-5019 (http://www.cindoc.csic.es/cybermetrics/articles/v2i1p1.html; visited 08.11.2000).
J. Bar-Ilan (2000), The Web as an information resource on informetrics? A content analysis, Journal of the American Society for Information Science, 51:432-443.
J. Bar-Ilan, B. C. Peritz (1999), The life span of a specific topic on the Web. The case of “informetrics”: A quantitative analysis, Scientometrics, 46:371-382.
M. Bates, S. Lu (1997), An exploratory profile of personal home pages: Content, design, metaphors, Online & CDROM Review, 21:331-340.
D. Bawden (1986), Information systems and the stimulation of creativity, Journal of Information Science, 12:203-216.
T. Berners-Lee (1997), Realising the full potential of the Web. World Wide Web Consortium, (http://www.w3.org/1998/02/Potential.html; visited 08.11.2000).
K. Bharat, M. Henzinger (1998), Improved algorithms for topic distillation in a hyperlinked environment. In: W. B. Croft et al. (Eds.). Proceedings of the 21st annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM Press, pp. 104-111.
L. BjÖrneborn (2000), Verdensvævet som “small-world”-netværk og mulighedsrum: omridset af en forståelsesmodel for transversale links på World Wide Web. [“Small-World” Web and Possibility Space: outlining a conceptual framework for transversal links on the WWW]. Master.s Thesis. Royal School of Library and Information Science, Copenhagen.
S. Brin, L. Page (1998), The anatomy of a large-scale hypertextual Web search engine, WWW7 Conference, (http://www-db.stanford.edu/~backrub/google.html; visited 08.11.2000).
A. Broder et al. (2000), Graph structure in the Web, WWW9 Conference. (http://www.almaden.ibm.com/cs/k53/www9.final; visited 08.11.2000)
V. Bush (1945), As we may think, The Atlantic Monthly, 176 (July) 641-649.
S. J. Clarke, P. Willett (1997), Estimating the recall performance of Web search engines, Aslib Proceedings, 49:184-189.
M. P. Courtois, M. W. Berry (1999), Results ranking in Web search engines, Online, (May/June) 39-46.
B. Cronin, G. Mckim (1996), Science and scholarship on the World Wide Web: A North American perspective, Journal of Documentation, 52:163-172.
L. Cui (1999), Rating health Web sites using the principles of citation analysis: A bibliometric approach. Journal of Medical Internet Research, 1(1) e4 (ISSN: 1438-8871) (http://www.jmir.org/1999/1/e4/index.htm; visited 08.11.2000).
R. Davies (1989), The creation of new knowledge by information retrieval and classification, Journal of Documentation, 45:273-301.
H. DE Jong, A. Rip (1997), The computer revolution in science: steps towards the realization of computer-supported discovery environments, Artificial Intelligence, 91:225-256.
A. Dillon, B. A. Gushrowski (2000), Genres and the Web: Is the personal home page the first uniquely digital genre? Journal of the American Society for Information Science, 51:202-205.
O. Etzioni (1996), The World-Wide Web: quagmire or gold mine?, Communications of the ACM, 39 (Nov.) 65-68.
G. Eysenbach (1998), Towards quality management of medical information on the Internet: Evaluation, labelling, and filtering of information, British Medical Journal, 317:1496-1502.
U. Fayyad, G. Piatetsky-Shapiro, P. Smyth (1996), The KDD process for extracting useful knowledge from volumes of data, Communications of the ACM, 39 (Nov.) 27-34.
N. Ford (1999), Information retrieval and creativity: towards support for the original thinker, Journal of Documentation, 55:528-542.
W. J. Frawley, G. Piatetsky-Shapiro, C. J. Matheus, Knowledge discovery in databases: An overview, In: G. PIATETSKY-SHAPIRO, W.J. Frawley (Eds). Knowledge discovery in databases. Menlo Park, Cal.: AAAI Press, 1991
E. Garfield (1966), The who and why of ISI, Essays of an Information Scientist, 1 (1962-73) 33-37. Originally printed in Karger Gazette, March 5, 1966.
E. Garfield (1994), Linking literatures: An intriguing use of the citation index, Current Contents, 21 (May 23) 3-5.
D. Gibson, J. Kleinberg, P. Raghavan (1998), Inferring web communities from link topology, Proceedings of the 9th ACM Conference on Hypertext and Hypermedia. (http://www.cs.cornell.edu/home/kleinber/ht98.pdf; visited 08.11.2000).
M. S. Granovetter (1973), The strength of weak ties, American Journal of Sociology, 78:1360-1380.
P. Ingwersen (1998), The calculation of Web Impact Factors, Journal of Documentation, 54:236-243.
J. T. Klein (1996), Crossing boundaries: knowledge, disciplinarities, and interdisciplinarities, Charlottesville, Virg.: University Press of Virginia.
J. M. Kleinberg (1998), Authoritative sources in a hyperlinked environment, Proceedings of the 9th annual ACM-SIAM Symposium on Discrete Algorithms, pp. 668-677.
M. Kochen (Ed.) (1989), The small world. Norwood, N. J.: Ablex Publishing Corporation.
F. W. Lancaster, J.-L. Lee (1985), Bibliometric techniques applied to issues management: A case study, Journal of the American Society for Information Science, 36:389-397.
R. Larson (1996), Bibliometrics of the World Wide Web: An exploratory analysis of the intellectual structure of cyberspace. In: S. HARDIN (Ed.) Proceedings of the 59th Annual Meeting of the American Society for Information Science, 33:71-78.
S. Lawrence, C. L. Giles (1998), Searching the World Wide Web. Science, 280:98-100.
P. Losiewicz, D. W. Oard, R. N. Kostoff (2000), Textual data mining to support science and technology management, Journal of Intelligent Information Systems, 15:99-119.
S. Milgram (1967), The small-world problem, Psychology Today, 1:60-67.
C. Oppenheim, A. Morris, C. Mcknight (2000), The evaluation of WWW search engines. Journal of Documentation, 56:190-211.
S. J. Pierce (1999), Boundary crossing in research literatures as a means of interdisciplinary information transfer, Journal of the American Society for Information Science, 50:271-279.
J. Qin, M. J. Norton (1999) (Eds). Introduction (In issue: Knowledge Discovery in Bibliographic Databases). Library Trends, 48 (Summer) 1-8.
J. M. Rodriguez i Gairin (1997), Volorando el impacto de la informacion en Internet: Altavista, el “Citation Index” de la Red. Revista Espanola de Documentacion Scientifica, 20 (2):175-181.
R. Rousseau (1997), Sitations: An exploratory study. Cybermetrics, 1 paper 1. ISSN: 1137-5019. (http://www.cindoc.csic.es/cybermetrics/articles/v1i1p1.html; visited 08.11.2000).
R. Rousseau (1999), Daily time series of common single word searches in AltaVista and NorthernLight. Cybermetrics, 2/3, paper 2. ISSN: 1137-5019. (http://www.cindoc.csic.es/cybermetrics/articles/v2i1p2.html; visited 08.11.2000).
H. Small (1999), A passage through science: Crossing disciplinary boundaries, Library Trends, 48 (Summer) 72-108.
A. G. Smith (1999), A tale of two web spaces: Comparing sites using web impact factors. Journal of Documentation, 55:577-592.
H. Snyder, H. Rosenbaum (1999), Can search engines be used as tools for web-link analysis? A critical view, Journal of Documentation, 55:375-384.
D. R. Swanson (1986), Undiscovered public knowledge, Library Quarterly, 56:103-118.
D. R. Swanson, N. R. Smalheiser (1997), An interactive system for finding complementary literatures: A stimulus to scientific discovery, Artificial Intelligence, 91:183-203.
D. R. Swanson, N. R. Smalheiser (1999), Implicit text linkages between Medline records: using Arrowsmith as an aid to scientific discovery, Library Trends, 48(Summer) 48-59.
M. Thelwall (2000), Web impact factors and search engine coverage, Journal of Documentation, 56:185-189.
R. E. ValdÉs-Perez (1999), Principles of human-computer collaboration for knowledge discovery in science, Artificial Intelligence, 107:335-346.
B. Vickery (1997), Knowledge discovery from databases: an introductory review, Journal of Documentation, 53:107-122.
D. J. Watts (1999), Small worlds: the dynamics of networks between order and randomness, Princeton University Press, Princeton, N.J.
D. J. Watts, S. H. Strogatz (1998), Collective dynamics of “small-world” networks, Nature, 393 (June 4) 440-442.
I. Wormell (2000), Critical aspects of the Danish welfare state as revealed by issue tracking, Scientometrics, 4:237-250.
E. Wynn, J. E. Katz (1997), Hyperbole over cyberspace: Self-presentation and social boundaries in Internet home pages and discourse, Information Society, 13:297-327.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Björneborn, L., Ingwersen, P. Perspective of webometrics. Scientometrics 50, 65–82 (2001). https://doi.org/10.1023/A:1005642218907
Issue Date:
DOI: https://doi.org/10.1023/A:1005642218907