Skip to main content

Web Characteristics and Evolution

  • Reference work entry
Encyclopedia of Database Systems
  • 67 Accesses

Definition

Web characteristics are properties related to collections of documents accessible via the World Wide Web. There are vast numbers of properties that can be characterized. Some examples include the number of words in a document, the length of a document in bytes, the language a document is authored in, the mime-type of a document, properties of the URL that indentifies a document, HTML tags used to author a document, and the hyperlink structure created by the collection of documents.

As in the physical world, the process of change that the web continually undergoes is identified as web evolution. The web is a tremendously dynamic place, with new users, servers, and pages entering and leaving the system continuously, which causes the web to change very rapidly. Web evolution encompasses changes in all web characteristics, as defined above.

Historical Background

As early as 1994, researchers were interested in studying characteristics of the World Wide Web. As documented by...

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 2,500.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Recommended Reading

  1. Angeles Serrano M., Maguitman A., Santo Fortunato ná M.B., and Vespignani V. 2007.Decoding the structure of the www: A comparative analysis of web crawls. ACM Trans. Web, 1(2):10,

    Article  Google Scholar 

  2. Baeza-Yates R., Castillo C., and Efthimiadis E.N. 2007.Characterization of national web domains. ACM Trans. Int. Tech., 7(2): 9

    Article  Google Scholar 

  3. Broder A.Z., Glassman S.C., Manasse M.S., and Zweig G. Syntactic clustering of the web. pp. 1157–1166. 1997, In Selected papers from the sixth International Conference on World Wide Web,

    Google Scholar 

  4. Broder A., Kumar R., Maghoul F., Raghavan P., Rajagopalan S., Stata R., Tomkins A., and Wiener J. Graph structure in the web: Experiments and models. In Proc. 8th Int. World Wide Web Conference, 2002.

    Google Scholar 

  5. Charikar M.S. Similarity estimation techniques from rounding algorithms. pp. 380–388.2002, In Proc. 34th Annual ACM Symp. on Theory of Computing,

    Google Scholar 

  6. Cho J. and Garcia-Molina H. The evolution of the web and implications for an incremental crawler. pp. 200–209.2000, In Proc. 26th Int. Conf. on Very Large Data Bases,

    Google Scholar 

  7. Douglis F., Feldmann A., Krishnamurthy B., and Mogul J.C. Rate of change and other metrics: a live study of the world wide web. In Proc. 1st USENIX Symp. on Internet Tech. and Syst., 1997.

    Google Scholar 

  8. Fetterly D., Manasse M., Najork M., and Wiener J. A large-scale study of the evolution of web pages. In Proc. 12th Int. World Wide Web Conference, 2003, pp. 669–678.

    Google Scholar 

  9. Henzinger M. Finding near-duplicate web pages: a large-scale evaluation of algorithms. pp. 284–291.2006, In Proc. 32nd Annual Int. ACM SIGIR Conf. on Research and Development in Information Retrieval,

    Google Scholar 

  10. Henzinger M.R., Heydon A., Mitzenmacher M., and Najork M. On near-uniform url sampling. Comput. Netw., 33(1–6):294–308, 2000.

    Google Scholar 

  11. Lawrence S. and Giles L.C. Accessibility of information on the web. Nature, 400(6740):107–107, July 1999.

    Article  Google Scholar 

  12. Pitkow J.E. Summary of www characterizations. Comput. Netw., 30(1–7):551–558, 1998.

    Google Scholar 

  13. Woodruff A., Aoki P.M., Brewer E.A., Gauthier P., and Rowe L.A. An investigation of documents from the world wide web. Comput. Netw., 28(7–11):963–980, 1996.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer Science+Business Media, LLC

About this entry

Cite this entry

Fetterly, D. (2009). Web Characteristics and Evolution. In: LIU, L., ÖZSU, M.T. (eds) Encyclopedia of Database Systems. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-39940-9_456

Download citation

Publish with us

Policies and ethics