Skip to main content

Extraction of Structural Information from the Web

  • Conference paper
  • 1311 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3614))

Abstract

The Web can be regarded as a huge graph when each Web page is regarded as a node and each hyperlink as an edge. There are several attempts for visualizing the structure of the Web, such as touchgraph or KartOO. In order to achieve visualization that assists users’ information acquisition from the Web, two constructs (keywords and pages) are required in the visualization. In this paper, a cluster of keywords and Web pages is regarded as “structural information” in the Web. We have developed a visualization system that shows clusters of Web pages and keywords. Based on online Web resources, appropriate relations can be visualized without analyzing the contents of Web pages.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Barabasi, A.-L.: LINKED – The New Science of Networks. Perseus Publishing, Cambridge (2002)

    Google Scholar 

  2. Broder, A., Kumar, R., Maghoul, F., Raghavan, P., Rajagopalan, S., Stata, R., Tomkins, A., Wiener, J.: Graph Structure in the Web: Experiments and models. In: Proc. of the 9th WWW Conference, pp. 309–320 (2000)

    Google Scholar 

  3. Flake, G.W., Lawrence, S., Giles, C.L., Coetzee, F.M.: Self-Organization and Identification of Web Communities. IEEE Computer 35(3), 66–71 (2002)

    Google Scholar 

  4. Kleinberg, J., Kumar, R., Raghavan, P., Rajagopalan, S., Tomkins, A.: The Web as a Graph: Measurements, Models, and Methods. In: Asano, T., Imai, H., Lee, D.T., Nakano, S.-i., Tokuyama, T. (eds.) COCOON 1999. LNCS, vol. 1627, pp. 1–17. Springer, Heidelberg (1999)

    Chapter  Google Scholar 

  5. Kosala, R., Blockeel, H.: Web Mining Research: A Survey. ACM SIGKDD Explorations 2(1), 1–15 (2000)

    Article  Google Scholar 

  6. Kumar, R., Raghavan, P., Rajagopalan, S., Tomkins, A.: Trawling the Web for Emerging Cyber-Communities. In: Proc. of the 8th WWW Conference (1999)

    Google Scholar 

  7. Murata, T.: Discovery of Web Communities Based on the Co-occurrence of References. In: Morishita, S., Arikawa, S. (eds.) DS 2000. LNCS (LNAI), vol. 1967, pp. 65–75. Springer, Heidelberg (2000)

    Chapter  Google Scholar 

  8. Murata, T.: Finding Related Web Pages Based on Connectivity Information from a Search Engine. In: Poster Proc. of 10th WWW conference, pp. 18–19 (2001)

    Google Scholar 

  9. Murata, T.: Visualizing the Structure of Web Communities Based on Data Acquired from a Search Engine. IEEE Transactions on Industrial Electronics 50(5), 860–866 (2003)

    Article  Google Scholar 

  10. Page, L., Brin, S., Motwani, R., Winograd, T.: The PageRank Citation Ranking: Bringing Order to the Web., Online manuscript (1998), http://www-db.stanford.edu/~backrub/pageranksub.ps

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Murata, T. (2005). Extraction of Structural Information from the Web. In: Wang, L., Jin, Y. (eds) Fuzzy Systems and Knowledge Discovery. FSKD 2005. Lecture Notes in Computer Science(), vol 3614. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11540007_159

Download citation

  • DOI: https://doi.org/10.1007/11540007_159

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-28331-7

  • Online ISBN: 978-3-540-31828-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics