Abstract
The availability of network big data, such as those from online users’ surfing records, communication records, and e-commerce records, makes it possible for us to probe into and quantify the regular patterns of users’ long-range and complex interactions between websites. If we see the Web as a virtual living organism, according to the metabolic theory, the websites must absorb “energy” to grow, reproduce, and develop. We are interested in the following two questions: 1) where does the “energy” come from? 2) will the websites generate macro influence on the whole Web based on the “energy”? Our data consist of more than 30 000 online users’ surfing log data from China Internet Network Information Center. We would consider the influence as metabolism and users’ attention flow as the energy for the websites. We study how collective attention distributes and flows among different websites by the empirical attention flow network. Different from traditional studies which focused on information flow, we study users’ attention flow, which is not only a “reversed” way to study Web structure and transmission mode, but also the first step to understand the underlying dynamics of the World Wide Web. We find that the macro influence of websites scales sub-linearly against the collective attention flow dwelling time, which is not consistent with the heuristics that the more users’ dwelling time is, the greater influence a website will have. Further analysis finds a supper-linear scaling relationship between the influence of websites and the attention flow intensity. This is a websites version of Kleiber’s law. We further notice that the development cycle of the websites can be split into three phases: the uncertain growth phase, the partially accelerating growth phase, and the fully accelerating growth phase. We also find that compared with the widespread hyperlinks analysis models, the attention flow network is an effective theoretical tool to estimate and rank websites.
Similar content being viewed by others
References
Watts D J, Strogatz S H. Collective dynamics of ‘small-world’ networks. Nature, 1998, 393(6684): 440–442.
Huberman B A, Pirolli P L T, Pitkow J E et al. Strong regu-larities in world wide web surfing. Science, 1998, 280(5360): 95–97.
Meng X F, Li Y, Zhu J. Social computing in the era of big data: Opportunities and challenges. Journal of Computer Research and Development, 2013, 50(12): 2483–2491. (in Chinese)
Lazer D, Pentland A, Adamic L et al. Computational social science. Science, 2009, 323(5915): 721–723.
Estrin D. Small data, where n=me. Communications of the ACM, 2014, 57(4): 32–34.
Simon H A. Designing organizations for an information-rich world. In Computers, Communication, and the Public Interest, Martin G (ed.), Baltimore, MD: The Johns Hopkins Press, 1971, pp.40–41.
Kleiber M. Body size and metabolism. Hilgardia: A Journal of Agricultural Science, 1932, 6(11): 315–353.
Zhang J, Guo L. Scaling behaviors of weighted food webs as energy transportation networks. Journal of Theoretical Biology, 2010, 264(3): 760–770.
Wu F, Huberman B A. Novelty and collective attention. Proceedings of the National Academy of Sciences, 2007, 104(45): 17599–17601.
Falkinger J. Attention economies. Journal of Economic Theory, 2007, 133(1): 266–294.
Qiu J, Ren Q. Link analysis and web influence evaluation for enterprise websites. In Proc. the 7th IFIP International Conference on e-Business, e-Services, and e-Society (I3E2007), Oct. 2007, pp.300–308.
Brin S, Page L. The anatomy of a large-scale hypertextual web search engine. In Proc. the 7th Int. World Wide Web Conference (WWW1998), April 1998, pp.107–117.
Broder A, Kumar R, Maghoul F et al. Graph structure in the web. Computer Networks, 2000, 33(1/2/3/4/5/6): 309–320.
Bianchini M, Gori M, Scarselli F. Inside PageRank. ACM Transactions on Internet Technology, 2005, 5(1): 92–128.
Kleinberg J M. Authoritative sources in a hyperlinked environment. Journal of the ACM, 1999, 46(5): 604–632.
Cattuto C, Loreto V, Pietronero L. Semiotic dynamics and collaborative tagging. Proceedings of the National Academy of Sciences, 2007, 104(5): 1461–1464.
Aral S, Walker D. Identifying influential and susceptible members of social networks. Science, 2012, 337(6092): 337–341.
Liu L, Tang J, Han J et al. Mining topic-level influence in heterogeneous networks. In Proc. the 19th ACM Int. Conference on Information and Knowledge Management (CIKM2010), Oct. 2010, pp.199–208.
Cui P, Wang F, Liu S et al. Who should share what?: Item-level social influence prediction for users and posts ranking. In Proc. the 34th Int. ACM Conference on Research and Development in Information Retrieval (SIGIR2011), July 2011, pp.185–194.
Barabási A L. Network science: Luck or reason. Nature, 2012, 489: 507–508.
Serrano M Á, Boguñá M, Vespignani A. Extracting the multiscale backbone of complex weighted networks. Proceedings of the National Academy of Sciences, 2009, 106 (16): 6483–6488.
Zhang J, Feng Y. Common patterns of energy flow and biomass distribution on weighted food webs. Physica A: Statistical Mechanics and Its Applications, 2014, 405(C): 278–288.
Shi P, Zhang J, Yang B et al. Hierarchicality of trade flow networks reveals complexity of products. PLoS ONE, 2014, 9(6): e98247.
Wang D, Song C, Barabási1 A L. Quantifying long-term scientific impact. Science, 2013, 342(6154): 127–132.
Barabasi A L, Albert R. Emergence of scaling in random networks. Science, 1999, 286(5439): 509–512.
Backstrom L, Boldi P, Rosa M et al. Four degrees of separation. In Proc. the 4th Annual ACM Web Science Conference (WebSci2012), June 2012, pp.33–42.
Loeuille N, Loreau M. Evolutionary emergence of size-structured food webs. Proceedings of the National Academy of Sciences, 2005, 102(16): 5761–5766.
Zhang J. Energy flows in complex ecological systems: A review. Journal of Systems Science and Complexity, 2009, 22(3): 345–359.
Wu L, Zhang J, Zhao M. The metabolism and growth of Web forums. PLoS ONE, 2014, 9(8): e102646.
Kumar R, Tomkins A. A characterization of online browsing behavior. In Proc. the 19th Int. World Wide Web Conference (WWW2010), April 2010, pp.561–570.
Stojmirović A, Yu Y. Information flow in interaction networks II: Channels, path lengths, and potentials. Journal of Computational Biology, 2012, 19(4): 379–403.
Kemeny J, Snell J. Finite Markov Chains. Springer-Verlag Press, 1976, pp.45–49.
Garlaschelli D, Caldarelli G, Pietronero L. Universal scaling relations in food webs. Nature, 2003, 423(6936): 165–168.
Martnez-Mekler G, Martnez R A, del Rio M B et al. Universality of rank-ordering distributions in the arts and sciences. PloS ONE, 2009, 4(3): e4791.
Newman M E J. Power laws, Pareto distributions and Zipf’s law. Contemporary Physics, 2005, 46(5): 323–351.
Zhu J H, Meng T, Xie Z M et al. A teapot graph and its hierarchical structure of the Chinese web. In Proc. the 17th Int. World Wide Web Conference (WWW2008), April 2008, pp.1133–1134.
Li S, Guan Z, Tang L et al. Exploiting consumer reviews for product feature ranking. Journal of Computer Science and Technology, 2012, 27(3): 635–649.
Author information
Authors and Affiliations
Corresponding author
Additional information
Special Section on Networking and Distributed Computing for Big Data
The work was supported by the Research Funds of Renmin University of China under Grant No. 11XNL010§the National Natural Science Foundation of China under Grant Nos. 61379050, 91224008, and 61532010, and the Specialized Research Fund for the Doctoral Program of Higher Education of China under Grant No. 20130004130001.
Rights and permissions
About this article
Cite this article
Li, Y., Zhang, J., Meng, XF. et al. Quantifying the Influence of Websites Based on Online Collective Attention Flow. J. Comput. Sci. Technol. 30, 1175–1187 (2015). https://doi.org/10.1007/s11390-015-1592-4
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11390-015-1592-4