skip to main content
10.1145/2808797.2809380acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
short-paper

Breaking the News: Extracting the Sparse Citation Network Backbone of Online News Articles

Authors Info & Claims
Published:25 August 2015Publication History

ABSTRACT

Networks of online news articles and blog posts are some of the most commonly used data sets in network science. As a result, they have become a vital piece of network analysis and are used for the evaluation of algorithms that work on large networks, or serve as examples in the analysis of information diffusion and propagation. Similarly, scientific citation networks are part of the bedrock upon which much of modern network analysis is built and have been studied for decades. In this paper, we show that the backbone inherent to networks of online news articles shares significant structural similarities to scientific citation networks once the noise of spurious links is stripped away. We present a data set of news articles that, while it is extremely sparse and lightweight, still contains information relevant to the propagation of information in mass media and is remarkably similar to scientific citation networks, thus opening the door to the use of established methodologies from scientometrics and bibliometrics in the analysis of online news propagation.

References

  1. M. Cha, J. Pérez, and H. Haddadi, "Flash floods and ripples: The spread of media content through the blogosphere," in ICWSM '09, 2009.Google ScholarGoogle Scholar
  2. S. A. Myers, C. Zhu, and J. Leskovec, "Information diffusion and external influence in networks," in KDD '12. ACM, 2012, pp. 33--41. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. J. Yang and J. Leskovec, "Modeling information diffusion in implicit networks," in ICDM '10. IEEE, 2010, pp. 599--608. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. M. Atkinson and E. Van der Goot, "Near real time information mining in multilingual news," in WWW '09. ACM, 2009, pp. 1153--1154. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. B. E. Teitler, M. D. Lieberman, D. Panozzo, J. Sankaranarayanan, H. Samet, and J. Sperling, "Newsstand: A new view on news," in SIGSPATIAL '08. ACM, 2008, p. 18. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. I. Flaounas, M. Turchi, O. Ali, N. Fyson, T. De Bie, N. Mosdell, J. Lewis, and N. Cristianini, "The structure of the EU mediasphere," PloS one, vol. 5, no. 12, p. e14243, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  7. J. Leskovec, L. Backstrom, and J. Kleinberg, "Meme-tracking and the dynamics of the news cycle," in KDD '09. ACM, 2009, pp. 497--506. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. E. Garfield, "Citation analysis as a tool in journal evaluation," Science, vol. 178, no. 4060, pp. 471--479, 1972.Google ScholarGoogle ScholarCross RefCross Ref
  9. J. E. Hirsch, "An index to quantify an individual's scientific research output," PNAS, vol. 102, no. 46, pp. 16 569--16 572, 2005.Google ScholarGoogle ScholarCross RefCross Ref
  10. F. Radicchi, S. Fortunato, and A. Vespignani, "Citation networks," in Models of Science Dynamics. Springer, 2012, pp. 233--257.Google ScholarGoogle Scholar
  11. A. B. Jaffe and M. Trajtenberg, Patents, Citations, and Innovations: A Window on the Knowledge Economy. MIT Press, 2002.Google ScholarGoogle Scholar
  12. J. H. Fowler and S. Jeon, "The authority of supreme court precedent," Social networks, vol. 30, no. 1, pp. 16--30, 2008.Google ScholarGoogle ScholarCross RefCross Ref
  13. A. Spitz and E.-Á. Horvát, "Measuring long-term impact based on network centrality: Unraveling cinematic citations," PloS one, vol. 9, no. 10, p. e108857, 2014.Google ScholarGoogle ScholarCross RefCross Ref
  14. M. Gamon, S. Basu, D. Belenko, D. Fisher, M. Hurst, and A. C. König, "Blews: Using blogs to provide context for news articles." in ICWSM '08, 2008.Google ScholarGoogle Scholar
  15. D. Easley and J. Kleinberg, Networks, Crowds, and Markets: Reasoning about a highly connected world. Cambridge University Press, 2010. Google ScholarGoogle ScholarCross RefCross Ref
  16. L. Lloyd, D. Kechagias, and S. Skiena, "Lydia: A system for large-scale news analysis," in SPIRE '05. Springer, 2005, pp. 161--166. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. R. Albert and A.-L. Barabási, "Statistical mechanics of complex networks," Reviews of modern physics, vol. 74, no. 1, p. 47, 2002.Google ScholarGoogle ScholarCross RefCross Ref
  18. M. E. Newman, "Mixing patterns in networks," Physical Review E, vol. 67, no. 2, p. 026126, 2003.Google ScholarGoogle ScholarCross RefCross Ref
  19. J. G. Foster, D. V. Foster, P. Grassberger, and M. Paczuski, "Edge direction and the structure of networks," PNAS, vol. 107, no. 24, pp. 10 815--10 820, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  20. S. N. Dorogovtsev and J. F. Mendes, "Evolution of networks with aging of sites," Phys Rev E, vol. 62, no. 2, p. 1842, 2000.Google ScholarGoogle ScholarCross RefCross Ref
  21. K. B. Hajra and P. Sen, "Modelling aging characteristics in citation networks," Physica A, vol. 368, no. 2, pp. 575--582, 2006.Google ScholarGoogle ScholarCross RefCross Ref
  22. Z.-X. Wu and P. Holme, "Modeling scientific-citation patterns and other triangle-rich acyclic networks," Phys Rev E, vol. 80, no. 3, p. 037101, 2009.Google ScholarGoogle ScholarCross RefCross Ref
  23. B. Bollobás and O. M. Riordan, "Mathematical results on scale-free random graphs," Handbook of graphs and networks: from the genome to the Internet, pp. 1--34, 2003.Google ScholarGoogle Scholar
  24. J. Leskovec, J. Kleinberg, and C. Faloutsos, "Graphs over time: densification laws, shrinking diameters and possible explanations," in KDD '05. ACM, 2005, pp. 177--187. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. E. Mones, P. Pollner, and T. Vicsek, "Universal hierarchical behavior of citation networks," J. Stat. Mech. Theor. Exp., vol. 2014, no. 5, p. P05023, 2014.Google ScholarGoogle ScholarCross RefCross Ref
  26. F. Radicchi, S. Fortunato, and C. Castellano, "Universality of citation distributions: Toward an objective measure of scientific impact," PNAS, vol. 105, no. 45, pp. 17 268--17 272, 2008.Google ScholarGoogle ScholarCross RefCross Ref
  27. S. P. Borgatti, "Centrality and network flow," Social networks, vol. 27, no. 1, pp. 55--71, 2005.Google ScholarGoogle ScholarCross RefCross Ref
  1. Breaking the News: Extracting the Sparse Citation Network Backbone of Online News Articles

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          ASONAM '15: Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2015
          August 2015
          835 pages
          ISBN:9781450338547
          DOI:10.1145/2808797

          Copyright © 2015 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 25 August 2015

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • short-paper
          • Research
          • Refereed limited

          Acceptance Rates

          Overall Acceptance Rate116of549submissions,21%

          Upcoming Conference

          KDD '24

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader