Rapid modeling and analyzing networks extracted from pre-structured news articles

  • SI: Data to Model
Computational and Mathematical Organization Theory

Abstract

In the face of uprisings and revolutions occurring in several countries within a short period of time (Arab Spring 2011), the need for fast network assessments is compelling. In this article, we present a rapid network assessment approach that uses a vast amount of pre-indexed news data to provide an up-to-date overview of, and orientation in, emerging and ongoing incidents. We describe the fully automated process of preparing the data and creating the dynamic meta-networks. We also describe the network analytical measures that we use to identify important topics, persons, organizations, and locations in these networks. With our rapid network modeling and analysis approach, first results can be provided within hours. In the exploratory study of this article, we use 108,000+ articles from 600+ English-language news sources discussing Egypt, Libya, and Sudan within a time period of 18 months to show an application scenario of our approach. In particular, we look at the involvement of other countries and their politicians during time periods of major incidents.
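
As a rough illustration of the idea of turning pre-indexed articles into a meta-network, the following minimal Python sketch links index terms that are attached to the same article and ranks them by weighted degree. It is not the authors' pipeline: the article records, the placeholder names ("Person A", "Organization X"), the helper functions, and the choice of weighted degree as the importance measure are all assumptions made for illustration.

```python
from collections import Counter
from itertools import combinations

# Hypothetical pre-indexed articles: each maps a node class to the index terms
# attached to that article. All values are invented for illustration.
articles = [
    {"location": ["Egypt"], "person": ["Person A"], "topic": ["protests"]},
    {"location": ["Egypt", "Libya"], "topic": ["protests"]},
    {"location": ["Libya"], "organization": ["Organization X"]},
]


def build_meta_network(articles):
    """Count how often two (class, term) nodes are indexed on the same article."""
    edges = Counter()
    for article in articles:
        # One node per distinct (node class, index term) pair in this article.
        nodes = sorted({(cls, term) for cls, terms in article.items() for term in terms})
        # Every unordered pair of co-indexed nodes adds 1 to the edge weight.
        for a, b in combinations(nodes, 2):
            edges[(a, b)] += 1
    return edges


def weighted_degree(edges):
    """Sum the edge weights incident to each node."""
    degree = Counter()
    for (a, b), weight in edges.items():
        degree[a] += weight
        degree[b] += weight
    return degree


edges = build_meta_network(articles)
degree = weighted_degree(edges)

# List nodes from most to least connected.
for node, score in degree.most_common():
    print(node, score)
```

Keeping the node class next to each term preserves the multi-mode character of a meta-network (persons, organizations, locations, and topics in one structure). Grouping the articles by time slice before calling `build_meta_network` would give one network per period, which is one simple way to approximate a dynamic view.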

Notes

  1. http://www.lexisnexis.com/hottopics/lnacademic/?

  2. As week 1 starts on Monday, January 3rd, New Year’s Day is part of week 52.

  3. For Russia, Vladimir Putin and Dmitry Medvedev are both players in international media, but Putin is more important.
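
The week numbering described in note 2 is consistent with ISO 8601 week dates, which can be checked with Python's standard library. This is only a sketch: the year 2011 is an assumption inferred from the Arab Spring period named in the abstract, and the source does not state which week-numbering standard the authors used.

```python
from datetime import date

# Assumption: the year is 2011, in which January 3rd falls on a Monday.
new_years_day = date(2011, 1, 1)
first_monday = date(2011, 1, 3)

# isocalendar() returns (ISO year, ISO week number, ISO weekday).
iso_year, iso_week, iso_weekday = new_years_day.isocalendar()
print(iso_year, iso_week, iso_weekday)  # 2010 52 6

# The Monday that follows opens week 1 of 2011.
print(first_monday.isocalendar()[1])  # 1
```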

Acknowledgements

This work is supported in part by the Office of Naval Research (ONR), United States Navy (ONR MURI N000140811186, ONR MMV N00014060104). The views and conclusions contained in this document are those of the authors and should not be interpreted as representing the official policies, either expressed or implied, of the Office of Naval Research or the U.S. government. The authors wish to acknowledge Jeff Reminga, who has been instrumental in developing much of the related technology, and Bradley Schmerl for his efforts to integrate the data-to-network process of this article into SORASCS.

Author information

Corresponding author

Correspondence to Jürgen Pfeffer.

About this article

Cite this article

Pfeffer, J., Carley, K.M. Rapid modeling and analyzing networks extracted from pre-structured news articles. Comput Math Organ Theory 18, 280–299 (2012). https://doi.org/10.1007/s10588-012-9122-1

  • DOI: https://doi.org/10.1007/s10588-012-9122-1
