Skip to main content

Finding Organizational Accounts Based on Structural and Behavioral Factors on Twitter

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10899))

Abstract

Various socio-political organizations, from activist groups to propaganda campaigners, create accounts on Twitter to reach out, influence and gain followers. In order to analyze the impact of these organizational accounts, the first step is to identify them. In this paper, we develop and experiment with a set of network-based, behavioral, temporal and spatial characteristics in these accounts, independent of domain or language, to identify features that can be useful in detecting organizational accounts. In order to assess this model, we experimented with a microblog corpus comprised of over 7 million tweets from 150,000 Twitter users in Bangladesh, tweeted between June and October 2016. We sampled 31,139 accounts using cold-start heuristics to locate and label nearly 200 organizational accounts, distributed as 68 NGOs, 62 news outlets, 35 political groups, and 17 public intellectual and iconic figures. The remaining accounts were labeled as individuals. Next, we developed a set of features and experimented with a set of linear and non-linear classifiers. The highest performing sparse logistic regression classifier achieved an accuracy of 68.2% precision and 64.4% recall leading to a 66.2% F1-score in detecting less than 1% rare organizational accounts using a set of content- and language-independent features.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    Twitter Usage Statistics Report, Internet Live Stats, http://www.internetlivestats.com/twitter-statistics/.

  2. 2.

    Since public intellectuals, leaders and celebrities share similar spatial, behavioral and connectivity related characteristics with group and organizational accounts, we labeled them together as “organizational” - as opposed to “individual” accounts.

  3. 3.

    Internet World Stats. Bangladesh Internet Usage and Telecommunications Reports. http://www.internetworldstats.com/asia/bd.htm.

References

  1. Baringhaus, L., Franz, C.: On a new multivariate two-sample test. J. Multivar. Anal. 88, 190–206 (2004)

    Article  MathSciNet  Google Scholar 

  2. Bonacich, P.: Factoring and weighting approaches to status scores and clique identification. J. Math. Sociol. 2(1), 113–120 (1972)

    Article  Google Scholar 

  3. Can, E.F., Oktay, H., Manmatha, R.: Predicting retweet count using visual cues. In: Proceedings of the 22nd ACM International Conference on Conference on Information & Knowledge Management. ACM (2013)

    Google Scholar 

  4. Celli, F., Rossi, L.: The role of emotional stability in twitter conversations. In: Proceedings of WSASM. Association for Computational Linguistics (2012)

    Google Scholar 

  5. De Silva, L., Riloff, E.: User type classification of tweets with implications for event recognition. In: ACL 2014, p. 98 (2014)

    Google Scholar 

  6. McCorriston, J., et al.: Organizations are users too: characterizing and detecting the presence of organizations on twitter. In: ICWSM (2015)

    Google Scholar 

  7. Wagner, C., et al.: The wisdom in tweetonomies: acquiring latent conceptual structures from social awareness streams. In: Proceedings of the 3rd ISSW. ACM (2010)

    Google Scholar 

  8. Conover, M.D., et al.: Predicting the political alignment of twitter users. In: Privacy, Security, Risk and Trust (PASSAT), pp. 192–199. IEEE (2011)

    Google Scholar 

  9. Rao, D., et al.: Classifying latent user attributes in Twitter. In: Proceedings of the 2nd International Workshop on Search and Mining User-generated Contents. ACM (2010)

    Google Scholar 

  10. Burger, J., et al.: Discriminating gender on Twitter. In: EMNLP 2011 (2011)

    Google Scholar 

  11. Kim, S.M., et al.: Distinguishing individuals from organisations on twitter. In: WWW 2017 (2017)

    Google Scholar 

  12. Varol, O., et al.: Online human-bot interactions: Detection, estimation, and characterization (2017) arXiv preprint arXiv:1703.03107

  13. Page, L., et al.: The pagerank citation ranking: bringing order to the web. Technical report, Stanford InfoLab (1999)

    Google Scholar 

  14. Wu, S., et al.: Who says what to whom on twitter. In: Proceedings of the 20th International Conference on World Wide Web. ACM (2011)

    Google Scholar 

  15. Yasugi, N., et al.: Use of twitter as an instrument for disseminating public information in providing public goods and roles of e-government: evidence from Japanese prefectures. Int. J. Eng. Innovative Technol. 3, 128–133 (2013)

    Google Scholar 

  16. Katz, E.: The two-step flow of communication: an up-to-date report on an hypothesis. Public Opin. Q. 21, 61–78 (1957)

    Article  Google Scholar 

  17. Klemm, K., Eguiluz, V.M.: Growing scale-free networks with small-world behavior. Phys. Rev. E 65, 057102 (2002)

    Article  Google Scholar 

  18. Mahmud, J., Nichols, J., Drews, C.: Home location identification of twitter users. ACM Trans. Intell. Syst. Technol. 5, 47 (2014)

    Article  Google Scholar 

  19. Nacos, B.: Politics and the twitter revolution: how tweets influence the relationship between political leaders and the public. Polit. Sci. Q. 128, 178–179 (2013)

    Article  Google Scholar 

  20. Nguyen, D.-P., Gravel, R., Trieschnigg, D., Meder, T.: “How Old Do You Think I Am?” a study of language and age in twitter (2013)

    Google Scholar 

  21. Sabidussi, G.: The centrality index of a graph. Psychometrika 31, 581–606 (1966)

    Article  MathSciNet  Google Scholar 

  22. Savage, D., Zhang, X., Yu, X., Chou, P., Wang, Q.: Anomaly detection in online social networks. Soc. Netw. 39, 62–70 (2014)

    Article  Google Scholar 

  23. Seidman, S.: Network structure and minimum degree. Soc. Netw. 5, 269–287 (1983)

    Article  MathSciNet  Google Scholar 

  24. Tomek, I.: Two modifications of CNN. IEEE Trans. Syst. Man Cybern. 6, 769–772 (1976)

    MathSciNet  MATH  Google Scholar 

  25. Watts, D.J., Strogatz, S.H.: Collective dynamics of ‘small-world’ networks. Nature 393, 440–442 (1998)

    Article  Google Scholar 

Download references

Acknowledgements

This work was supported by the Office of Naval Research under Grant No.: N00014-16-1-2015, and N00014-15-1-2722.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sultan Alzahrani .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG, part of Springer Nature

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Alzahrani, S., Gore, C., Salehi, A., Davulcu, H. (2018). Finding Organizational Accounts Based on Structural and Behavioral Factors on Twitter. In: Thomson, R., Dancy, C., Hyder, A., Bisgin, H. (eds) Social, Cultural, and Behavioral Modeling. SBP-BRiMS 2018. Lecture Notes in Computer Science(), vol 10899. Springer, Cham. https://doi.org/10.1007/978-3-319-93372-6_18

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-93372-6_18

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-93371-9

  • Online ISBN: 978-3-319-93372-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics