Influential tweeters in relation to highly cited articles in altmetric big data

Scientometrics

Abstract

The relationship between influential tweeters and highly cited articles in the field of information sciences was analysed using Twitter data gathered by Altmetric.com from July 2011 through February 2017. The dataset comprises more than 10,000 tweets; the mentions, retweets and follower relations in these tweets were used to build a connected, undirected graph. The most influential tweeters are identified by the drop in the eigenvalue of the graph's adjacency (or affinity) matrix when nodes are removed: the nodes whose deletion causes the greatest drop in the eigenvalue are considered the most influential. The machine-learning model applied in this work uses a feature vector containing the accumulated sum of the rank scores of the influential users who tweet a given article, along with known altmetric features such as the user type and post counts for various social media. The supervised-learning model was trained using Random Forest and Support Vector Machine classifiers with 11 features, including the sum of the ranks of influential users who tweet a given article in our dataset. The results were analysed using Receiver Operating Characteristic (ROC) curves and Precision Recall (PR) curves, which show improved performance relative to the baseline model. We found that, for the classification of highly cited articles, the Twitter users' influence score is the most important feature. Finally, we show that our model, which was trained with the influence score taken into consideration, outperforms the baseline, at 79% for ROC and 90% for PR with the Random Forest model, effectively identifying the highly cited articles.
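The node-removal ranking described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation (their code is at the URL given in the Notes). It assumes that "the eigenvalue" means the largest eigenvalue of an unweighted adjacency matrix, estimates it with power iteration, and scores each node by recomputing the eigenvalue after deleting that node. The function names (`build_graph`, `largest_eigenvalue`, `eigen_drop_scores`, `influence_feature`) and the toy user names are hypothetical.

```python
import math
from typing import Dict, Iterable, Set, Tuple

Graph = Dict[str, Set[str]]


def build_graph(edges: Iterable[Tuple[str, str]]) -> Graph:
    """Build an undirected graph as an adjacency dict from (user, user) pairs."""
    graph: Graph = {}
    for u, v in edges:
        graph.setdefault(u, set()).add(v)
        graph.setdefault(v, set()).add(u)
    return graph


def largest_eigenvalue(graph: Graph, iterations: int = 1000, tol: float = 1e-12) -> float:
    """Estimate the largest adjacency eigenvalue by power iteration.

    The iteration multiplies by (A + I) rather than A so that it also
    converges on bipartite graphs; the returned value is the Rayleigh
    quotient with respect to A itself.
    """
    nodes = list(graph)
    if not nodes:
        return 0.0
    vec = {n: 1.0 for n in nodes}
    estimate = 0.0
    for _ in range(iterations):
        nxt = {n: vec[n] + sum(vec[m] for m in graph[n]) for n in nodes}
        norm = math.sqrt(sum(x * x for x in nxt.values()))
        nxt = {n: x / norm for n, x in nxt.items()}
        new_estimate = sum(nxt[n] * sum(nxt[m] for m in graph[n]) for n in nodes)
        converged = abs(new_estimate - estimate) < tol
        vec, estimate = nxt, new_estimate
        if converged:
            break
    return estimate


def eigen_drop_scores(graph: Graph) -> Dict[str, float]:
    """Score each node by how much the largest eigenvalue falls when it is deleted."""
    base = largest_eigenvalue(graph)
    scores = {}
    for node in graph:
        reduced = {n: nbrs - {node} for n, nbrs in graph.items() if n != node}
        scores[node] = base - largest_eigenvalue(reduced)
    return scores


def influence_feature(tweeters: Iterable[str], scores: Dict[str, float]) -> float:
    """Sum the influence scores of the users who tweeted one article.

    Users absent from the graph contribute 0 (an assumption of this sketch).
    """
    return sum(scores.get(user, 0.0) for user in tweeters)


if __name__ == "__main__":
    # Toy network: one hub account connected to four other accounts.
    toy = build_graph([("hub", "a"), ("hub", "b"), ("hub", "c"), ("hub", "d")])
    ranking = sorted(eigen_drop_scores(toy).items(), key=lambda kv: kv[1], reverse=True)
    print(ranking)
```

Recomputing the eigenvalue once per node is only practical for small graphs; on a network of realistic size an approximation would likely be needed. The value returned by `influence_feature` corresponds to the per-article feature that the abstract describes as the accumulated sum of the scores of the influential users who tweeted the article.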

Notes

  1. The data and code used in this research can be downloaded at the following URL: https://github.com/slab-itu/influential_twitter_users/.

Acknowledgements

The research work has been supported by the NRPU grant no. 6857/Punjab/NRPU/R&D/HEC/2016 funded by the Higher Education Commission of Pakistan.

Author information

Corresponding author

Correspondence to Saeed-Ul Hassan.

About this article

Cite this article

Hassan, SU., Bowman, T.D., Shabbir, M. et al. Influential tweeters in relation to highly cited articles in altmetric big data. Scientometrics 119, 481–493 (2019). https://doi.org/10.1007/s11192-019-03044-9

  • DOI: https://doi.org/10.1007/s11192-019-03044-9
