URWF: user reputation based weightage framework for twitter micropost classification

Bukhari, Asad; Qamar, Usman; Ghazia, Ume

doi:10.1007/s10257-016-0320-0

URWF: user reputation based weightage framework for twitter micropost classification

Original Article
Published: 08 June 2016

Volume 15, pages 623–659, (2017)
Cite this article

Information Systems and e-Business Management Aims and scope Submit manuscript

Asad Bukhari¹,
Usman Qamar¹ &
Ume Ghazia²

324 Accesses
5 Citations
Explore all metrics

Abstract

Sentiment analysis is an emerging field that helps in understanding the sentiments of users on microblogging sites. Many sentiment analysis techniques have been proposed by researchers that classify and analyze the sentiments from micropost posted by various users. Majorly, these techniques perform text based classification that does not allow predicting the micropost impact. Further, it is very difficult to analyze this huge volume of online content produced each day. Therefore, an effective technique for sentiment analysis is required that not only performs the precise text-based classification but also makes the analysis easy by reducing the volume of data. Moreover, micropost impact must also be determined in order to segregate the high impact microposts in corpus. In the present study, we have presented sentiment analysis framework that incorporates any text based classification and separates out the high impact microposts from low impact by calculating the factor of user reputation. This user reputation is calculated by considering multiple factors regarding user activities that may help organizations to know customer opinions and views about their products and services. This way, volume of data becomes small that has to be analyzed by considering only microposts posted by high impact users. Multiple text classifications classes are introduced instead of just positive, negative and neutral for precise sentiment classification. The proposed framework also calculates the accumulated weight of each micropost by multiplying the user reputation with the assigned sentiment score. The user reputation calculation factors are validated by using Spearman rho and Kendall tau correlation coefficient. The framework is further evaluated by using the Sanders topic based corpus and results are presented.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

NBA All-Star Prediction Using Twitter Sentiment Analysis

Sentiment Analysis of Smartphone Product Reviews Using Weightage Calculation

Machine Learning Techniques to Determine the Polarity of Messages on Social Networks

Notes

https://dev.twitter.com/docs/api/1.1.
α = 1 is used for this research.

References

Baccianella S, Esuli A, Sebastiani F (2010) SentiWordNet 3.0: an enhanced lexical resource for sentiment analysis and opinion mining. In: Proceedings of the seventh international conference on language resources and evaluation (LREC’10), (pp 2200–2204)
Bai X (2011) Predicting consumer sentiments from online text. Decis Support Syst 50(4):732–742. doi:10.1016/j.dss.2010.08.024
Article Google Scholar
Brooks M, Robinson JJ, Torkildson MK, Aragon CR (2014) Collaborative visual analysis of sentiment in twitter events. In: 11th international conference, CDVE 2014 (pp 1–8). Springer, Berlin. doi:10.1007/978-3-319-10831-5_1
Chinthala S, Mande R, Manne S, Vemuri S (2015) Sentiment analysis on twitter streaming data. In: Emerging ICT for bridging the future-proceedings of the 49th annual convention of the computer society of india (CSI), vol. 1. Springer, Berlin, 161–168
Gamon M (2004) Sentiment classification on customer feedback data: noisy data, large feature vectors, and the role of linguistic analysis. In: Proceedings of the 20th international conference on Computational Linguistics (p 841). Association for Computational Linguistics. doi:10.3115/1220355.1220476
Ghiassi M, Burnley C (2010) Measuring effectiveness of a dynamic artificial neural network algorithm for classification problems. Expert Syst Appl 37(4):3118–3128. doi:10.1016/j.eswa.2009.09.017
Article Google Scholar
Ghiassi M, Saidane H, Zimbra DK (2005) A dynamic artificial neural network model for forecasting time series events. Int J Forecast 21(2):341–362. doi:10.1016/j.ijforecast.2004.10.008
Article Google Scholar
Ghiassi M, Skinner J, Zimbra D (2013) Twitter brand sentiment analysis: a hybrid system using n-gram analysis and dynamic artificial neural network. Expert Syst Appl 40(16):6266–6282. doi:10.1016/j.eswa.2013.05.057
Article Google Scholar
Haddi E, Liu X, Shi Y (2013) The role of text pre-processing in sentiment analysis. Procedia Comput Sci 17:26–32. doi:10.1016/j.procs.2013.05.005
Article Google Scholar
Hu M, Liu B (2004) Mining and summarizing customer reviews. In: Proceedings of the tenth ACM SIGKDD international conference on knowledge discovery and data mining. ACM, pp 168–177. doi:10.1145/1014052.1014073
Jeragh M, AlQuraishi E, AlDwaisan E (2012) A twitter-based weighted reputation system. Procedia Comput Sci 10:902–908. doi:10.1016/j.procs.2012.06.119
Article Google Scholar
Jmal J, Faiz R (2013) Customer review summarization approach using Twitter and SentiWordNet. In: Proceedings of the 3rd international conference on web intelligence, mining and semantics. ACM. doi:10.1145/2479787.2479813
Joachims T (1999) Making large-scale support vector machine learning practical. In: Advances in kernel methods. MIT Press, Cambridge, pp 169–184
Khan FH, Bashir S, Qamar U (2014) TOM: twitter opinion mining framework using hybrid classification scheme. Decis Support Syst 57:245–257. doi:10.1016/j.dss.2013.09.004
Article Google Scholar
Kouloumpis E, Wilson T, Moore JD (2011) Twitter sentiment analysis: the good the bad and the OMG! In: Fifth international AAAI conference on weblogs and social media. Association for the advancement of artificial intelligence
Kushal D, Steve L, David MP (2003) Mining the peanut gallery: opinion extraction and semantic classification of product reviews. In: WWW ‘03 proceedings of the 12th international conference on World Wide Web. ACM, New York, 519–528. doi:10.1145/775152.775226
Lehman A, O’Rourke N, Hatcher L, Stepanski E (2013) MP for basic univariate and multivariate statistics: methods for researchers and social scientists, 2 ed.. SAS Institute
Liu B (2010) Sentiment analysis and subjectivity. In: Handbook of natural language processing, pp 627–666
Liu B (2014, 6 25). Retrieved from List of words: http://www.cs.uic.edu/~liub/FBS/opinion-lexicon-English.rar
Liu B, Li X, Lee WS, Yu PS (2004) Text classification by labeling words. In: AAAI’04 proceedings of the 19th national conference on artifical intelligence. AAAI Press, California, pp 425–430
Lulin Z, Yi Z, Ning Z (2011) A weighted multi-factor algorithm for microblog search. In: 7th international conference, AMT 2011. Springer, Lanzhou, pp 153–161. doi:10.1007/978-3-642-23620-4_19
McDonald B (2014, 3 26) Bill McDonald word lists page. Retrieved from http://www3.nd.edu/~mcdonald/Word_Lists.html
Micosoft (2010) Office 2010. Retrieved from https://products.office.com/en-us/office-2010
Microsoft (2010) Visual studio 2010. Retrieved 09 10, 2013, from https://msdn.microsoft.com/en-us/library/dd831853(v=vs.100).aspx
Montejo-Ráez A, Martínez-Cámara E, Martín-Valdivia MT, Ureña-López LA (2014) Ranked WordNet graph for sentiment polarity classification in twitter. Comput Speech Lang 28(1):93–107. doi:10.1016/j.csl.2013.04.001
Article Google Scholar
Moreo A, Castro JL, Zurita JM (2012) Handling context in Lexicon-based sentiment analysis. In: Advances in computational intelligence. Springer: Berlin, pp 245–254
Ortigosa A, Martín JM, Carro RM (2014) Sentiment analysis in facebook and its application to e-learning. Comput Hum Behav 31:527–541. doi:10.1016/j.chb.2013.05.024
Article Google Scholar
Pang B, Lee L, Vaithyanathan S (2002) Thumbs up? Sentiment classification using machine learning techniques. In: Proceedings of the ACL-02 conference on empirical methods in natural language processing, 10. Association for Computational Linguistics Stroudsburg, PA, USA, pp 79–86. doi:10.3115/1118693.1118704
Peter K, Marián Š (2014) Sentiment analysis on microblog utilizing appraisal theory. World Wide Web 17(4):847–867. doi:10.1007/s11280-013-0247-z
Article Google Scholar
Prokhorov A (2011) Kendall coefficient of rank correlation. (Encyclopedia of Mathematics), http://www.encyclopediaofmath.org/index.php?title=Kendall_coefficient_of_rank_correlation&oldid=13189
Read J (2005) Using emoticons to reduce dependency in machine learning techniques for sentiment classification. In: Proceedings of the ACL student research workshop. Association for Computational Linguistics, Michigan, pp 43–48
Sanders NJ (2011) Twitter sentiment Corpus. Retrieved 6 3, 2014, from Sanders Analytics: http://www.sananalytics.com/lab/twitter-sentiment/sanders-twitter-0.2.zip
Sommer S, Schieber A, Hilbert A, Heinrich K (2011) Analyzing customer sentiments in microblogs: a topic-model based approach for Twitter datasets. In: Proceedings of the Americas conference on information systems (AMCIS)
Tang H, Tan S, Cheng X (2009) A survey on sentiment detection of reviews. Expert Syst Appl 36(7):10760–10773. doi:10.1016/j.eswa.2009.02.063
Article Google Scholar
Turney P (2001) Mining the web for synonyms: PMI-IR versus LSA on TOEFL. In: 12th European conference on machine learning. Springer, Berlin, pp 491–502. doi:10.1007/3-540-44795-4_42
Tweetinvi (2014) Retrieved 05 15, 2014, from https://tweetinvi.codeplex.com/
Twitter I (2014, 5 12) Twitter REST API. Retrieved from https://dev.twitter.com/docs/api/1.1
Wessa P (2012). Spearman rank correlation (v1.0.1) in free statistics software (v1.1.23-r7). (Office for Research Development and Education) Retrieved 06 15, 2014, from http://www.wessa.net/rwasp_spearman.wasp/
Wessa (2012) Kendall tau rank correlation (v1.0.11) in free statistics software (v1.1.23-r7). (Office for Research Development and Education) Retrieved 05 28, 2014, from http://www.wessa.net/rwasp_kendall.wasp/
Yang Y, Pedersen JO (1997) A comparative study on feature selection in text categorization. In: ICML ‘97 proceedings of the fourteenth international conference on machine learning, 97, pp 412–420

Download references

Author information

Authors and Affiliations

College of Electrical and Mechanical Engineering, National University of Sciences and Technology, Islamabad, Pakistan
Asad Bukhari & Usman Qamar
School of Electrical Engineering and Computer Sciences, National University of Sciences and Technology, Islamabad, Pakistan
Ume Ghazia

Authors

Asad Bukhari
View author publications
You can also search for this author in PubMed Google Scholar
Usman Qamar
View author publications
You can also search for this author in PubMed Google Scholar
Ume Ghazia
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Asad Bukhari.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Bukhari, A., Qamar, U. & Ghazia, U. URWF: user reputation based weightage framework for twitter micropost classification. Inf Syst E-Bus Manage 15, 623–659 (2017). https://doi.org/10.1007/s10257-016-0320-0

Download citation

Received: 11 September 2014
Revised: 10 December 2015
Accepted: 21 May 2016
Published: 08 June 2016
Issue Date: August 2017
DOI: https://doi.org/10.1007/s10257-016-0320-0

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

URWF: user reputation based weightage framework for twitter micropost classification

Abstract

Access this article

Similar content being viewed by others

NBA All-Star Prediction Using Twitter Sentiment Analysis

Sentiment Analysis of Smartphone Product Reviews Using Weightage Calculation

Machine Learning Techniques to Determine the Polarity of Messages on Social Networks

Notes

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

URWF: user reputation based weightage framework for twitter micropost classification

Abstract

Access this article

Similar content being viewed by others

NBA All-Star Prediction Using Twitter Sentiment Analysis

Sentiment Analysis of Smartphone Product Reviews Using Weightage Calculation

Machine Learning Techniques to Determine the Polarity of Messages on Social Networks

Notes

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation