Abstract
The present study aims to create a framework that analyses user posts related to a product of interest on social networking platforms. More precisely, by applying information mining techniques, posts are categorised according to the intention they express, the sentiment polarisation, and the type of opinion. The model operates based on linguistic rules, machine learning, and combinations. Six different methodologies are implemented to extract intent, sentiment, and type of opinion from a tweet. The final model automatically detects intention to buy or not to buy the product, intention to compare the product with other competitors, and finally, intention to search for information about the product. It then categorises the text according to the sentiment and depending on their expressed opinion. The dataset comprises tweets for each day of the iPhone 5’s life cycle, corresponding to 365 days. Additionally, it demonstrated that the business’s external or internal decisions affect the public purchasing audience’s opinions, sentiments, and intentions expressed on social media. Lastly, as a Business Intelligence tool, the framework recognises and analyses these points, which contribute substantially to the company’s decision-making through the findings.



















Similar content being viewed by others
References
Alfonseca E, Filippova K, Delort J, et al. (2012) Pattern learning for relation extraction with a hierarchical topic model. In: The 50th annual meeting of the association for computational linguistics, proceedings of the conference, July 8–14, 2012, Jeju Island, Korea-Vol 2: Short Papers. The Association for Computer Linguistics, pp 54–59. https://aclanthology.org/P12-2011/
Bagozzi RP (2010) Consumer intentions. Wiley, New York. https://doi.org/10.1002/9781444316568.wiem03057
Becker W, Schmid O (2020) The right digital strategy for your business: an empirical analysis of the design and implementation of digital strategies in smes and lses. Bus Res 13(3):985–1005
Blythe M, Cairns PA (2009) Critical methods and user generated content: the iphone on youtube. In: Jr. DRO, Arthur RB, Hinckley K, et al (eds) Proceedings of the 27th international conference on human factors in computing systems, CHI 2009, Boston, MA, USA, April 4–9, 2009. ACM, pp 1467–1476, https://doi.org/10.1145/1518701.1518923
Carlos CS, Yalamanchi M (2012) Intention analysis for sales, marketing and customer service. In: Kay M, Boitet C (eds) COLING 2012, 24th international conference on computational linguistics, proceedings of the conference: demonstration papers, 8–15 December 2012, Mumbai, India. Indian Institute of Technology Bombay, pp 33–40. https://www.aclweb.org/anthology/C12-3005/
Chatterjee S (2019) Explaining customer ratings and recommendations by combining qualitative and quantitative user generated contents. Decis Support Syst 119:14–22. https://doi.org/10.1016/j.dss.2019.02.008
Chatterjee S, Krystyanczuk M (2017) Python social media analytics. Packt Publishing Ltd, London
Chen Z, Liu B, Hsu M, et al (2013) Identifying intention posts in discussion forums. In: Vanderwende L, III HD, Kirchhoff K (eds) Human language technologies: conference of the North American chapter of the association of computational linguistics, proceedings, June 9–14, 2013, Westin Peachtree Plaza Hotel, Atlanta, Georgia, USA. The Association for Computational Linguistics, pp 1041–1050. https://www.aclweb.org/anthology/N13-1124/
Choi J, Yoon J, Chung J et al (2020) Social media analytics and business intelligence research: a systematic review. Inf Process Manag 57(6):102279. https://doi.org/10.1016/j.ipm.2020.102279
Ding X, Liu T, Duan J, et al (2015) Mining user consumption intention from social media using domain adaptive convolutional neural network. In: Bonet B, Koenig S (eds) Proceedings of the twenty-ninth AAAI conference on artificial intelligence, January 25–30, 2015, Austin, Texas, USA. AAAI Press, pp 2389–2395. http://www.aaai.org/ocs/index.php/AAAI/AAAI15/paper/view/9748
Eckerson WW (2010) Performance dashboards: measuring, monitoring, and managing your business. Wiley, London
Faulds DJ, Mangold WG, Raju P et al (2018) The mobile shopping revolution: redefining the consumer decision process. Bus Horiz 61(2):323–338. https://doi.org/10.1016/j.bushor.2017.11.012
Felt M (2016) Social media and the social sciences: How researchers employ big data analytics. Big Data Soc 3(1):2053951716645,828. https://doi.org/10.1177/2053951716645828
Gao W, Sebastiani F (2016) From classification to quantification in tweet sentiment analysis. Soc Netw Anal Min 6(1):1–22
Giachanou A, Crestani F (2016) Like it or not: a survey of twitter sentiment analysis methods. ACM Comput Surv 49(2):28:1-28:41. https://doi.org/10.1145/2938640
Gupta V, Varshney D, Jhamtani H, et al (2014) Identifying purchase intent from social posts. In: Adar E, Resnick P, Choudhury MD, et al (eds) Proceedings of the eighth international conference on weblogs and social media, ICWSM 2014, Ann Arbor, Michigan, USA, June 1–4, 2014. The AAAI Press. http://www.aaai.org/ocs/index.php/ICWSM/ICWSM14/paper/view/8037
Guzman E, Maalej W (2014) How do users like this feature? A fine grained sentiment analysis of app reviews. In: Gorschek T, Lutz RR (eds) IEEE 22nd international requirements engineering conference, RE 2014, Karlskrona, Sweden, August 25–29, 2014. IEEE Computer Society, pp 153–162, 10.1109/RE.2014.6912257,
Hamroun M, Gouider MS, Said LB (2016) Large scale microblogging intentions analysis with pattern based approach. In: Howlett RJ, Jain LC, Gabrys B, et al (eds) Knowledge-based and intelligent information and engineering systems: proceedings of the 20th international conference KES-2016, York, UK, 5–7 September 2016, Procedia computer science, vol 96. Elsevier, pp 1249–1257. https://doi.org/10.1016/j.procs.2016.08.169
He W, Wu H, Yan G et al (2015) A novel social media competitive analytics framework with sentiment benchmarks. Inf Manag 52(7):801–812. https://doi.org/10.1016/j.im.2015.04.006
Hollerit B, Kröll M, Strohmaier M (2013) Towards linking buyers and sellers: detecting commercial intent on twitter. In: Carr L, Laender AHF, Lóscio BF, et al (eds) 22nd international world wide web conference, WWW ’13, Rio de Janeiro, Brazil, May 13–17, 2013, Companion Volume. International World Wide Web Conferences Steering Committee / ACM, pp 629–632. https://doi.org/10.1145/2487788.2488009
Hung C, Lin H (2013) Using objective words in sentiwordnet to improve word-of-mouth sentiment classification. IEEE Intell Syst 28(2):47–54. https://doi.org/10.1109/MIS.2013.1
Hutto CJ, Gilbert E (2014) VADER: a parsimonious rule-based model for sentiment analysis of social media text. In: Adar E, Resnick P, Choudhury MD, et al (eds) Proceedings of the eighth international conference on weblogs and social media, ICWSM 2014, Ann Arbor, Michigan, USA, June 1–4, 2014. The AAAI Press. http://www.aaai.org/ocs/index.php/ICWSM/ICWSM14/paper/view/8109
Jhamtani H, Chhaya N, Karwa S, et al (2015) Identifying suggestions for improvement of product features from online product reviews. In: Liu T, Scollon CN, Zhu W (eds) Social informatics-7th international conference, SocInfo 2015, Beijing, China, December 9–12, 2015, Proceedings, Lecture Notes in Computer Science, vol 9471. Springer, pp 112–119. https://doi.org/10.1007/978-3-319-27433-1_8
Jindal N, Liu B (2006) Identifying comparative sentences in text documents. In: Efthimiadis EN, Dumais ST, Hawking D, et al (eds) SIGIR 2006: Proceedings of the 29th annual international ACM SIGIR conference on research and development in information retrieval, Seattle, Washington, USA, August 6–11, 2006. ACM, pp 244–251. https://doi.org/10.1145/1148170.1148215
Kalamatianos G, Symeonidis S, Mallis D et al (2018) Towards the creation of an emotion lexicon for microblogging. J Syst Inf Technol 20(2):130–151. https://doi.org/10.1108/JSIT-06-2017-0040
Kim Y, Dwivedi R, Zhang J et al (2016) Competitive intelligence in social media twitter: iphone 6 vs. galaxy S5. Online Inf Rev 40(1):42–61. https://doi.org/10.1108/OIR-03-2015-0068
Kumar N, Nagalla R, Marwah T, et al (2019) Sentiment dynamics in social media news channels. CoRR abs/1908.08147. arXiv:1908.08147
Kurnia PF, Suharjito (2018) Business intelligence model to analyze social media information. Procedia Comput Sci 135:5–14. https://doi.org/10.1016/j.procs.2018.08.144 (The 3rd International Conference on Computer Science and Computational Intelligence (ICCSCI 2018) : Empowering Smart Technology in Digital Era for a Bette Life)
Ladhari R, Michaud M (2015) ewom effects on hotel booking intentions, attitudes, trust, and website perceptions. Int J Hosp Manag 46:36–45. https://doi.org/10.1016/j.ijhm.2015.01.010
Li N, Wu DD (2010) Using text mining and sentiment analysis for online forums hotspot detection and forecast. Decis Support Syst 48(2):354–368. https://doi.org/10.1016/j.dss.2009.09.003
Liu B (2010) Sentiment analysis and subjectivity. In: Indurkhya N, Damerau FJ (eds) Handbook of natural language processing, 2nd edn. Chapman and Hall/CRC, New York, pp 627–666. https://doi.org/10.1201/9781420085938-c26
Liu X, Burns AC, Hou Y (2017) An investigation of brand-related user-generated content on twitter. J Advert 46(2):236–247. https://doi.org/10.1080/00913367.2017.1297273
Liu Y, Bi J, Fan Z (2017) Ranking products through online reviews: a method based on sentiment analysis technique and intuitionistic fuzzy set theory. Inf Fusion 36:149–161. https://doi.org/10.1016/j.inffus.2016.11.012
Lumezanu C, Feamster N (2012) Observing common spam in twitter and email. In: Proceedings of the 2012 internet measurement conference. Association for Computing Machinery, New York, NY, USA, IMC ’12, pp 461–466. https://doi.org/10.1145/2398776.2398824
Manning CD, Surdeanu M, Bauer J, et al (2014) The stanford corenlp natural language processing toolkit. In: Proceedings of the 52nd annual meeting of the association for computational linguistics, ACL 2014, June 22–27, 2014, Baltimore, MD, USA, System Demonstrations. The Association for Computer Linguistics, pp 55–60, https://doi.org/10.3115/v1/p14-5010
McKinney W, et al (2010) Data structures for statistical computing in python. In: Proceedings of the 9th Python in Science Conference, Austin, TX, pp 51–56
Montoyo A, Martínez-Barco P, Balahur A (2012) Subjectivity and sentiment analysis: an overview of the current state of the area and envisaged developments. Decis Support Syst 53(4):675–679. https://doi.org/10.1016/j.dss.2012.05.022
Morris MR, Teevan J, Panovich K (2010) A comparison of information seeking using search engines and social networks. In: Cohen WW, Gosling S (eds) Proceedings of the fourth international conference on weblogs and social media, ICWSM 2010, Washington, DC, USA, May 23–26, 2010. The AAAI Press. http://www.aaai.org/ocs/index.php/ICWSM/ICWSM10/paper/view/1518
Nashaat M, Ghosh A, Miller J, et al (2018) Hybridization of active learning and data programming for labeling large industrial datasets. In: IEEE international conference on big data (IEEE BigData 2018), Seattle, WA, USA, December 10–13, 2018. IEEE, pp 46–55, https://doi.org/10.1109/BigData.2018.8622459
Negash S, Gray P (2003) Business intelligence. In: 9th Americas conference on information systems, AMCIS 2003, Tampa, FL, USA, August 4–6, 2003. Association for Information Systems, p 423, http://aisel.aisnet.org/amcis2003/423
Paul SA, Hong L, Chi EH (2011) Is twitter a good place for asking questions? A characterization study. In: Adamic LA, Baeza-Yates R, Counts S (eds) Proceedings of the fifth international conference on weblogs and social media, Barcelona, Catalonia, Spain, July 17–21, 2011. The AAAI Press. http://www.aaai.org/ocs/index.php/ICWSM/ICWSM11/paper/view/2813
Poecze F, Ebster C, Strauss C (2018) Social media metrics and sentiment analysis to evaluate the effectiveness of social media posts. In: Shakshuki EM, Yasar A (eds) The 9th international conference on ambient systems, networks and technologies (ANT 2018) / The 8th International Conference on Sustainable Energy Information Technology (SEIT 2018) / Affiliated Workshops, May 8–11, 2018, Porto, Portugal, Procedia Computer Science, vol 130. Elsevier, pp 660–666. https://doi.org/10.1016/j.procs.2018.04.117
Purohit H, Dong G, Shalin VL, et al (2015) Intent classification of short-text on social media. In: 2015 IEEE international conference on Smart City/SocialCom/SustainCom 2015, Chengdu, China, December 19–21, 2015. IEEE Computer Society, pp 222–228. https://doi.org/10.1109/SmartCity.2015.75
Ramanand J, Bhavsar K, Pedanekar N (2010) Wishful thinking-finding suggestions and ’buy’ wishes from product reviews. In: Proceedings of the NAACL HLT 2010 workshop on computational approaches to analysis and generation of emotion in text. Association for Computational Linguistics, Los Angeles, CA, pp 54–61. https://www.aclweb.org/anthology/W10-0207
Ratner A, Bach SH, Ehrenberg HR et al (2020) Snorkel: rapid training data creation with weak supervision. VLDB J 29(2–3):709–730. https://doi.org/10.1007/s00778-019-00552-1
Reeves M, Deimler MS (2009) Strategies for winning in the current and post-recession environment. Strategy Leadersh 2:1158
Rui H, Liu Y, Whinston AB (2013) Whose and what chatter matters? the effect of tweets on movie sales. Decis Support Syst 55(4):863–870. https://doi.org/10.1016/j.dss.2012.12.022
Sahoo SR, Gupta BB (2019) Hybrid approach for detection of malicious profiles in twitter. Comput Electr Eng 76:65–81. https://doi.org/10.1016/j.compeleceng.2019.03.003
Saif H, Fernández M, He Y, et al (2013) Evaluation datasets for twitter sentiment analysis: a survey and a new dataset, the sts-gold. In: Battaglino C, Bosco C, Cambria E, et al (eds) Proceedings of the first international workshop on emotion and sentiment in social and expressive media: approaches and perspectives from AI (ESSEM 2013) A workshop of the XIII International Conference of the Italian Association for Artificial Intelligence (AI*IA 2013), Turin, Italy, December 3, 2013, CEUR Workshop Proceedings, vol 1096. CEUR-WS.org, pp 9–21. http://ceur-ws.org/Vol-1096/paper1.pdf
Salehan M, Kim DJ (2016) Predicting the performance of online consumer reviews: a sentiment mining approach to big data analytics. Decis Support Syst 81:30–40. https://doi.org/10.1016/j.dss.2015.10.006
Sharma N (2013) Marketing strategy on different stages plc and its marketing implications on fmcg products. Int J Mark Financ Serv Manag Res 2(3):121–136
Shukri SE, Yaghi RI, Aljarah I, et al (2015) Twitter sentiment analysis: A case study in the automotive industry. In: 2015 IEEE Jordan conference on applied electrical engineering and computing technologies (AEECT), pp 1–5. https://doi.org/10.1109/AEECT.2015.7360594
Smith AN, Fischer E, Yongjian C (2012) How does brand-related user-generated content differ across Youtube, Facebook, and Twitter? J Interact Mark 26(2):102–113. https://doi.org/10.1016/j.intmar.2012.01.002
Stolcke A, Ries K, Coccaro N, et al (2000) Dialogue act modeling for automatic tagging and recognition of conversational speech. CoRR cs.CL/0006023. https://arxiv.org/abs/cs/0006023
Sun X, Zhang C, Li G et al (2018) Detecting users’ anomalous emotion using social media for business intelligence. J Comput Sci 25:193–200. https://doi.org/10.1016/j.jocs.2017.05.029
Symeonidis S, Effrosynidis D, Arampatzis A (2018) A comparative evaluation of pre-processing techniques and their interactions for twitter sentiment analysis. Expert Syst Appl 110:298–310. https://doi.org/10.1016/j.eswa.2018.06.022
Tsou M, Zhang H, Jung C (2017) Identifying data noises, user biases, and system errors in geo-tagged twitter messages (tweets). CoRR abs/1712.02433. arXiv:1712.02433
Tutunea MF, Rus RV (2012) Business intelligence solutions for sme’s. Procedia Econ Finance 3:865–870. https://doi.org/10.1016/S2212-5671(12)00242-0 (International Conference Emerging Markets Queries in Finance and Business, Petru Maior University of Tîrgu-Mures, ROMANIA, October 24th - 27th, 2012)
Varma P, Ré C (2018) Snuba: automating weak supervision to label training data. Proc VLDB Endow 12(3):223–236. https://doi.org/10.14778/3291264.3291268
Vidya NA, Fanany MI, Budi I (2015) Twitter sentiment to analyze net brand reputation of mobile phone providers. Procedia Comput Sci 72:519–526. https://doi.org/10.1016/j.procs.2015.12.159 (The Third Information Systems International Conference 2015)
Wang L, Yan J, Lin J et al (2017) Let the users tell the truth: self-disclosure intention and self-disclosure honesty in mobile social networking. Int J Inf Manag 37(1):1428–1440. https://doi.org/10.1016/j.ijinfomgt.2016.10.006
Watson HJ, Wixom BH (2007) The current state of business intelligence. Computer 40(9):96–99. https://doi.org/10.1109/MC.2007.331
Wiecek-Janka E, Papierz M, Kornecka M, et al (2017) Apple products: A discussion of the product life cycle. In: 4th international conference on management science and management innovation, pp 159–164
Wu T, Khan FM, Fisher TA, et al (2005) Posting act tagging using transformation-based learning. In: Lin TY, Ohsuga S, Liau C, et al (eds) Foundations of data mining and knowledge discovery, studies in computational intelligence, vol 6. Springer, pp 319–331. https://doi.org/10.1007/11498186_18
Wyrwoll C (2014) Social media-fundamentals, models, and ranking of user-generated content. Springer, Berlin. https://doi.org/10.1007/978-3-658-06984-1
Xu K, Liao SS, Li J et al (2011) Mining comparative opinions from customer reviews for competitive intelligence. Decis Support Syst 50(4):743–754. https://doi.org/10.1016/j.dss.2010.08.021
Xu X, Wang X, Li Y et al (2017) Business intelligence in online customer textual reviews: understanding consumer perceptions and influential factors. Int J Inf Manag 37(6):673–683. https://doi.org/10.1016/j.ijinfomgt.2017.06.004
Yao Y, Sun A (2016) Mobile phone name extraction from internet forums: a semi-supervised approach. World Wide Web 19(5):783–805. https://doi.org/10.1007/s11280-015-0361-1
Zhao Z, Mei Q (2013) Questions about questions: an empirical analysis of information needs on twitter. In: Schwabe D, Almeida VAF, Glaser H, et al (eds) 22nd international world wide web conference, WWW ’13, Rio de Janeiro, Brazil, May 13-17, 2013. International World Wide Web Conferences Steering Committee / ACM, pp 1545–1556. https://doi.org/10.1145/2488388.2488523
Zheng ZE, Fader PS, Padmanabhan B (2012) From business intelligence to competitive intelligence: inferring competitive measures using augmented site-centric data. Inf Syst Res 23(3–1):698–720. https://doi.org/10.1287/isre.1110.0385
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Symeonidis, S., Peikos, G. & Arampatzis, A. Unsupervised consumer intention and sentiment mining from microblogging data as a business intelligence tool. Oper Res Int J 22, 6007–6036 (2022). https://doi.org/10.1007/s12351-022-00714-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12351-022-00714-0