Analysis of the Structured Information for Subjectivity Detection in Twitter

Sixto, Juan; Almeida, Aitor; López-de-Ipiña, Diego

doi:10.1007/978-3-319-90287-6_9

Juan Sixto¹⁵,
Aitor Almeida¹⁵ &
Diego López-de-Ipiña¹⁵

Part of the book series: Lecture Notes in Computer Science ((TCCI,volume 10840))

458 Accesses
4 Citations

Abstract

In this paper, we analyze the opportunities of the structured information of the social networks for the subjectivity detection on Twitter micro texts. The sentiment analysis on Twitter has been usually performed through the automatic processing of the texts. However, the established limit of 140 characters and the particular characteristics of the texts reduce drastically the accuracy of Natural Language Processing (NLP) techniques when compared with other domains. Under these circumstances, it becomes necessary to study new data sources that allow us to extract new useful knowledge to represent and classify the texts. The structured information, also called meta-information or meta-data, provide us with alternative features of the texts that can improve the classification tasks. In this paper we analyze the features of the structured information and their usefulness in the opinion mining sub-domain, specially in the subjectivity detection task. Also present a novel classification of these features according to their origin.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
http://www.sepln.org/.
2.
@sanchezcastejon is a Spanish politician of the Spanish Socialist Workers’ Party (PSOE) and @cospedal is a Spanish politician of the People’s Party (PP).
3.
Workshop on Sentiment Analysis at SEPLN Conference.
4.
Extensible Markup Language.
5.
https://dev.twitter.com/rest/public.

References

Alonso, M.A., Vilares, D.: A review on political analysis and social media. Procesamiento del Lenguaje Natural 56, 13–24 (2016)
Google Scholar
Barbosa, L., Feng, J.: Robust sentiment detection on Twitter from biased and noisy data. In: Proceedings of 23rd International Conference on Computational Linguistics: Posters, pp. 36–44 (2010)
Google Scholar
Belkaroui, R., Faiz, R.: Towards events tweet contextualization using social influence model and users conversations. In: Proceedings of 5th International Conference on Web Intelligence, Mining and Semantics, p. 3. ACM (2015)
Google Scholar
Bermingham, A., Smeaton, A. F.: On using Twitter to monitor political sentiment and predict election results (2011)
Google Scholar
Bosco, C., Patti, V., Bolioli, A.: Developing corpora for sentiment analysis: the case of irony and senti-TUT. IEEE Intell. Syst. 28(2), 55–63 (2013)
Article Google Scholar
Cerón-Guzmán, J.A.: JACERONG at TASS 2016: an ensemble classifier for sentiment analysis of Spanish tweets at global level. In: Proceedings of TASS 2016: Workshop on Sentiment Analysis at SEPLN co-located with 32nd SEPLN Conference (SEPLN 2016), pp. 35–39 (2016)
Google Scholar
Cotelo, J.M., Cruz, F., Ortega, F.J., Troyano, J.A.: Explorando Twitter mediante la integración de información estructurada y no estructurada. Procesamiento del Lenguaje Natural 55, 75–82 (2015)
Google Scholar
Cui, A., Zhang, M., Liu, Y., Ma, S.: Emotion tokens: bridging the gap among multilingual Twitter sentiment analysis. In: Salem, M.V.M., Shaalan, K., Oroumchian, F., Shakery, A., Khelalfa, H. (eds.) AIRS 2011. LNCS, vol. 7097, pp. 238–249. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-25631-8_22
Chapter Google Scholar
Davidov, D., Tsur, O., Rappoport, A.: Enhanced sentiment learning using twitter hashtags and smileys. In: Proceedings of 23rd International Conference on Computational Linguistics: Posters (2010)
Google Scholar
De Choudhury, M., Gamon, M., Counts, S., Horvitz, E.: Predicting depression via social media. In: ICWSM, p. 2 (2013)
Google Scholar
Esparza, S.G., O’Mahony, M.P., Smyth, B.: Mining the real-time web: a novel approach to product recommendation. Knowl.-Based Syst. 29, 3–11 (2012)
Article Google Scholar
Friedman, J.H.: Greedy function approximation: a gradient boosting machine. Ann. Stat. 1189–1232 (2001)
Google Scholar
Giorgis, S., Rousas, A., Pavlopoulos, J., Malakasiotis, P., Androutsopoulos, I.: aueb.twitter.sentiment at SemEval-2016 task 4: a weighted ensemble of SVMs for Twitter sentiment analysis. In: Proceedings of SemEval, pp. 96–99 (2016)
Google Scholar
Go, A., Bhayani, R., Huang, L.: Twitter sentiment classification using distant supervision. CS224N Project Report, Stanford, vol. 1, no. 12 (2009)
Google Scholar
Han, B., Cook, P., Baldwin, T.: Unimelb: Spanish text normalisation. In: Tweet-Norm@SEPLN, pp. 32–36 (2013)
Google Scholar
Harris, Z.S.: Distributional structure. Word 10(2–3), 146–162 (1954)
Article Google Scholar
Hurtado, L.F., Pla, F., Buscaldi, D.: ELiRF-UPV en TASS 2015: Análisis de Sentimientos en Twitter. In: Proceedings of TASS 2015: Workshop on Sentiment Analysis at SEPLN Co-located with 31st SEPLN Conference (SEPLN 2015) (2015)
Google Scholar
Jeni, L.A., Cohn, J.F., De La Torre, F.: Facing imbalanced data-recommendations for the use of performance metrics. In: 2013 Humaine Association Conference on Affective Computing and Intelligent Interaction (ACII), pp. 245–251. IEEE (2013)
Google Scholar
Jiang, L., Yu, M., Zhou, M., Liu, X., Zhao, T.: Target-dependent twitter sentiment classification. In: Proceedings of 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, vol. 1, pp. 151–160 (2011)
Google Scholar
Joshi, M.V., Agarwal, M.C., Kumar, V.: Mining needle in a haystack: classifying rare classes via two-phase rule induction. ACM SIGMOD Rec. 30(2), 91–102 (2001)
Article Google Scholar
Liu, B.: Sentiment analysis and opinion mining. Synth. Lect. Hum. Lang. Technol. 5(1), 1–167 (2012)
Article Google Scholar
Liu, B., Zhang, L.: A survey of opinion mining and sentiment analysis. In: Aggarwal, C., Zhai, C. (eds.) Mining Text Data, pp. 415–463. Springer, Boston (2012). https://doi.org/10.1007/978-1-4614-3223-4_13
Chapter Google Scholar
Liu, X.Y., Wu, J., Zhou, Z.H.: Exploratory undersampling for class-imbalance learning. IEEE Trans. Syst. Man Cybern. Part B (Cybern.) 39(2), 539–550 (2009)
Article Google Scholar
Martínez-Cámara, E., Martín-Valdivia, M.T., Ureña-López, L.A., Montejo-Ráez, A.R.: Sentiment analysis in Twitter. Nat. Lang. Eng. 20(01), 1–28 (2014)
Article Google Scholar
Martínez-Cámara, E., Gutiérrez-Vázquez, Y., Fernández, J., Montejo-Ráez, A., Muñoz-Guillena, R.: Ensemble classifier for Twitter sentiment analysis (2015)
Google Scholar
Martínez-Cámara, E., Martín-Valdivia, M.T., Ureña López, L.A., Mitkov, R.: Polarity classification for Spanish tweets using the COST corpus. J. Inf. Sci. 41(3), 263–272 (1015)
Article Google Scholar
Medhat, W., Hassan, A., Korashy, H.: Sentiment analysis algorithms and applications: a survey. Ain Shams Eng. J. 5(4), 1093–1113 (2014)
Article Google Scholar
Mejova, Y., Srinivasan, P., Boynton, B.: GOP primary season on Twitter: popular political sentiment in social media. In: Proceedings of 6th ACM International Conference on Web Search and Data Mining. ACM (2013)
Google Scholar
Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013)
Mislove, A., Lehmann, S., Ahn, Y.Y., Onnela, J.P., Rosenquist, J.N.: Understanding the demographics of Twitter users. In: ICWSM, vol. 11, no. 5 (2011)
Google Scholar
Montejo-Ráez, A., Díaz-Galiano, M.C.: Participación de SINAI en TASS 2016. In: Proceedings of TASS 2016: Workshop on Sentiment Analysis at SEPLN (2016)
Google Scholar
Monti, C., Rozza, A., Zapella, G., Zignani, M., Arvidsson, A., Colleoni, E.: Modelling political disaffection from Twitter data. In: Proceedings of 2nd International Workshop on Issues of Sentiment Discovery and Opinion Mining (WISDOM 2013) (2013)
Google Scholar
Nabil, M., Atyia, A., Aly, M.: CUFE at SemEval-2016 task 4: a gated recurrent model for sentiment classification. In: Proceedings of 10th International Workshop on Semantic Evaluation (SemEval-2016) (2016)
Google Scholar
Opitz, D., Maclin, R.: Popular ensemble methods: an empirical study. J. Artif. Intell. Res. 11, 169–198 (1999)
MATH Google Scholar
Pang, B., Lee, L.: Opinion mining and sentiment analysis. Found. Trends Inf. Retr. 2(1–2), 1–135 (2008)
Article Google Scholar
Park, S.: Sentiment classification using sociolinguistic clusters. In: Proceedings of TASS 2015: Workshop on Sentiment Analysis at SEPLN Co-located with 31st SEPLN Conference (SEPLN 2015), pp. 99–104 (2015)
Google Scholar
Pennacchiotti, M., Popescu, A.M.: A machine learning approach to Twitter user classification. ICWSM 11(1), 281–288 (2011)
Google Scholar
Porta, J., Sancho, J.L.: Word normalization in Twitter using finite-state transducers. In: Tweet-Norm@SEPLN, vol. 1086, pp. 49–53 (2013)
Google Scholar
Schapire, R.E.: A brief introduction to boosting. IJCAI 99, 1401–1406 (1999)
Google Scholar
Siordia, O.S., Guerrero, M.G., Avila, E.S.T., Jimenez, S.M., Moctezuma, D., García, E.A.V.: Sentiment analysis for Twitter: TASS 2015. In: Proceedings of TASS 2015: Workshop on Sentiment Analysis at SEPLN Co-located with 31st SEPLN Conference (SEPLN 2015) (2015)
Google Scholar
Sixto, J., Almeida, A., López-de-Ipiña, D.: An approach to subjectivity detection on Twitter using the structured information. In: International Conference on Computational Collective Intelligence, Part 1, pp. 121–130 (2016)
Chapter Google Scholar
Sixto, J., Almeida, A., López-de-Ipiña, D.: Improving the sentiment analysis process of Spanish tweets with BM25. In: Métais, E., Meziane, F., Saraee, M., Sugumaran, V., Vadera, S. (eds.) NLDB 2016. LNCS, vol. 9612, pp. 285–291. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-41754-7_26
Chapter Google Scholar
Smith, C.: DMR Twitter statistic report, Last Modified 26 Feb 2016. http://expandedramblings.com/index.php/downloads/twitter-statistic-report/. Accessed 28 Mar 2016
Ting, K.M., Witten, I.H.: Issues in stacked generalization. J. Artif. Intell. Res. (JAIR) 10, 271–289 (1999)
MATH Google Scholar
Villena-Román, J., Lana-Serrano, S., Martínez-Cámara, E., González-Cristóbal, J.C.: TASS - workshop on sentiment analysis at SEPLN. Procesamiento del Lenguaje Natural 50, 37–44 (2013)
Google Scholar
Villena-Román, J., García-Morera, J., García-Cumbreras, M.A., Martínez-Cámara, E., Martín-Valdivia, M.T., Ureã-López, L.A.: Overview of TASS 2015. In: Proceedings of TASS 2015: Workshop on Sentiment Analysis at SEPLN, vol. 1397. CEUR-WS.org (2015)
Google Scholar
Volkova, S., Wilson, T., Yarowsky, D.: Exploring demographic language variations to improve multilingual sentiment analysis in social media. In: EMNLP, pp. 1815–1827 (2013)
Google Scholar
Weiss, G.M.: Mining with rarity: a unifying framework. ACM SIGKDD Explor. Newsl. 6(1), 7–19 (2004)
Article Google Scholar
Weiss, G.M., Provost, F.: Learning when training data are costly: the effect of class distribution on tree induction. J. Artif. Intell. Res. 19, 315–354 (2003)
MATH Google Scholar
Wolpert, D.H.: Stacked generalization. Neural Netw. 5(2), 241–259 (1992)
Article Google Scholar
Pontiki, M., Galanis, D., Papageorgiou, H., Androutsopoulos, I., Manandhar, S., AL-Smadi, M., Hoste, V.: SemEval-2016 task 5: aspect based sentiment analysis. In: ProWorkshop on Semantic Evaluation (SemEval-2016) (2016)
Google Scholar

Download references

Acknowledgements

This work has been partially supported by the Spanish Ministry of Economy and Competitiveness under the project E-RMP (CSO2015-64495-R).

Author information

Authors and Affiliations

DeustoTech-Deusto Institute of Technology, Universidad de Deusto, Avenida de las Universidades 24, 48007, Bilbao, Spain
Juan Sixto, Aitor Almeida & Diego López-de-Ipiña

Authors

Juan Sixto
View author publications
You can also search for this author in PubMed Google Scholar
Aitor Almeida
View author publications
You can also search for this author in PubMed Google Scholar
Diego López-de-Ipiña
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Juan Sixto .

Editor information

Editors and Affiliations

Institute of Informatics, Wrocław University of Technology, Wrocław, Poland
Ngoc Thanh Nguyen
Swinburne University of Technology, Hawthorn, Australia
Ryszard Kowalczyk

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Sixto, J., Almeida, A., López-de-Ipiña, D. (2018). Analysis of the Structured Information for Subjectivity Detection in Twitter. In: Nguyen, N., Kowalczyk, R. (eds) Transactions on Computational Collective Intelligence XXIX. Lecture Notes in Computer Science(), vol 10840. Springer, Cham. https://doi.org/10.1007/978-3-319-90287-6_9

Download citation

DOI: https://doi.org/10.1007/978-3-319-90287-6_9
Published: 21 April 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-90286-9
Online ISBN: 978-3-319-90287-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics