Abstract
Writers tend to follow a characteristic style that an appropriate algorithm can detect, or at least sketch. Newspaper columnists, being writers themselves, also follow a specific style. Style tends to be stable once writers reach maturity, but it can change when internal or external circumstances change. Here, we apply a bag-of-words approach to approximate the style of several journalists working for Mexican newspapers, and we track their style over a long period of time with the aim of detecting changes when external circumstances, in particular political ones, change. This setting gave us an environment for detecting variations in stylomics that is the closest we can get to an experiment. In particular, we collected hundreds of writings by ten Mexican columnists from different newspapers, both before and after the 2018 Mexican presidential elections. We processed these documents with different supervised and unsupervised learning algorithms, such as random forest, principal component analysis, and k-means. Likewise, we implemented different validation procedures. As a result, we detected that the style of all the studied columnists underwent tangible changes in the frequency of use of some particular words, notably at specific times, some of which may be related to the 2018 Mexican presidential elections.
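The bag-of-words idea mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: the function names (`bag_of_words`, `mean_profile`, `frequency_shift`) and the toy columns are hypothetical, and the sketch only compares average relative word frequencies before and after a cutoff date, without the random forest, principal component analysis, or k-means steps.

```python
import re
from collections import Counter


def bag_of_words(text):
    """Relative word frequencies of one document; word order is ignored."""
    words = re.findall(r"\w+", text.lower())
    if not words:
        return {}
    counts = Counter(words)
    return {word: n / len(words) for word, n in counts.items()}


def mean_profile(docs):
    """Average the per-document frequencies over a set of documents."""
    profile = Counter()
    for doc in docs:
        for word, freq in bag_of_words(doc).items():
            profile[word] += freq / len(docs)
    return dict(profile)


def frequency_shift(before_docs, after_docs):
    """Change in mean relative frequency of each word (after minus before)."""
    before = mean_profile(before_docs)
    after = mean_profile(after_docs)
    vocabulary = set(before) | set(after)
    return {w: after.get(w, 0.0) - before.get(w, 0.0) for w in vocabulary}


# Hypothetical toy columns by one author, split at an assumed cutoff date.
before = ["el gobierno promete reformas", "el gobierno anuncia reformas"]
after = ["el presidente promete cambios", "el presidente anuncia cambios"]

shift = frequency_shift(before, after)
# Words whose usage changed the most, in either direction.
largest = sorted(shift, key=lambda w: abs(shift[w]), reverse=True)[:4]
```

In this toy case the words that shift are the ones that appear only on one side of the cutoff, while words used equally in both periods show no change. A study like the one described would feed such frequency vectors to classifiers and clustering methods rather than inspecting them directly.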
Acknowledgements
This work was partially supported by UNAM-PAPIIT IA103420. AN and EMMR thank SNI CONACyT.
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Escobar, R., Juarez, L., Molino-Minero-Re, E., Neme, A. (2020). An Algorithm to Detect Variations in Writing Styles of Columnists After Major Political Changes. In: Martínez-Villaseñor, L., Herrera-Alcántara, O., Ponce, H., Castro-Espinoza, F.A. (eds) Advances in Computational Intelligence. MICAI 2020. Lecture Notes in Computer Science, vol 12469. Springer, Cham. https://doi.org/10.1007/978-3-030-60887-3_1
DOI: https://doi.org/10.1007/978-3-030-60887-3_1
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-60886-6
Online ISBN: 978-3-030-60887-3
eBook Packages: Computer Science, Computer Science (R0)