Skip to main content

An Algorithm to Detect Variations in Writing Styles of Columnists After Major Political Changes

  • Conference paper
  • First Online:
  • 810 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 12469))

Abstract

Writers tend to follow a certain style that can be detected or at least sketched by an appropriate algorithm. Columnists in newspapers, being also writers, follow their specific style. The style tends to be stable once writers reach maturity, but it is subject to change when internal or external circumstances differ. Here, we apply a bag-of-words approach to approximate the style of several journalists working in Mexican newspapers, and we track their style for a long period of time with the aim of detecting changes when external circumstances, in particular political ones, change. This provided us with an environment for detecting variations in stylomics, which is the closest we can get to an experiment. In particular, we collected hundreds of writings of ten Mexican columnists from different newspapers, both previous to the Presidential Mexican elections of 2018 and posterior to it. We processed these documents on different supervised and not supervised learning algorithms, such as random forest, principal component analysis, and k-means. Likewise, we implemented different validation procedures. As a result, we detected that the style in all studied columnists suffered tangible changes in the frequency of use of some particular words, particularly at specific times, some of which may be related to the 2018 Mexican presidential elections.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Juola, P.: Authorship Attribution. NOW Press, Delft (2008)

    Google Scholar 

  2. Rocha, A.: Authorship attribution for social media forensics. IEEE Trans. Inf. Forensics Secur. 12(1), 5–33 (2017). https://doi.org/10.1109/TIFS.2016.2603960

    Article  Google Scholar 

  3. Varela, P., Justino, E., Britto, A., Bortolozzi, F.: A computational approach for authorship attribution of literary texts using sintatic features. In: 2016 International Joint Conference on Neural Networks (IJCNN), pp. 4835–4842 (2016). doi: 10.1109/IJCNN.2016.7727835

    Google Scholar 

  4. Mexico election: historic landslide victory for leftist AMLO. The Guardian. https://www.theguardian.com/world/2018/jul/02/mexico-election-leftist-amlo-set-for-historic-landslide-victory. Retrieved on 01 Jun 2020

  5. Mexican general election. https://en.wikipedia.org/wiki/2018_Mexican_general_election. Retrieved on 01 Jun 2020

  6. Flannery, N.: Political Risk Analysis: What To Expect After Mexico’s 2018 Presidential Election. https://www.forbes.com/sites/nathanielparishflannery/2018/06/26/political-risk-analysis-what-to-expect-from-mexicos-2018-presidential-election/#73ebe8685a76. Retrieved on 01 Jun 2020

  7. Laver, M., Benoit, K., Garry, J.: Extracting policy positions from political texts using words as data. Am. Political Sci. Rev. 97(2), 311–331 (2003)

    Article  Google Scholar 

  8. Kumar, S., Santosh, R.: Effective information retrieval and feature minimization technique for semantic web data. Comput. Electr. Eng. 81, 106518 (2018). https://doi.org/10.1016/j.compeleceng.2019.106518

    Article  Google Scholar 

  9. Hughes, G.F.: On the mean accuracy of statistical pattern recognizers. IEEE Trans. Inf. Theor. 14(1), 55–63 (1968)

    Article  Google Scholar 

  10. Harris, Z.: Distributional structure. Word 10(2/3), 146–62 (1954). https://doi.org/10.1080/00437956.1954.11659520

    Article  Google Scholar 

  11. Sahami, M., Dumais, S., Heckerman, D., Horvitz, E.: A Bayesian approach to filtering junk e-mail. In: AAAI’98 Workshop on Learning for Text Categorization (1988)

    Google Scholar 

  12. Ge, J., Alonso-Vazquez, M., Gretzel, U.: Sentiment analysis: a review. In: Sigala, M., Gretzel, U. (eds.) Advances in Social Media for Travel, Tourism, and Hospitality (2017)

    Google Scholar 

  13. Boughaci, D., Benmesbah, M., Zebiri, A.: An improved N-grams based model for authorship attribution. In: International Conference on Computer and Information Sciences (ICCIS), Sakaka, Saudi Arabia, pp. 1–6 (2019). doi: 10.1109/ICCISci.2019.8716391

    Google Scholar 

  14. Mikolov, T., Sutskever, I., Chen, K., Corrado, G., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems. arXiv:1310.4546 (2013)

  15. Gómez-Adorno, H., Sidorov, G., Pinto, D., Vilarino, D., Gelbukh, A.: Automatic authorship detection using textual patterns extracted from integrated syntactic graphs. Sensors 16, 1374 (2016). https://doi.org/10.3390/s16091374

    Article  Google Scholar 

  16. Shrestha, P., et al.: Convolutional neural networks for authorship attribution of short texts. In: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, vol. 2 (2017)

    Google Scholar 

  17. Garrard, P., Maloney, L.M., Hodges, J.R., Patterson, K.: The effects of very early Alzheimer’s disease on the characteristics of writing by a renowned author. Brain 128, 250–260 (2005)

    Article  Google Scholar 

  18. Neme, A., Pulido, J.R.G., Muńos, A., Hernández, S., Dey, T.: Stylistics analysis and authorship attribution algorithms based on self-organizing maps. Neurocomputing 147(5), 147–159 (2015)

    Article  Google Scholar 

  19. Neme, A., Hernández, S., Nido, A., Islas, C.: Multilayer Perceptrons as Classifiers Guided by Mutual Information and Trained with Genetic Algorithms. In: Yin, H., Costa, J.A.F., Barreto, G. (eds.) IDEAL 2012. LNCS, vol. 7435, pp. 176–183. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-32639-4_22

    Chapter  Google Scholar 

  20. Ho, T.K.: Random decision forests. In: Proceedings of the 3rd International Conference on Document Analysis and Recognition, Montreal, pp. 278–282 (1995)

    Google Scholar 

  21. Quinlan, J.R.: Induction of decision trees. Mach. Learn. 1, 81–106 (1996). https://doi.org/10.1007/BF00116251

    Article  Google Scholar 

  22. Breiman, L.: Random forests. Mach. Learn. 45(1), 5–32 (2001). https://doi.org/10.1109/TIFS.2016.26039600

    Article  MATH  Google Scholar 

  23. Principal, J.I., Analysis, C.: Springer Series in Statistics, vol. 195. Springer, New York (2002)

    Google Scholar 

  24. Mohammed, A.J., Yusof, Y., Husni, H.: Document clustering based on firefly algorithm. J. Comput. Sci. 11(3), 453–465 (2015). https://doi.org/10.1109/TIFS.2016.26039601

    Article  Google Scholar 

  25. Kamvar, K., et al.: Spectral Learning (2003)

    Google Scholar 

Download references

Acknowledgements

This work was partially supported by UNAM-PAPIIT IA103420. AN and EMMR thank SNI CONACyT.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Antonio Neme .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Escobar, R., Juarez, L., Molino-Minero-Re, E., Neme, A. (2020). An Algorithm to Detect Variations in Writing Styles of Columnists After Major Political Changes. In: Martínez-Villaseñor, L., Herrera-Alcántara, O., Ponce, H., Castro-Espinoza, F.A. (eds) Advances in Computational Intelligence. MICAI 2020. Lecture Notes in Computer Science(), vol 12469. Springer, Cham. https://doi.org/10.1007/978-3-030-60887-3_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-60887-3_1

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-60886-6

  • Online ISBN: 978-3-030-60887-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics