Abstract
In comparative analyses of discourses that reflect particular cultural identities, it is often necessary to differentiate superficial distinctions that arise primarily as cultural markers from deeper distinctions that arise from differences in cultural structures. In this paper, we build on previous work in order to operationalize this distinction between deep and superficial relationships between discourses using computational methods. To do so, we draw on the notion of divergence from information theory to measure the extent to which lexical items from a discourse act as signals of one cultural identity over another. We carry out a series of three types of comparisons between the discourses of fourteen English-language online discussion communities primarily focused on religion and spirituality. In the first type of comparison, discourses are compared at the level of individual words and their frequencies. In the second type, they are compared at the level of word-usage patterns learned from topic models. In the third, they are also compared at the level of word-usage patterns, but from topic models trained on their discourses after removing highly distinguishing terms that represent superficial distinctions between them. Our results indicate that, while some discourses share close resemblances both superficial and deep, others may appear to share close resemblances only superficially or may only share close resemblances after accounting for their superficial differences. These findings suggest that the approach we describe may be of use to researchers studying language in a variety of comparative contexts.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
The code used in this analysis is available at https://github.com/zacharykstine/cc21_discourse_resemblances.
References
Prothero, S.: The White Buddhist: The Asian Odyssey of Henry Steel Olcott, 1st edn. Indiana University Press, Indianapolis (1996)
Deitrick, J.E.: Engaged buddhist ethics: mistaking the boat for the shore. In: Queen, C., Prebish, C., Keown, D. (eds.) Action Dharma: New Studies in Engaged Buddhism, pp. 252–269, 1st edn. RoutledgeCurzon, New York (2003)
Stine, Z.K., Deitrick, J.E., Agarwal, N.: Comparative religion, topic models, and conceptualization: towards the characterization of structural relationships between online religious discourses. In: Karsdrop, F., McGillivray, B., Nerghes, A., Wevers, M. (eds.) Proceedings of the Workshop on Computational Humanities Research (CHR 2020). CEUR Workshop Proceedings. Amsterdam, Netherlands, vol. 2723, pp. 128–148 (2020)
Campbell, H.A., Evolvi, G.: Contextualizing current digital religion research on emerging technologies. Human Behav. Emerg. Technol. 2(1), 5–17 (2020). https://doi.org/10.1002/hbe2.149
Lundmark, E., LeDrew, S.: Unorganized atheism and the secular movement: reddit as a site for studying “lived atheism.” Social Compass 66(1), 112–129 (2019). https://doi.org/10.1177/0037768618816096
Sanders, W.S., Ferré, J.P.: Reader responses to religion news: discussions about ark encounter on reddit. J. Relig. Media Digit. Cult. 9(1), 107–130 (2020). https://doi.org/10.1163/21659214-bja10008
Fuller, R.C.: Spiritual, But Not Religious: Understanding Unchurched America, 1st edn. Oxford University Press, New York (2001)
Mercadante, L.A.: Belief without Borders: Inside the Minds of the Spiritual but not Religious, 1st edn. Oxford University Press, New York (2014)
Drescher, E.: Choosing Our Religion: The Spiritual Lives of America’s Nones, 1st edn. Oxford University Press, New York (2016)
Jain, A.R.: Peace Love Yoga: The Politics of Global Spirituality, 1st edn. Oxford University Press, New York (2020)
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)
Nichols, R., Slingerland, E., Nielbo, K., Bergeton, U., Logan, C., Kleinman, S.: Modeling the contested relationship between Analects, Mencius, and Xunzi: preliminary evidence from a machine-learning approach. J. Asian Stud. 77(1), 19–57 (2018). https://doi.org/10.1017/S0021911817000973
Slingerland, E., Nichols, R., Neilbo, K., Logan, C.: The distant reading of religious texts: a “big data” approach to mind-body concepts in early China. J. Am. Acad. Relig. 85(4), 985–1016 (2017). https://doi.org/10.1093/jaarel/lfw090
Hall, D., Jurafsky, D., Manning, C.D.: Studying the history of ideas using topic models. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP ’08), pp. 363–371. Association for Computational Linguistics, USA (2008)
Allen, C., Murdock, J.: LDA topic modeling: contexts for the history & philosophy of science. In: 2020 Preprint of a Chapter to appear in Ramsey, G., De Block, A. (eds.) The Dynamics of Science: Computational Frontiers in History and Philosophy of Science. Pittsburgh University Press; Pittsburgh (2020, forthcoming)
Nguyen, D., Liakata, M., DeDeo, S., Eisenstein, J., Mimno, D., Tromble, R., Winters, J.: How we do things with words: analyzing text as social and cultural data. Front. Artif. Intell. 3 (2020). https://doi.org/10.3389/frai.2020.00062
Roberts, M.E., Stewart, B.M., Tingley, D.: Navigating the local modes of big data: the case of topic models. In: Alvarez, R.M. (ed.) Computational Social Science: Discovery and Prediction, pp. 51–97. 1st. ed., Cambridge University Press, New York (2016)
Thompson, L., Mimno, D.: Authorless topic models: biasing models away from known structure. In: Proceedings of the 27th International Conference on Computational Linguistics, pp. 3903–3914. Association for Computational Linguistics, Santa Fe (2018)
Chang, K.K., DeDeo, S.: Divergence and the complexity of differences in text and culture. J. Cult. Anal. 4(11), 1–36 (2020). https://doi.org/10.22148/001c.17585
Klingenstein, S., Hitchcock, T., DeDeo, S.: The civilizing process in London’s Old Bailey. PNAS 111(26), 9419–9424 (2014). https://doi.org/10.1073/pnas.1405984111
Gallagher, R.J., Reagan, A.J., Danforth, C.M., Dodds, P.S.: Divergent discourse between protests and counter-protests: #BlackLivesMatter and #AllLivesMatter. PLoS ONE 13(4), (2018). https://doi.org/10.1371/journal.pone.0195644
Stine, Z.K., Agarwal, N.: Comparative discourse analysis using topic models: contrasting perspectives on China from reddit. In: International Conference on Social Media and Society (SMSociety'20), pp. 73–84. Association for Computing Machinery, Toronto (2020). https://doi.org/10.1145/3400806.3400816
Sloman, S., Oppenheimer, D., DeDeo, S.: Can we detect conditioned variation in political speech? Two kinds of discussion and types of conversation. PLOS ONE 16(2), e0246689 (2021). https://doi.org/10.1371/journal.pone.0246689
Baumgartner, J., Zannettou, S., Keegan, B., Squire, M., Blackburn, J.: The pushshift reddit dataset. In: Proceedings of the International AAAI Conference on Web and Social Media, vol. 14, pp. 830–839 (2020)
Řehůřek, R., Sojka, P.: Software framework for topic modelling with large corpora. In: Proceedings of LREC 2010 Workshop New Challenges for NLP Frameworks, pp. 46–50. University of Malta, Valletta (2010)
Acknowledgements
This research is funded in part by the U.S. National Science Foundation (OIA-1946391, OIA-1920920, IIS-1636933, ACI-1429160, and IIS-1110868), U.S. Office of Naval Research (N00014-10-1-0091, N00014-14-1-0489, N00014-15-P-1187, N00014-16-1-2016, N00014-16-1-2412, N00014-17-1-2675, N00014-17-1-2605, N68335-19-C-0359, N00014-19-1-2336, N68335-20-C-0540, N00014-21-1-2121), U.S. Air Force Research Lab, U.S. Army Research Office (W911NF-17-S-0002, W911NF-16-1-0189), U.S. Defense Advanced Research Projects Agency (W31P4Q-17-C-0059), Arkansas Research Alliance, the Jerry L. Maulden/Entergy Endowment at the University of Arkansas at Little Rock, and the Australian Department of Defense Strategic Policy Grants Program (SPGP) (award number: 2020-106-094) to the third co-author, Nitin Agarwal. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the funding organizations. The researcher gratefully acknowledges the support.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Stine, Z.K., Deitrick, J.E., Agarwal, N. (2021). Using Information Divergence to Differentiate Deep from Superficial Resemblances Among Discourses. In: Rauterberg, M. (eds) Culture and Computing. Design Thinking and Cultural Computing. HCII 2021. Lecture Notes in Computer Science(), vol 12795. Springer, Cham. https://doi.org/10.1007/978-3-030-77431-8_21
Download citation
DOI: https://doi.org/10.1007/978-3-030-77431-8_21
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-77430-1
Online ISBN: 978-3-030-77431-8
eBook Packages: Computer ScienceComputer Science (R0)