Skip to main content

Using Information Divergence to Differentiate Deep from Superficial Resemblances Among Discourses

  • Conference paper
  • First Online:
  • 1123 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 12795))

Abstract

In comparative analyses of discourses that reflect particular cultural identities, it is often necessary to differentiate superficial distinctions that arise primarily as cultural markers from deeper distinctions that arise from differences in cultural structures. In this paper, we build on previous work in order to operationalize this distinction between deep and superficial relationships between discourses using computational methods. To do so, we draw on the notion of divergence from information theory to measure the extent to which lexical items from a discourse act as signals of one cultural identity over another. We carry out a series of three types of comparisons between the discourses of fourteen English-language online discussion communities primarily focused on religion and spirituality. In the first type of comparison, discourses are compared at the level of individual words and their frequencies. In the second type, they are compared at the level of word-usage patterns learned from topic models. In the third, they are also compared at the level of word-usage patterns, but from topic models trained on their discourses after removing highly distinguishing terms that represent superficial distinctions between them. Our results indicate that, while some discourses share close resemblances both superficial and deep, others may appear to share close resemblances only superficially or may only share close resemblances after accounting for their superficial differences. These findings suggest that the approach we describe may be of use to researchers studying language in a variety of comparative contexts.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   69.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   89.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    The code used in this analysis is available at https://github.com/zacharykstine/cc21_discourse_resemblances.

References

  1. Prothero, S.: The White Buddhist: The Asian Odyssey of Henry Steel Olcott, 1st edn. Indiana University Press, Indianapolis (1996)

    Google Scholar 

  2. Deitrick, J.E.: Engaged buddhist ethics: mistaking the boat for the shore. In: Queen, C., Prebish, C., Keown, D. (eds.) Action Dharma: New Studies in Engaged Buddhism, pp. 252–269, 1st edn. RoutledgeCurzon, New York (2003)

    Google Scholar 

  3. Stine, Z.K., Deitrick, J.E., Agarwal, N.: Comparative religion, topic models, and conceptualization: towards the characterization of structural relationships between online religious discourses. In: Karsdrop, F., McGillivray, B., Nerghes, A., Wevers, M. (eds.) Proceedings of the Workshop on Computational Humanities Research (CHR 2020). CEUR Workshop Proceedings. Amsterdam, Netherlands, vol. 2723, pp. 128–148 (2020)

    Google Scholar 

  4. Campbell, H.A., Evolvi, G.: Contextualizing current digital religion research on emerging technologies. Human Behav. Emerg. Technol. 2(1), 5–17 (2020). https://doi.org/10.1002/hbe2.149

    Article  Google Scholar 

  5. Lundmark, E., LeDrew, S.: Unorganized atheism and the secular movement: reddit as a site for studying “lived atheism.” Social Compass 66(1), 112–129 (2019). https://doi.org/10.1177/0037768618816096

    Article  Google Scholar 

  6. Sanders, W.S., Ferré, J.P.: Reader responses to religion news: discussions about ark encounter on reddit. J. Relig. Media Digit. Cult. 9(1), 107–130 (2020). https://doi.org/10.1163/21659214-bja10008

    Article  Google Scholar 

  7. Fuller, R.C.: Spiritual, But Not Religious: Understanding Unchurched America, 1st edn. Oxford University Press, New York (2001)

    Book  Google Scholar 

  8. Mercadante, L.A.: Belief without Borders: Inside the Minds of the Spiritual but not Religious, 1st edn. Oxford University Press, New York (2014)

    Book  Google Scholar 

  9. Drescher, E.: Choosing Our Religion: The Spiritual Lives of America’s Nones, 1st edn. Oxford University Press, New York (2016)

    Book  Google Scholar 

  10. Jain, A.R.: Peace Love Yoga: The Politics of Global Spirituality, 1st edn. Oxford University Press, New York (2020)

    Book  Google Scholar 

  11. Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)

    MATH  Google Scholar 

  12. Nichols, R., Slingerland, E., Nielbo, K., Bergeton, U., Logan, C., Kleinman, S.: Modeling the contested relationship between Analects, Mencius, and Xunzi: preliminary evidence from a machine-learning approach. J. Asian Stud. 77(1), 19–57 (2018). https://doi.org/10.1017/S0021911817000973

    Article  Google Scholar 

  13. Slingerland, E., Nichols, R., Neilbo, K., Logan, C.: The distant reading of religious texts: a “big data” approach to mind-body concepts in early China. J. Am. Acad. Relig. 85(4), 985–1016 (2017). https://doi.org/10.1093/jaarel/lfw090

    Article  Google Scholar 

  14. Hall, D., Jurafsky, D., Manning, C.D.: Studying the history of ideas using topic models. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP ’08), pp. 363–371. Association for Computational Linguistics, USA (2008)

    Google Scholar 

  15. Allen, C., Murdock, J.: LDA topic modeling: contexts for the history & philosophy of science. In: 2020 Preprint of a Chapter to appear in Ramsey, G., De Block, A. (eds.) The Dynamics of Science: Computational Frontiers in History and Philosophy of Science. Pittsburgh University Press; Pittsburgh (2020, forthcoming)

    Google Scholar 

  16. Nguyen, D., Liakata, M., DeDeo, S., Eisenstein, J., Mimno, D., Tromble, R., Winters, J.: How we do things with words: analyzing text as social and cultural data. Front. Artif. Intell. 3 (2020). https://doi.org/10.3389/frai.2020.00062

  17. Roberts, M.E., Stewart, B.M., Tingley, D.: Navigating the local modes of big data: the case of topic models. In: Alvarez, R.M. (ed.) Computational Social Science: Discovery and Prediction, pp. 51–97. 1st. ed., Cambridge University Press, New York (2016)

    Google Scholar 

  18. Thompson, L., Mimno, D.: Authorless topic models: biasing models away from known structure. In: Proceedings of the 27th International Conference on Computational Linguistics, pp. 3903–3914. Association for Computational Linguistics, Santa Fe (2018)

    Google Scholar 

  19. Chang, K.K., DeDeo, S.: Divergence and the complexity of differences in text and culture. J. Cult. Anal. 4(11), 1–36 (2020). https://doi.org/10.22148/001c.17585

    Article  Google Scholar 

  20. Klingenstein, S., Hitchcock, T., DeDeo, S.: The civilizing process in London’s Old Bailey. PNAS 111(26), 9419–9424 (2014). https://doi.org/10.1073/pnas.1405984111

    Article  Google Scholar 

  21. Gallagher, R.J., Reagan, A.J., Danforth, C.M., Dodds, P.S.: Divergent discourse between protests and counter-protests: #BlackLivesMatter and #AllLivesMatter. PLoS ONE 13(4), (2018). https://doi.org/10.1371/journal.pone.0195644

  22. Stine, Z.K., Agarwal, N.: Comparative discourse analysis using topic models: contrasting perspectives on China from reddit. In: International Conference on Social Media and Society (SMSociety'20), pp. 73–84. Association for Computing Machinery, Toronto (2020). https://doi.org/10.1145/3400806.3400816

  23. Sloman, S., Oppenheimer, D., DeDeo, S.: Can we detect conditioned variation in political speech? Two kinds of discussion and types of conversation. PLOS ONE 16(2), e0246689 (2021). https://doi.org/10.1371/journal.pone.0246689

    Article  Google Scholar 

  24. Baumgartner, J., Zannettou, S., Keegan, B., Squire, M., Blackburn, J.: The pushshift reddit dataset. In: Proceedings of the International AAAI Conference on Web and Social Media, vol. 14, pp. 830–839 (2020)

    Google Scholar 

  25. Řehůřek, R., Sojka, P.: Software framework for topic modelling with large corpora. In: Proceedings of LREC 2010 Workshop New Challenges for NLP Frameworks, pp. 46–50. University of Malta, Valletta (2010)

    Google Scholar 

Download references

Acknowledgements

This research is funded in part by the U.S. National Science Foundation (OIA-1946391, OIA-1920920, IIS-1636933, ACI-1429160, and IIS-1110868), U.S. Office of Naval Research (N00014-10-1-0091, N00014-14-1-0489, N00014-15-P-1187, N00014-16-1-2016, N00014-16-1-2412, N00014-17-1-2675, N00014-17-1-2605, N68335-19-C-0359, N00014-19-1-2336, N68335-20-C-0540, N00014-21-1-2121), U.S. Air Force Research Lab, U.S. Army Research Office (W911NF-17-S-0002, W911NF-16-1-0189), U.S. Defense Advanced Research Projects Agency (W31P4Q-17-C-0059), Arkansas Research Alliance, the Jerry L. Maulden/Entergy Endowment at the University of Arkansas at Little Rock, and the Australian Department of Defense Strategic Policy Grants Program (SPGP) (award number: 2020-106-094) to the third co-author, Nitin Agarwal. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the funding organizations. The researcher gratefully acknowledges the support.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zachary K. Stine .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Stine, Z.K., Deitrick, J.E., Agarwal, N. (2021). Using Information Divergence to Differentiate Deep from Superficial Resemblances Among Discourses. In: Rauterberg, M. (eds) Culture and Computing. Design Thinking and Cultural Computing. HCII 2021. Lecture Notes in Computer Science(), vol 12795. Springer, Cham. https://doi.org/10.1007/978-3-030-77431-8_21

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-77431-8_21

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-77430-1

  • Online ISBN: 978-3-030-77431-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics