skip to main content
10.1145/3530190.3534850acmconferencesArticle/Chapter ViewAbstractPublication PagescompassConference Proceedingsconference-collections
short-paper
Public Access

Note: Using Causality to Mine Sjögren’s Syndrome related Factors from Medical Literature

Published: 29 June 2022 Publication History

Abstract

Research articles published in medical journals often present findings from causal experiments. In this paper, we use this intuition to build a model that leverages causal relations expressed in text to unearth factors related to Sjögren’s syndrome. Sjögren’s syndrome is an auto-immune disease affecting up to 3.1 million Americans. The uncommon nature of the disease, coupled with common symptoms with other autoimmune conditions make the timely diagnosis of this disease very hard. A centralized information system with easy access to common and uncommon factors related to Sjögren’s syndrome may alleviate the problem. We use automatically extracted causal relationships from text related to Sjögren’s syndrome collected from the medical literature to identify a set of factors, such as “signs and symptoms” and “associated conditions”, related to this disease. We show that our approach is capable of retrieving such factors with a high precision and recall values. Comparative experiments show that this approach leads to 25% improvement in retrieval F1-score compared to several state-of-the-art biomedical models, including BioBERT and Gram-CNN.

Supplementary Material

MP4 File (COMPASS_Note_23_P_Gujarathi_2022-06-29.mp4)
Hybrid Presentation Recording 2022-06-29

References

[1]
Dzmitry Bahdanau, Philemon Brakel, Kelvin Xu, Anirudh Goyal, Ryan Lowe, Joelle Pineau, Aaron Courville, and Yoshua Bengio. 2016. An actor-critic algorithm for sequence prediction. arXiv preprint arXiv:1607.07086(2016).
[2]
Danushka Bollegala, Simon Maskell, Richard Sloane, Joanna Hajne, and Munir Pirmohamed. 2018. Causality Patterns for Detecting Adverse Drug Reactions From Social Media: Text Mining Approach. JMIR public health and surveillance 4, 2 (2018).
[3]
SJ Bowman, GH Ibrahim, G Holmes, John Hamburger, and JR Ainsworth. 2004. Estimating the prevalence among Caucasian women of primary Sjögren’s syndrome in two general practices in Birmingham, UK. Scandinavian journal of rheumatology 33, 1 (2004), 39–43.
[4]
SJ Bowman, GH Ibrahim, G Holmes, J Hamburger, and JR Ainsworth. 2004. Estimating the prevalence among Caucasian women of primary Sjögren’s syndrome in two general practices in Birmingham, UK. Scandinavian Journal of Rheumatology 33, 1 (2004), 39–43. https://doi.org/10.1080/03009740310004676 arXiv:https://doi.org/10.1080/03009740310004676
[5]
Lindsay E. Brown, Michelle L. Frits, Christine K. Iannaccone, Michael E. Weinblatt, Nancy A. Shadick, and Katherine P. Liao. 2014. Clinical characteristics of RA patients with secondary SS and association with joint damage. Rheumatology 54, 5 (10 2014), 816–820. https://doi.org/10.1093/rheumatology/keu400 arXiv:https://academic.oup.com/rheumatology/article-pdf/54/5/816/6699339/keu400.pdf
[6]
Lindsay E Brown, Michelle L Frits, Christine K Iannaccone, Michael E Weinblatt, Nancy A Shadick, and Katherine P Liao. 2015. Clinical characteristics of RA patients with secondary SS and association with joint damage. Rheumatology 54, 5 (2015), 816–820.
[7]
Quoc-Chinh Bui, Breanndán Ó Nualláin, Charles A Boucher, and Peter MA Sloot. 2010. Extracting causal relations on HIV drug resistance from literature. BMC bioinformatics 11, 1 (2010), 101.
[8]
Xiaoyun Chen, Huaxun Wu, and Wei Wei. 2018. Advances in the diagnosis and treatment of Sjogren’s syndrome. Clinical rheumatology 37, 7 (2018), 1743–1749.
[9]
Tirthankar Dasgupta, Rupsa Saha, Lipika Dey, and Abir Naskar. 2018. Automatic Extraction of Causal Relations from Text using Linguistically Informed Deep Neural Networks. In Proceedings of the 19th Annual SIGdial Meeting on Discourse and Dialogue. 306–316.
[10]
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. arxiv:1810.04805 [cs.CL]
[11]
Lingli Dong, Yu Chen, Yasufumi Masaki, Toshiro Okazaki, and Hisanori Umehara. 2013. Possible mechanisms of lymphoma development in Sjogren’s syndrome. Current immunology reviews 9, 1 (2013), 13–22.
[12]
Linda Douglas. 2018. Facilitating timely diagnosis of Sjögren’s syndrome. BDJ Team 5, 2 (2018), 18026.
[13]
Naoki Egami, Christian J Fong, Justin Grimmer, Margaret E Roberts, and Brandon M Stewart. 2018. How to make causal inferences using texts. arXiv preprint arXiv:1802.02163(2018).
[14]
George E Fragoulis, Sofia Fragkioudaki, James H Reilly, Shauna C Kerr, Iain B McInnes, and Haralampos M Moutsopoulos. 2016. Analysis of the cell populations composing the mononuclear cell infiltrates in the labial minor salivary glands from patients with rheumatoid arthritis and sicca syndrome. Journal of Autoimmunity 73 (2016), 85–91.
[15]
Roxana Girju and Dan I. Moldovan. 2002. Text Mining for Causal Relations. In Proceedings of the Fifteenth International Florida Artificial Intelligence Research Society Conference. AAAI Press, 360–364. http://dl.acm.org/citation.cfm?id=646815.708596
[16]
Harsha Gurulingappa, Abdul Mateen-Rajpu, and Luca Toldo. 2012. Extraction of potential adverse drug events from medical case reports. Journal of biomedical semantics 3, 1 (2012), 15.
[17]
Harsha Gurulingappa, Abdul Mateen Rajput, Angus Roberts, Juliane Fluck, Martin Hofmann-Apitius, and Luca Toldo. 2012. Development of a benchmark corpus to support the automatic extraction of drug-related adverse effects from medical case reports. Journal of biomedical informatics 45, 5 (2012), 885–892.
[18]
Charles G Helmick, David T Felson, Reva C Lawrence, Sherine Gabriel, Rosemarie Hirsch, C Kent Kwoh, Matthew H Liang, Hilal Maradit Kremers, Maureen D Mayes, Peter A Merkel, 2008. Estimates of the prevalence of arthritis and other rheumatic conditions in the United States: Part I. Arthritis & Rheumatism 58, 1 (2008), 15–25.
[19]
Charles G. Helmick, David T. Felson, Reva C. Lawrence, Sherine Gabriel, Rosemarie Hirsch, C. Kent Kwoh, Matthew H. Liang, Hilal Maradit Kremers, Maureen D. Mayes, Peter A. Merkel, Stanley R. Pillemer, John D. Reveille, John H. Stone, and National Arthritis Data Workgroup. 2008. Estimates of the prevalence of arthritis and other rheumatic conditions in the United States: Part I. Arthritis & Rheumatism 58, 1 (2008), 15–25. https://doi.org/10.1002/art.23177 arXiv:https://onlinelibrary.wiley.com/doi/pdf/10.1002/art.23177
[20]
Iris Hendrickx, Su Nam Kim, Zornitsa Kozareva, Preslav Nakov, Diarmuid Ó Séaghdha, Sebastian Padó, Marco Pennacchiotti, Lorenza Romano, and Stan Szpakowicz. 2010. SemEval-2010 Task 8: Multi-Way Classification of Semantic Relations between Pairs of Nominals. In Proceedings of the 5th International Workshop on Semantic Evaluation. Association for Computational Linguistics, Uppsala, Sweden, 33–38. https://www.aclweb.org/anthology/S10-1006
[21]
Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long short-term memory. Neural computation 9, 8 (1997), 1735–1780.
[22]
Zhiheng Huang, Wei Xu, and Kai Yu. 2015. Bidirectional LSTM-CRF models for sequence tagging. arXiv preprint arXiv:1508.01991(2015).
[23]
Sho Ishikawa, Takuhei Shoji, Yuri Nishiyama, and Kei Shinoda. 2019. A case with acquired lacrimal fistula due to Sjögren’s syndrome. American Journal of Ophthalmology Case Reports 15 (2019), 100526.
[24]
Ashwin Ittoo and Gosse Bouma. 2011. Extracting explicit and implicit causal relations from sparse, domain-specific texts. In International Conference on Application of Natural Language to Information Systems. Springer, 52–63.
[25]
Roland Jonsson, Karl A Brokstad, Malin V Jonsson, Nicolas Delaleu, and Kathrine Skarstein. 2018. Current concepts on Sjögren’s syndrome–classification criteria and biomarkers. European journal of oral sciences 126 (2018), 37–48.
[26]
Armand Joulin, Edouard Grave, Piotr Bojanowski, Matthijs Douze, Hérve Jégou, and Tomas Mikolov. 2016. Fasttext. zip: Compressing text classification models. arXiv preprint arXiv:1612.03651(2016).
[27]
Dongyeop Kang, Varun Gangal, Ang Lu, Zheng Chen, and Eduard Hovy. 2017. Detecting and explaining causes from text for a time series event. arXiv preprint arXiv:1707.08852(2017).
[28]
Hongbin Kim, Junegak Joung, and Kwangsoo Kim. 2018. Semi-automatic extraction of technological causality from patents. Computers and Industrial Engineering 115 (2018), 532 – 542. https://doi.org/10.1016/j.cie.2017.12.004
[29]
Vijay Konda and John Tsitsiklis. 1999. Actor-critic algorithms. Advances in neural information processing systems 12 (1999).
[30]
Manolis Kyriakakis, Ion Androutsopoulos, Artur Saudabayev, and Joan Ginés i Ametllé. 2019. Transfer Learning for Causal Sentence Detection. In Proceedings of the 18th BioNLP Workshop and Shared Task. Association for Computational Linguistics, Florence, Italy, 292–297. https://doi.org/10.18653/v1/W19-5031
[31]
Zhenzhong Lan, Mingda Chen, Sebastian Goodman, Kevin Gimpel, Piyush Sharma, and Radu Soricut. 2019. Albert: A lite bert for self-supervised learning of language representations. arXiv preprint arXiv:1909.11942(2019).
[32]
Dong-gi Lee and Hyunjung Shin. 2017. Disease causality extraction based on lexical semantics and document-clause frequency from biomedical literature. BMC medical informatics and decision making 17, 1 (2017), 53.
[33]
Jinhyuk Lee, Wonjin Yoon, Sungdong Kim, Donghyeon Kim, Sunkyu Kim, Chan Ho So, and Jaewoo Kang. 2020. BioBERT: a pre-trained biomedical language representation model for biomedical text mining. Bioinformatics 36, 4 (2020), 1234–1240.
[34]
Zhaoning Li, Qi Li, Xiaotian Zou, and Jiangtao Ren. 2019. Causality extraction based on self-attentive BiLSTM-CRF with transferred embeddings. Neurocomputing 423(2019), 207–219.
[35]
Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, and Martin Riedmiller. 2013. Playing atari with deep reinforcement learning. arXiv preprint arXiv:1312.5602(2013).
[36]
Roland Mueller and Sebastian Hüttemann. 2018. Extracting Causal Claims from Information Systems Papers with Natural Language Processing for Theory Ontology Learning(Hawaii International Conference on System Sciences).
[37]
M. M. Muñoz, J. Sebastián, R. Roda, Y. J. Soriano, and María Gracia Sarrión Pérez. 2009. Sjögren’s syndrome of the oral cavity. Review and update.
[38]
Cuong Q. Nguyen and Ammon B. Peck. 2009. Unraveling the Pathophysiology of Sjogren Syndrome-Associated Dry Eye Disease. The Ocular Surface 7, 1 (2009), 11–27. https://doi.org/10.1016/S1542-0124(12)70289-6
[39]
Gaëtane Nocturne and Xavier Mariette. 2013. Advances in understanding the pathogenesis of primary Sjögren’s syndrome. Nature Reviews Rheumatology 9, 9 (2013), 544–556.
[40]
Ruchika Patel and Anupama Shahane. 2014. The epidemiology of Sjögren’s syndrome. Clinical epidemiology 6(2014), 247.
[41]
Michael J Paul. 2017. Feature Selection as Causal Inference: Experiments with Text Classification. In Proceedings of the 21st Conference on Computational Natural Language Learning (CoNLL 2017). 163–172.
[42]
Judea Pearl. 2009. Causal inference in statistics: An overview. Statist. Surv. 3(2009), 96–146. https://doi.org/10.1214/09-SS057
[43]
Shibani Santurkar, Dimitris Tsipras, Andrew Ilyas, and Aleksander Madry. 2018. How does batch normalization help optimization?arXiv preprint arXiv:1805.11604(2018).
[44]
Shota Sasaki, Sho Takase, Naoya Inoue, Naoaki Okazaki, and Kentaro Inui. 2017. Handling multiword expressions in causality estimation. In IWCS 2017—12th International Conference on Computational Semantics—Short papers.
[45]
Barbara Segal, Simon J Bowman, Philip C Fox, Frederick B Vivino, Nandita Murukutla, Jeff Brodscholl, Sarika Ogale, and Lachy McLean. 2009. Primary Sjögren’s Syndrome: health experiences and predictors of health quality among patients in the United States. Health and quality of life outcomes 7, 1 (2009), 1–9.
[46]
Ana-Luisa Stefanski, Christian Tomiak, Uwe Pleyer, Thomas Dietrich, Gerd Rüdiger Burmester, and Thomas Dörner. 2017. The diagnosis and treatment of Sjögren’s syndrome. Deutsches Ärzteblatt International 114, 20 (2017), 354.
[47]
Santosh Tirunagari. 2015. Data Mining of Causal Relations from Text: Analysing Maritime Accident Investigation Reports. arXiv preprint arXiv:1507.02447(2015).
[48]
Frederick B. Vivino. 2017. Sjogren’s syndrome: Clinical aspects. Clinical Immunology 182(2017), 48–54. https://doi.org/10.1016/j.clim.2017.04.005 Special issue: Sjogren’s Syndrome.
[49]
Jue Wang and Wei Lu. 2020. Two are Better than One: Joint Entity and Relation Extraction with Table-Sequence Encoders. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP). Association for Computational Linguistics, Online, 1706–1721. https://doi.org/10.18653/v1/2020.emnlp-main.133
[50]
Linlin Wang, Zhu Cao, Gerard De Melo, and Zhiyuan Liu. 2016. Relation classification via multi-level attention cnns. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 1298–1307.
[51]
Yujia Zhai, Shaojing Sun, Fang Wang, and Ying Ding. 2017. Multiplicity and uncertainty: Media coverage of autism causation. Journal of Informetrics 11, 3 (2017), 873 – 887. https://doi.org/10.1016/j.joi.2017.07.005
[52]
Yijia Zhang, Qingyu Chen, Zhihao Yang, Hongfei Lin, and Zhiyong Lu. 2019. BioWordVec, improving biomedical word embeddings with subword information and MeSH. Scientific data 6, 1 (2019), 1–9.
[53]
Shan Zhao, Minghao Hu, Zhiping Cai, and Fang Liu. 2020. Modeling Dense Cross-Modal Interactions for Joint Entity-Relation Extraction. In Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, IJCAI-20, Christian Bessiere (Ed.). International Joint Conferences on Artificial Intelligence Organization, 4032–4038. https://doi.org/10.24963/ijcai.2020/558 Main track.
[54]
Sendong Zhao, Meng Jiang, Ming Liu, Bing Qin, and Ting Liu. 2018. CausalTriad: Toward Pseudo Causal Relation Discovery and Hypotheses Generation from Medical Text Data. (2018).
[55]
Qile Zhu, Xiaolin Li, Ana Conesa, and Cécile Pereira. 2018. GRAM-CNN: a deep learning approach with local context for named entity recognition in biomedical text. Bioinformatics 34, 9 (2018), 1547–1554.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
COMPASS '22: Proceedings of the 5th ACM SIGCAS/SIGCHI Conference on Computing and Sustainable Societies
June 2022
710 pages
ISBN:9781450393478
DOI:10.1145/3530190
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 29 June 2022

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Sjögren’s syndrome
  2. causal relationships
  3. medical NLP

Qualifiers

  • Short-paper
  • Research
  • Refereed limited

Funding Sources

Conference

COMPASS '22
Sponsor:

Acceptance Rates

Overall Acceptance Rate 25 of 50 submissions, 50%

Upcoming Conference

COMPASS '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 230
    Total Downloads
  • Downloads (Last 12 months)120
  • Downloads (Last 6 weeks)15
Reflects downloads up to 17 Feb 2025

Other Metrics

Citations

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Login options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media