Towards Rare Disease Knowledge Graph Learning from Social Posts of Patients

  • Conference paper
Research and Innovation Forum 2020 (RIIFORUM 2020)

Part of the book series: Springer Proceedings in Complexity (SPCOM)

Abstract

Rare diseases pose particular challenges to patients, families, caregivers, clinicians and researchers. Because information on them is scarce and fragmented, recent years have seen strong growth of patient communities on social platforms such as Facebook. Although the data generated in this context are of high value, existing ontologies and resources tend to ignore them. The work presented in this paper studies how to extract knowledge from the large volume of unstructured text generated by users over time, in order to represent it in an organized way and to perform logical reasoning over it. Starting from the awareness that complex domains require the integration of different methodologies, the research shows a combined use of Text Mining and Semantic Web techniques. In particular, we describe the basis of a novel approach for Knowledge Graph Learning with the aim of introducing a patient-centered vision into the world of Linked Open Data. By identifying and representing correlations between concepts of interest, we show how it is possible to answer patients’ questions and provide them with an additional tool for decision making. The outlined contribution minimizes costs through automatic data retrieval and increases the productivity of investigators.

Notes

  1. https://bioportal.bioontology.org/

  2. https://www.orpha.net

  3. Model used to express the frequency and the provenance of associations, in an ontological design context.

Author information

Correspondence to Giacomo Frisoni.

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Cite this paper

Frisoni, G., Moro, G., Carbonaro, A. (2021). Towards Rare Disease Knowledge Graph Learning from Social Posts of Patients. In: Visvizi, A., Lytras, M.D., Aljohani, N.R. (eds) Research and Innovation Forum 2020. RIIFORUM 2020. Springer Proceedings in Complexity. Springer, Cham. https://doi.org/10.1007/978-3-030-62066-0_44

  • DOI: https://doi.org/10.1007/978-3-030-62066-0_44

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-62065-3

  • Online ISBN: 978-3-030-62066-0

  • eBook Packages: Education, Education (R0)
