Abstract
Rare diseases pose particular challenges to patients, families, caregivers, clinicians and researchers. Due to the scarce availability of information and their disintegration, in recent years we are witnessing a strong growth of patient communities on social platforms such as Facebook. Although the data generated in this context are of high value, the currently existing ontologies and resources tend to ignore them. The work presented in this paper studies how to extract knowledge from the large availability of unstructured text generated by the users over time, in order to represent it in an organized way and to make logical reasoning above. Starting from the awareness of the need to integrate different methodologies in complex domains, the research shows a combined use of Text Mining and Semantic Web techniques. In particular, we describe the basis of a novel approach for Knowledge Graph Learning with the aim of introducing a patient-centered vision into the world of Linked Open Data. By identifying and representing correlations between concepts of interest, we show how it is possible to answer patients’ questions and provide them with an additional tool for decision making. The outlined contribute minimizes costs through automatic data retrieval and increases the productivity of investigators.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
- 3.
Model used to express the frequency and the provenance of associations, in an ontological design context.
References
T. Anusua, T.W. yong, Transfer Learning for Text using Deep Learning Virtual Machine (DLVM). https://bit.ly/2YzR1uO (2018). Accessed 29 Mar 2020
M.N. Asim, M. Wasim, M.U.G. Khan, W. Mahmood et al., A survey of ontology learning techniques and applications. Database 2018, (2018)
D.M. Blei, A.Y. Ng, M.I. Jordan, Latent Dirichlet allocation. J. Mach. Learn. Res. 3(Jan), 993–1022 (2003)
G.O. Blog, Introducing the knowledge graph: Thing, not strings. Introducing the knowledge graph: Thing, not strings (2012)
A. Carbonaro, Interlinking e-learning resources and the web of data for improving student experience. J. e-Learn. Knowl. Soc. 8(2), 33–44 (2012)
A. Carbonaro, F. Piccinini, R. Reda, Integrating heterogeneous data of healthcare devices to enable domain data management. J. e-Learning Knowl. Soc. 14(1), (2018)
G. Domeniconi, M. Masseroli, G. Moro, P. Pinoli, Discovering new gene functionalities from random perturbations of known gene ontological annotations, in KDIR, pp. 107–116 (2014)
G. Domeniconi, M. Masseroli, G. Moro, P. Pinoli, Cross-organism learning method to discover new gene functionalities. Comput. Methods Programs Biomed. 126, 20–34 (2016)
G. Domeniconi, G. Moro, A. Pagliarani, R. Pasolini, On deep learning in cross-domain sentiment classification, in KDIR, pp. 50–60 (2017)
L. Ehrlinger, W. Wöß, Towards a definition of knowledge graphs, in SEMANTiCS (Posters, Demos, SuCCESS), 48 (2016)
G. Frisoni, G. Moro, A. Carbonaro, Learning interpretable and statistically significant knowledge from unlabeled Corpora of social text messages: A novel methodology of descriptive text mining, in International Conference on Data Science, Technology and Applications (2020)
Gartner Hype Cycle for Emerging Technologies 2019. https://gtnr.it/3dB37Is. Accessed 29 Mar 2020
L. Gasparri, D. Marconi, Word meaning, in The Stanford Encyclopedia of Philosophy, ed. by E.N. Zalta (Metaphysics Research Lab, Stanford University, fall 2019 edition, 2019)
T. Groza, S. Köhler, D. Moldenhauer, N. Vasilevsky et al., The human phenotype ontology: Semantic unification of common and rare disease. Am. J. Human Genetics 97(1), 111–124 (2015)
T. Hofmann, Probabilistic Latent Semantic Analysis. arXiv preprint arXiv:1301.6705 (2013)
What is HOOM (The ORDO-HOOM Ontological Module)? http://www.orphadata.org/cgi-bin/img/PDF/WhatIsHOOM.pdf. Sept 2019
R. Jia, P. Liang, Adversarial examples for evaluating reading comprehension systems. arXiv preprint arXiv:1707.07328 (2017)
F.K. Khattak, S. Jeblee, C. Pou-Prom, M. Abdalla et al., A survey of word embeddings for clinical text. J. Biomed. Inf. X 4, 100057 (2019)
S. Köhler, N.A. Vasilevsky, M. Engelstad, E. Foster et al., The human phenotype ontology in 2017. Nucleic Acids Res. 45(D1), D865–D876 (2017)
R. Kontchakov, M. Rezk, M. Rodriguez-Muro, G. Xiao et al., Answering SPARQL queries over databases under OWL 2 QL entailment regime, in International Semantic Web Conference (Springer, Berlin, 2014), pp. 552–567
S. Köhler, M.H. Schulz, P. Krawitz, S. Bauer et al., Clinical diagnostics in human genetics with semantic similarity searches in ontologies. Am. J. Human Genetics 85(4), 457–464 (2009)
Z. Lan, M. Chen, S. Goodman, K. Gimpel, et al., Albert: A lite bert for self-supervised learning of language representations. arXiv preprint arXiv:1909.11942 (2019)
T.K. Landauer, S.T. Dumais, A solution to Plato’s problem: The latent semantic analysis theory of acquisition, induction, and representation of knowledge. Psychol. Rev. 104(2), 211 (1997)
Y. Liu, M. Ott, N. Goyal, J. Du, et al., Roberta: A robustly optimized bert pretraining approach. arXiv preprint arXiv:1907.11692 (2019)
M.A. Magumba, P. Nabende, E. Mwebaze, Ontology boosted deep learning for disease name extraction from Twitter messages. J. Big Data 5(1), 31 (2018)
S.M. Mathews, Explainable artificial intelligence applications in NLP, biomedical, and malware classification: A literature review, in Intelligent Computing-Proceedings of the Computing Conference (Springer, Berlin, 2019), pp. 1269–1292
Q&A with Tim Berners-Lee. Businessweek. Retrieved from https://www.bloomberg.com/news/articles/2007-04-09/q-and-a-with-tim-berners-leebusinessweek-business-news-stock-market-and-financial-advice (2007)
EURORDIS Rare Barometer Graphic Report. Share and protect our health data! Rare disease patients’ preferences on data sharing and protection (2020)
A. Rath, A. Olry, F. Dhombres, M.M.Brandt et al., Representation of rare diseases in health information systems: The Orphanet approach to serve a wide range of end users. Human Mutation 33(5), 803–808 (2012)
S. Riccucci, A. Carbonaro, G. Casadei, Knowledge acquisition in intelligent tutoring system: A data mining approach, in Mexican International Conference on Artificial Intelligence (Springer, Berlin, 2007), pp. 1195–1205
W. Samek, T. Wiegand, K.-R. Müller, Explainable artificial intelligence: Understanding, visualizing and interpreting deep learning models. arXiv preprint arXiv:1708.08296 (2017)
L.M. Schriml, C. Arze, S. Nadendla, Y.-W.W.Chang et al., Disease Ontology: A backbone for disease semantic integration. Nucleic Acids Res. 40(D1), D940–D946 (2012)
D. Vasant, L. Chanas, J. Malone, M. Hanauer et al., ORDO: An ontology connecting rare disease, epidemiology and genetic data, in Proceedings of ISMB, vol. 30 (2014)
B. Wang, A. Wang, F. Chen, Y. Wang et al., Evaluating word embedding models: Methods and experimental results. APSIPA Trans. Signal Inf. Process. 8, e19 (2019)
X. Wilcke, P. Bloem, V. de Boer, The knowledge graph as the default data model for learning on heterogeneous knowledge. Data Sci. 1(1–2), 39–57 (2017)
I.H. Witten, E. Frank, M.A. Hall, C.J. Pal, Moving on: Applications and beyond (Chapt. 13), in Data Mining, 4th ed., ed. by I.H. Witten, E. Frank, M.A. Hall, C.J. Pal (Morgan Kaufmann, 2017), pp. 503 – 532
X. Zenuni, B. Raufi, F. Ismaili, J. Ajdari, State of the art of semantic web for healthcare. Proc.- Soc. Behav. Sci. 195, 1990–1998 (2015)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Frisoni, G., Moro, G., Carbonaro, A. (2021). Towards Rare Disease Knowledge Graph Learning from Social Posts of Patients. In: Visvizi, A., Lytras, M.D., Aljohani, N.R. (eds) Research and Innovation Forum 2020. RIIFORUM 2020. Springer Proceedings in Complexity. Springer, Cham. https://doi.org/10.1007/978-3-030-62066-0_44
Download citation
DOI: https://doi.org/10.1007/978-3-030-62066-0_44
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-62065-3
Online ISBN: 978-3-030-62066-0
eBook Packages: EducationEducation (R0)