Skip to main content

MedTable: Extracting Disease Types from Web Tables

  • Conference paper
  • First Online:
The Semantic Web: ESWC 2020 Satellite Events (ESWC 2020)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 12124))

Included in the following conference series:

  • 821 Accesses

Abstract

Diseases and their symptoms are a frequent information need for Web users. Diseases often are categorized into sub-types, manifested through different symptoms. Extracting such information from textual corpora is inherently difficult. Yet, this can be easily extracted from semi-structured resources like tables. We propose an approach for identifying tables that contain information about sub-type classifications and their attributes. Often tables have diverse and redundant schemas, hence, we align equivalent columns in disparate schemas s.t. information about diseases are accessible through a unified and a common schema. Experimental evaluation shows that we can accurately identify tables containing disease sub-type classifications and additionally align equivalent columns.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://www.ncbi.nlm.nih.gov/pubmed/.

  2. 2.

    https://disease-ontology.org/.

  3. 3.

    https://www.who.int/classifications/icd/en/.

  4. 4.

    https://github.com/koutraki/medtable.

  5. 5.

    Accessed 17.04.2019.

References

  1. Abacha, A.B., Demner-Fushman, D.: A question-entailment approach to question answering. arXiv e-prints, January 2019. https://arxiv.org/abs/1901.08079

  2. Abacha, A.B., Shivade, C., Demner-Fushman, D.: Overview of the MEDIQA 2019 shared task on textual inference, question entailment and question answering. In: BioNLP (2019)

    Google Scholar 

  3. Biswas, R., Koutraki, M., Sack, H.: Exploiting equivalence to infer type subsumption in linked graphs. In: Gangemi, A., et al. (eds.) ESWC 2018. LNCS, vol. 11155, pp. 72–76. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-98192-5_14

    Chapter  Google Scholar 

  4. Biswas, R., Koutraki, M., Sack, H.: Predicting Wikipedia infobox type information using word embeddings on categories. In: Proceedings of the EKAW 2018 Posters and Demonstrations Session (2018)

    Google Scholar 

  5. Biswas, R., Türker, R., Moghaddam, F.B., Koutraki, M., Sack, H.: Wikipedia infobox type prediction using embeddings. In: Proceedings of the First Workshop on Deep Learning for Knowledge Graphs and Semantic Technologies (DL4KGS) (2018)

    Google Scholar 

  6. Fetahu, B., Anand, A., Koutraki, M.: Tablenet: An approach for determining fine-grained relations for Wikipedia tables. In: The World Wide Web Conference, WWW 2019 (2019)

    Google Scholar 

  7. Grover, A., Leskovec, J.: node2vec: scalable feature learning for networks. In: KDD (2016)

    Google Scholar 

  8. Koutraki, M., Preda, N., Vodislav, D.: SOFYA: semantic on-the-fly relation alignment. In: Proceedings of the 19th International Conference on Extending Database Technology, EDBT 2016, Bordeaux, France, 15–16 March 2016 (2016)

    Google Scholar 

  9. Koutraki, M., Preda, N., Vodislav, D.: Online relation alignment for linked datasets. In: Blomqvist, E., Maynard, D., Gangemi, A., Hoekstra, R., Hitzler, P., Hartig, O. (eds.) ESWC 2017. LNCS, vol. 10249, pp. 152–168. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-58068-5_10

    Chapter  Google Scholar 

  10. Zitnik, M., Rok Sosič, S.M., Leskovec, J.: BioSNAP Datasets: Stanford biomedical network dataset collection, August 2018. http://snap.stanford.edu/biodata

  11. Pennington, J., Socher, R., Manning, C.D.: Glove: global vectors for word representation. In: EMNLP (2014)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Maria Koutraki .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Koutraki, M., Fetahu, B. (2020). MedTable: Extracting Disease Types from Web Tables. In: Harth, A., et al. The Semantic Web: ESWC 2020 Satellite Events. ESWC 2020. Lecture Notes in Computer Science(), vol 12124. Springer, Cham. https://doi.org/10.1007/978-3-030-62327-2_26

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-62327-2_26

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-62326-5

  • Online ISBN: 978-3-030-62327-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics