Skip to main content

Integrating Ontology-Based Knowledge to Improve Biomedical Multi-Document Summarization Model

  • Conference paper
  • First Online:
Intelligent Information and Database Systems (ACIIDS 2023)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 13996))

Included in the following conference series:

  • 209 Accesses

Abstract

Most existing extractive summarization models use the original text’s internal information and calculate each sentence’s importance individually. When applied to specific domains (such as verbal text, biomedical literature, etc.), these models have some drawbacks: the variety of synonym terms, unknown words or terminologies, and the intra-document and inter-document relations between sentences or terms. In this work, we proposed an ontology-based summarization model that leverages many knowledge bases to understand the input documents. Our proposed model was built with an integrated ontology and a signal transmission-based method for extending domain knowledge such as related terms, and relationships between terms and sentences. The proposed model has been proven effective with the highest ROUGE-2 F1 score in the test dataset of the MEDIQA 2021 MAS shared tasks.

Q.-A. Nguyen and K.-V. Nguyen—Contributed Equally.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 59.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 74.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://sites.google.com/view/mediqa2021.

  2. 2.

    https://disease-ontology.org/.

  3. 3.

    http://geneontology.org/.

  4. 4.

    https://www.ncbi.nlm.nih.gov/mesh/.

  5. 5.

    https://www.ncbi.nlm.nih.gov/mesh/.

  6. 6.

    https://mondo.monarchinitiative.org.

  7. 7.

    http://symptomontologywiki.igs.umaryland.edu.

  8. 8.

    http://ctdbase.org.

  9. 9.

    https://chiqa.nlm.nih.gov.

References

  1. Abualigah, L., Bashabsheh, M.Q., Alabool, H., Shehab, M.: Text summarization: a brief review. In: Recent Advances in NLP: The Case of Arabic Language, pp. 1–15 (2020)

    Google Scholar 

  2. Bennani-Smires, K., Musat, C., Hossmann, A., Baeriswyl, M., Jaggi, M.: Simple unsupervised keyphrase extraction using sentence embeddings. In: Proceedings of the 22nd Conference on Computational Natural Language Learning, pp. 221–229 (2018)

    Google Scholar 

  3. Blomqvist, E.: OntoCase-automatic ontology enrichment based on ontology design patterns. In: Bernstein, A., et al. (eds.) ISWC 2009. LNCS, vol. 5823, pp. 65–80. Springer, Heidelberg (2009). https://doi.org/10.1007/978-3-642-04930-9_5

    Chapter  Google Scholar 

  4. Can, D.C., et al.: UETrice at MEDIQA 2021: a prosper-thy-neighbour extractive multi-document summarization model. In: Proceedings of the 20th Workshop on Biomedical Language Processing, pp. 311–319 (2021)

    Google Scholar 

  5. Erkan, G., Radev, D.R.: LexRank: graph-based lexical centrality as salience in text summarization. J. Artif. Intell. Res. 22, 457–479 (2004)

    Article  Google Scholar 

  6. Gambhir, M., Gupta, V.: Recent automatic text summarization techniques: a survey. Artif. Intell. Rev. 47(1), 1–66 (2017)

    Article  Google Scholar 

  7. Hovy, E., Lin, C.Y., et al.: Automated text summarization in SUMMARIST. In: Advances in Automatic Text Summarization, vol. 14, pp. 81–94. MIT press Cambridge, MA (1999)

    Google Scholar 

  8. Ježek, K., Steinberger, J.: Automatic text summarization (the state of the art 2007 and new challenges). In: Proceedings of Znalosti, pp. 1–12. Citeseer (2008)

    Google Scholar 

  9. Kaynar, O., Görmez, Y., Işık, Y.E., Demirkoparan, F.: Comparison of graph-based document summarization method. In: 2017 International Conference on Computer Science and Engineering (UBMK), pp. 598–603. IEEE (2017)

    Google Scholar 

  10. Kogilavani, A., Balasubramanie, P.: Ontology enhanced clustering based summarization of medical documents. Int. J. Recent Trends Eng. 1(1), 546 (2009)

    Google Scholar 

  11. Lin, C.Y.: Rouge: a package for automatic evaluation of summaries. In: Text Summarization Branches Out, pp. 74–81 (2004)

    Google Scholar 

  12. Lubani, M., Noah, S.A.M., Mahmud, R.: Ontology population: approaches and design aspects. J. Inf. Sci. 45(4), 502–515 (2019)

    Article  Google Scholar 

  13. Mitra, P., Noy, N.F., Jaiswal, A.R.: OMEN: a probabilistic ontology mapping tool. In: Gil, Y., Motta, E., Benjamins, V.R., Musen, M.A. (eds.) ISWC 2005. LNCS, vol. 3729, pp. 537–547. Springer, Heidelberg (2005). https://doi.org/10.1007/11574620_39

    Chapter  Google Scholar 

  14. Mohammed, O., Benlamri, R., Fong, S.: Building a diseases symptoms ontology for medical diagnosis: an integrative approach. In: The First International Conference on Future Generation Communication Technologies, pp. 104–108. IEEE (2012)

    Google Scholar 

  15. Mrini, K., et al.: UCSD-adobe at MEDIQA 2021: transfer learning and answer sentence selection for medical summarization. In: Proceedings of the 20th Workshop on Biomedical Language Processing, pp. 257–262 (2021)

    Google Scholar 

  16. Nastase, V.: Topic-driven multi-document summarization with encyclopedic knowledge and spreading activation. In: Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, pp. 763–772 (2008)

    Google Scholar 

  17. Osman, I., Yahia, S.B., Diallo, G.: Ontology integration: approaches and challenging issues. Inf. Fusion 71, 38–63 (2021)

    Article  Google Scholar 

  18. Ozyurt, I.B., Bandrowski, A., Grethe, J.S.: Bio-AnswerFinder: a system to find answers to questions from biomedical texts. Database 2020 (2020)

    Google Scholar 

  19. Rahman, N., Borah, B.: Improvement of query-based text summarization using word sense disambiguation. Complex Intell. Syst. 6(1), 75–85 (2020)

    Article  Google Scholar 

  20. Savery, M., Abacha, A.B., Gayen, S., Demner-Fushman, D.: Question-driven summarization of answers to consumer health questions. Sci. Data 7(1), 1–9 (2020)

    Article  Google Scholar 

  21. Yadav, S., Sarrouti, M., Gupta, D.: NLM at MEDIQA 2021: transfer learning-based approaches for consumer question and multi-answer summarization. In: Proceedings of the 20th Workshop on Biomedical Language Processing, pp. 291–301 (2021)

    Google Scholar 

  22. Zhu, W., et al.: paht_nlp@ MEDIQA 2021: multi-grained query focused multi-answer summarization. In: Proceedings of the 20th Workshop on Biomedical Language Processing, pp. 96–102 (2021)

    Google Scholar 

Download references

Acknowledgements

This research has been done under the research project QG “Research and Development of Vietnamese Multi-document Summarization Based on Advanced Language Models” of Vietnam National University, Hanoi (Code: QG.22.61). Quoc-An Nguyen was funded by the Master, PhD Scholarship Programme of Vingroup Innovation Foundation (VINIF), code VINIF.2022.ThS.001.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mai-Vu Tran .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Nguyen, QA. et al. (2023). Integrating Ontology-Based Knowledge to Improve Biomedical Multi-Document Summarization Model. In: Nguyen, N.T., et al. Intelligent Information and Database Systems. ACIIDS 2023. Lecture Notes in Computer Science(), vol 13996. Springer, Singapore. https://doi.org/10.1007/978-981-99-5837-5_9

Download citation

  • DOI: https://doi.org/10.1007/978-981-99-5837-5_9

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-99-5836-8

  • Online ISBN: 978-981-99-5837-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics