Abstract
Most existing extractive summarization models use the original text’s internal information and calculate each sentence’s importance individually. When applied to specific domains (such as verbal text, biomedical literature, etc.), these models have some drawbacks: the variety of synonym terms, unknown words or terminologies, and the intra-document and inter-document relations between sentences or terms. In this work, we proposed an ontology-based summarization model that leverages many knowledge bases to understand the input documents. Our proposed model was built with an integrated ontology and a signal transmission-based method for extending domain knowledge such as related terms, and relationships between terms and sentences. The proposed model has been proven effective with the highest ROUGE-2 F1 score in the test dataset of the MEDIQA 2021 MAS shared tasks.
Q.-A. Nguyen and K.-V. Nguyen—Contributed Equally.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
- 7.
- 8.
- 9.
References
Abualigah, L., Bashabsheh, M.Q., Alabool, H., Shehab, M.: Text summarization: a brief review. In: Recent Advances in NLP: The Case of Arabic Language, pp. 1–15 (2020)
Bennani-Smires, K., Musat, C., Hossmann, A., Baeriswyl, M., Jaggi, M.: Simple unsupervised keyphrase extraction using sentence embeddings. In: Proceedings of the 22nd Conference on Computational Natural Language Learning, pp. 221–229 (2018)
Blomqvist, E.: OntoCase-automatic ontology enrichment based on ontology design patterns. In: Bernstein, A., et al. (eds.) ISWC 2009. LNCS, vol. 5823, pp. 65–80. Springer, Heidelberg (2009). https://doi.org/10.1007/978-3-642-04930-9_5
Can, D.C., et al.: UETrice at MEDIQA 2021: a prosper-thy-neighbour extractive multi-document summarization model. In: Proceedings of the 20th Workshop on Biomedical Language Processing, pp. 311–319 (2021)
Erkan, G., Radev, D.R.: LexRank: graph-based lexical centrality as salience in text summarization. J. Artif. Intell. Res. 22, 457–479 (2004)
Gambhir, M., Gupta, V.: Recent automatic text summarization techniques: a survey. Artif. Intell. Rev. 47(1), 1–66 (2017)
Hovy, E., Lin, C.Y., et al.: Automated text summarization in SUMMARIST. In: Advances in Automatic Text Summarization, vol. 14, pp. 81–94. MIT press Cambridge, MA (1999)
Ježek, K., Steinberger, J.: Automatic text summarization (the state of the art 2007 and new challenges). In: Proceedings of Znalosti, pp. 1–12. Citeseer (2008)
Kaynar, O., Görmez, Y., Işık, Y.E., Demirkoparan, F.: Comparison of graph-based document summarization method. In: 2017 International Conference on Computer Science and Engineering (UBMK), pp. 598–603. IEEE (2017)
Kogilavani, A., Balasubramanie, P.: Ontology enhanced clustering based summarization of medical documents. Int. J. Recent Trends Eng. 1(1), 546 (2009)
Lin, C.Y.: Rouge: a package for automatic evaluation of summaries. In: Text Summarization Branches Out, pp. 74–81 (2004)
Lubani, M., Noah, S.A.M., Mahmud, R.: Ontology population: approaches and design aspects. J. Inf. Sci. 45(4), 502–515 (2019)
Mitra, P., Noy, N.F., Jaiswal, A.R.: OMEN: a probabilistic ontology mapping tool. In: Gil, Y., Motta, E., Benjamins, V.R., Musen, M.A. (eds.) ISWC 2005. LNCS, vol. 3729, pp. 537–547. Springer, Heidelberg (2005). https://doi.org/10.1007/11574620_39
Mohammed, O., Benlamri, R., Fong, S.: Building a diseases symptoms ontology for medical diagnosis: an integrative approach. In: The First International Conference on Future Generation Communication Technologies, pp. 104–108. IEEE (2012)
Mrini, K., et al.: UCSD-adobe at MEDIQA 2021: transfer learning and answer sentence selection for medical summarization. In: Proceedings of the 20th Workshop on Biomedical Language Processing, pp. 257–262 (2021)
Nastase, V.: Topic-driven multi-document summarization with encyclopedic knowledge and spreading activation. In: Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, pp. 763–772 (2008)
Osman, I., Yahia, S.B., Diallo, G.: Ontology integration: approaches and challenging issues. Inf. Fusion 71, 38–63 (2021)
Ozyurt, I.B., Bandrowski, A., Grethe, J.S.: Bio-AnswerFinder: a system to find answers to questions from biomedical texts. Database 2020 (2020)
Rahman, N., Borah, B.: Improvement of query-based text summarization using word sense disambiguation. Complex Intell. Syst. 6(1), 75–85 (2020)
Savery, M., Abacha, A.B., Gayen, S., Demner-Fushman, D.: Question-driven summarization of answers to consumer health questions. Sci. Data 7(1), 1–9 (2020)
Yadav, S., Sarrouti, M., Gupta, D.: NLM at MEDIQA 2021: transfer learning-based approaches for consumer question and multi-answer summarization. In: Proceedings of the 20th Workshop on Biomedical Language Processing, pp. 291–301 (2021)
Zhu, W., et al.: paht_nlp@ MEDIQA 2021: multi-grained query focused multi-answer summarization. In: Proceedings of the 20th Workshop on Biomedical Language Processing, pp. 96–102 (2021)
Acknowledgements
This research has been done under the research project QG “Research and Development of Vietnamese Multi-document Summarization Based on Advanced Language Models” of Vietnam National University, Hanoi (Code: QG.22.61). Quoc-An Nguyen was funded by the Master, PhD Scholarship Programme of Vingroup Innovation Foundation (VINIF), code VINIF.2022.ThS.001.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Nguyen, QA. et al. (2023). Integrating Ontology-Based Knowledge to Improve Biomedical Multi-Document Summarization Model. In: Nguyen, N.T., et al. Intelligent Information and Database Systems. ACIIDS 2023. Lecture Notes in Computer Science(), vol 13996. Springer, Singapore. https://doi.org/10.1007/978-981-99-5837-5_9
Download citation
DOI: https://doi.org/10.1007/978-981-99-5837-5_9
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-5836-8
Online ISBN: 978-981-99-5837-5
eBook Packages: Computer ScienceComputer Science (R0)