Reference Hub1
A Hybrid Approach to Retrieve Knowledge from a Document

A Hybrid Approach to Retrieve Knowledge from a Document

Deepak Sahoo, Rakesh Chandra Balabantaray
Copyright: © 2020 |Volume: 16 |Issue: 1 |Pages: 18
ISSN: 1548-0666|EISSN: 1548-0658|EISBN13: 9781799804932|DOI: 10.4018/IJKM.2020010104
Cite Article Cite Article

MLA

Sahoo, Deepak, and Rakesh Chandra Balabantaray. "A Hybrid Approach to Retrieve Knowledge from a Document." IJKM vol.16, no.1 2020: pp.83-100. http://doi.org/10.4018/IJKM.2020010104

APA

Sahoo, D. & Balabantaray, R. C. (2020). A Hybrid Approach to Retrieve Knowledge from a Document. International Journal of Knowledge Management (IJKM), 16(1), 83-100. http://doi.org/10.4018/IJKM.2020010104

Chicago

Sahoo, Deepak, and Rakesh Chandra Balabantaray. "A Hybrid Approach to Retrieve Knowledge from a Document," International Journal of Knowledge Management (IJKM) 16, no.1: 83-100. http://doi.org/10.4018/IJKM.2020010104

Export Reference

Mendeley
Favorite Full-Issue Download

Abstract

The task of retrieving the theme of a document and presenting a shorter form compared to the original text to the user is a challenging assignment. In this article, a hybrid approach to extract knowledge from a text document is presented, in which three key sentence level relationships in association with the Markov clustering algorithm is used to cluster sentences in the document. After clustering, sentences are ranked in each cluster and the highest ranked sentences in each cluster are merged. In the end, to get the final theme of the document, the Gradient boosting technique XGboost is used to compress the newly generated sentence. The DUC-2002 data set is used to evaluate the proposed system and it has been observed that the performance of the proposed system is better than other existing systems.

Request Access

You do not own this content. Please login to recommend this title to your institution's librarian or purchase it from the IGI Global bookstore.