Concept Hierarchy-Based Text Database Categorization

  • Web Information Systems Engineering Paper
Knowledge and Information Systems

Abstract

Document categorization as a technique to improve the retrieval of useful documents has been extensively investigated. One important issue in a large-scale metasearch engine is to select text databases that are likely to contain useful documents for a given query. We believe that database categorization can be a potentially effective technique for good database selection, especially in the Internet environment where short queries are usually submitted. In this paper, we propose and evaluate several database categorization algorithms. This study indicates that while some document categorization algorithms could be adopted for database categorization, algorithms that take into consideration the special characteristics of databases may be more effective. Preliminary experimental results are provided to compare the proposed database categorization algorithms. A prototype database categorization system based on one of the proposed algorithms has been developed.

Additional information

Received 9 November 2000 / Revised 15 February 2001 / Accepted in revised form 29 May 2001

About this article

Cite this article

Meng, W., Wang, W., Sun, H. et al. Concept Hierarchy-Based Text Database Categorization. Knowl Inform Sys 4, 132–150 (2002). https://doi.org/10.1007/s101150200001

  • DOI: https://doi.org/10.1007/s101150200001
