skip to main content
10.1145/1099554.1099627acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
Article

Generating better concept hierarchies using automatic document classification

Published: 31 October 2005 Publication History

Abstract

This paper presents a hybrid concept hierarchy development technique for web returned documents retrieved by a meta-search engine. The aim of the technique is to separate the initial retrieved documents into topical oriented categories, prior to the actual concept hierarchy generation. The topical categories correspond to different semantic aspects of the query. This is done using a 1-of-n automatic document classification, on the initial set of returned documents. Then, an individual topical concept hierarchy is automatically generated inside each of the resulted categories. Both steps are executed on the fly at retrieval time. Due to the efficiency constraints imposed by the web retrieval context, the algorithm only uses document snippets (rather than full web pages) for both document classification and concept hierarchy generation. Experimental results show that the algorithm is able to improve the quality of the concept hierarchy presented to the searcher; at the same time, the efficiency parameters are kept within reasonable intervals.

References

[1]
Brill, E. (1995) "Transformation-based Error-driven Learning and Natural Language Processing: A Case study in Part-of-speech Tagging." Computational Linguistics 21(4), pp. 543--565.
[2]
Sanderson, M. and B. Croft (1999). "Deriving concept hierarchies from text." Proceedings of the 22nd annual international ACM SIGIR Conference on Research and Development in Information Retrieval. Berkely, CA, pp. 206--213.
[3]
Wu, Y. B., C. Rakthin and C. Li (2002). "Summarizing Search Results with Automatic Table of Contents." AMCIS 2002, Dallas, TX: pp 88--92.

Cited By

View all
  • (2013) Seeking beyond with IntegraL : A user study of sense‐making enabled by anchor‐based virtual integration of library systems Journal of the American Society for Information Science and Technology10.1002/asi.2290464:9(1927-1945)Online publication date: 22-Jul-2013

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
CIKM '05: Proceedings of the 14th ACM international conference on Information and knowledge management
October 2005
854 pages
ISBN:1595931406
DOI:10.1145/1099554
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 31 October 2005

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. automatic classification
  2. concept hierarchy
  3. document classification
  4. information retrieval
  5. manual classification

Qualifiers

  • Article

Conference

CIKM05
Sponsor:
CIKM05: Conference on Information and Knowledge Management
October 31 - November 5, 2005
Bremen, Germany

Acceptance Rates

CIKM '05 Paper Acceptance Rate 77 of 425 submissions, 18%;
Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

CIKM '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)4
  • Downloads (Last 6 weeks)0
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2013) Seeking beyond with IntegraL : A user study of sense‐making enabled by anchor‐based virtual integration of library systems Journal of the American Society for Information Science and Technology10.1002/asi.2290464:9(1927-1945)Online publication date: 22-Jul-2013

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media