ABSTRACT
Web catalog integration is an interesting problem in current digital content management. Past studies have shown that using a flattened structure with auxiliary information extracted from the source catalog can improve the integration results. However, the nature of a flattened structure ignores the hierarchical relationships, and thus the performance improvement of catalog integration may be reduced. In this paper, we propose an enhanced hierarchical catalog integration (EHCI) approach with conceptual thesauri extracted from the source catalog. The results show that our enhanced hierarchical integration approach effectively boosts the accuracy of hierarchical catalog integration.
- R. Agrawal and R. Srikant. On integrating catalogs. Proc. WWW10 pages 603--612, May 2001. Google ScholarDigital Library
- I.-X. Chen, J.-C. Ho, and C.-Z. Yang. An iterative approach for web catalog integration with support vector machines. Proc. AIRS'05 pages 703--708, Oct. 2005. Google ScholarDigital Library
- S. Dumais and H. Chen. Hierarchical classification of web content. Proc. SIGIR'00 pages 256--263, Jul. 2000. Google ScholarDigital Library
- S. Sarawagi, S. Chakrabarti, and S. Godbole. Cross-training: Learning probabilistic mappings between topics. Proc. SIGKDD'03 pages 177--186, Aug. 2003. Google ScholarDigital Library
- A. Sun, E.-P. Lim, and W.-K. Ng. Performance measurement framework for hierarchical text classification. JASIST 54(11): 1014--1028, Jun 2003.Google ScholarCross Ref
Index Terms
- On hierarchical web catalog integration with conceptual relationships in thesaurus
Recommendations
Learning to integrate web catalogs with conceptual relationships in hierarchical thesaurus
AIRS'06: Proceedings of the Third Asia conference on Information Retrieval TechnologyWeb catalog integration has been addressed as an important issue in current digital content management. Past studies have shown that exploiting a flattened structure with auxiliary information extracted from the source catalog can improve the ...
Architecting a cross-disciplinary thesaurus for the semantic web
DCMI '04: Proceedings of the 2004 international conference on Dublin Core and metadata applications: metadata across languages and culturesAn environmental health science thesaurus is needed to facilitate Semantic Web operations and aid with problem solving in this important cross-domain area. This paper demonstrates the need for an environmental health science thesaurus and reviews ...
From thesaurus to ontology
K-CAP '01: Proceedings of the 1st international conference on Knowledge captureThesauri such as the Art and Architecture Thesaurus (AAT) provide structured vocabularies for describing art objects. However, if we want to create a knowledge-rich description of an (image of an) art object, such as required by the "semantic web", ...
Comments