Abstract
The INN system is a dynamic hypertext tool for searching and exploring the WWW. It uses a dynamically built ancillary layer to support easy interaction. This layer features the subexpressions of index expressions that are extracted from rendered documents. Currently, the INN system uses keyword based matching. The effectiveness of the INN system may be increased by using matching functions for index expressions. In the design of such functions, several constraints stemming from the INN must be taken into account. Important constraints are a limited response time and storage space, a focus on discriminating (different notions of) subexpressions for index expressions, and domain independency. With these contextual constraints in mind, several matching functions are designed and both theoretically and practically evaluated.
Article PDF
Similar content being viewed by others
References
Arampatzis AT, Tsoris T, Koster CHA and van der Weide ThP (1998) Phrase-based information retrieval. Information Processing & Management, 34(6):693–707.
Berger FC (1998) Navigational query construction in a hypertext environment. PhD Thesis, Department of Computer Science, University of Nijmegen.
Brill E (1994) Some advances in rule-based part of speech tagging. In: Proceedings of the Twelfth National Conference on Artificial Intelligence (AAAI-94), Seattle, Wa.
Bruza PD (1993) Stratified information disclosure: A synthesis between information retrieval and hypermedia. PhD Thesis, University of Nijmegen, Nijmegen, The Netherlands.
Bruza PD and van der Weide ThP (1992) Stratified hypermedia structures for information disclosure. The Computer Journal, 35(3):208–220.
Evans DA, Ginther-Webster K, Hart M, Lefferts RG and Monarch I (1991) Automatic indexing using selective NLP and first-order thesauri. In: Lichnerowicz A, Ed., Proceedings of RIAO'91, Barcelona, Spain, pp. 624–643.
Evans D, Lefferts R, Grefenstette G, Handerson S, Hersch W and Archbold S (1992) Clarit trec design, experiments, and results. In: Harman DK, Ed., Proceedings of TREC-1, Gaithersburg, MD, US, pp. 251–286.
Farradane J (1980a) Relational indexing part I. Journal of Information Science, 1(5):267–276.
Farradane J (1980b) Relational indexing part II. Journal of Information Science, 1(6):313–324.
Iannella R, Ward N, Wood A, Sue H and Bruza P (1995) The open information locator project. Technical Report, Resource Discovery Unit, Cooperative Research Centre, University of Queensland, Brisbane, Australia.
Kilpelaïnen P and Mannila H (1993) Retrieval from hierarchical texts by partial patterns. In: Proceedings of the 16th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Pittsburgh, PA, USA, pp. 214–222.
Lucarella D and Zanzi Z (1993) Information retrieval from hypertext: An approach using plausible inference. Information Processing & Management, 29(3):299–312.
Mauldin ML (1989) Retrieval performance in FERRET. In: Proceedings of the ACMSIGIR Conference, pp. 347–355.
Metzler DP and Haas SW (1989) The constituent object parser: Syntactic structure for information retrieval. ACM Transactions of Information Systems, 7(3):292–316.
Salton G and Smith M (1989) On the application of syntactic methodologies in automatic text indexing. In: Proceedings of the ACM SIGIR Conference, pp. 137–150.
Smeaton AF and Sheridan P (1991) Using morpho-syntactic language analysis in phrase matching. In: Lichnerowicz A, Ed., Proceedings of RIAO'91, Barcelona, Spain, pp. 414–430.
Sparck Jones K and Tait JI (1984) Automatic search term variant generation. Journal of Documentation, 40(1):50–66.
Strzalkowski T (1995) Natural language information retrieval. Information Processing&Management, 31(3):397–417.
van Rijsbergen CJ (1975) Information Retrieval. Butterworths, London, United Kingdom.
van der Vet P and Mars NJI (1998) Bottom-up construction of ontologies. IEEE Transactions on Knowledge and Data Engineering, 10(4):513–526.
Wilkinson R and Fuller M (1996) Integrated information access via structure. In: Agosti M and Smeaton A, Eds., Hypertext and Information Retrieval, Kluwer, Boston, U.S.A., pp. 257–271.
Wondergem BCM, van Bommel P and van der Weide ThP (2000) Nesting and defoliation of index expressions for information retrieval. Knowledge and Information Systems, 2(1).
Wondergem BCM, van Uden M, van Bommel P and van der Wei de ThP (1999) INdex navigator for searching and exploring the WWW. Technical Report CSI-R9917, University of Nijmegen, Nijmegen, The Netherlands.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Wondergem, B., van Bommel, P. & van der Weide, T. Matching Index Expressions for Information Retrieval. Information Retrieval 2, 337–360 (2000). https://doi.org/10.1023/A:1009928328710
Issue Date:
DOI: https://doi.org/10.1023/A:1009928328710