Abstract
A Natural Language Processing based Information Retrieval System that was one of the original systems developed in Phase I of TIPSTER, was the basis of research in TIPSTER III the goal of which was to add two extended capabilities to the core system. Following a description of the multiple levels of linguistic processing that were developed for the original DR-LINK System, details are provided on research into query-specific data fusion and query-specific cross-document summarization. Experimental results show that there is potential for improving retrieval through query-specific fusion and that analysts found the Detailed Multiple Document Summary to be extremely useful for almost every query, while the Thumbnail sketch was useful in approximately 50% of the queries.
Article PDF
Similar content being viewed by others
References
Bartell B, Cottrell GW and Belew RK (1994) Automatic combination of multiple ranked systems. In: Proceedings of the Seventeenth Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval. pp. 173-181.
Belkin N, Kantor P, Fox E and Shaw J (1995) Combining the evidence of multiple query representations for information retrieval. Information Processing and Management, 31(3):431-448.
Carletta J (1996) Assessing agreement on classification tasks: The kappa statistic. Computational Linguistics, 22(2):249-254.
Dreilinger D and Howe A (1997) Experiences with selecting search engines using metasearch. ACM Transactions on Information Systems, 15(3):195-222.
Fox E and Shaw J (1994) Combination of multiple searches. In: Harman D., Ed., The Second Text REtrieval Conference (TREC-2), National Institute of Standards and Technology Special Publications 500-215. Gaithersburg, MD, pp. 242-252.
Grimes J (1975) The Thread of Discourse. Mouton Publishers, The Hague.
Hull D, Pedersen J and Schuetze H (1996) Method combination for document filtering. In: Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. Zurich, pp. 279-288.
Krippendorff K (1980) Content Analysis: An Introduction to its Methodology. Sage, Newbury Park.
Landis JR and Koch GG (1977) The measurement of observer agreement for categorical data. Biometrics. 33:159-174.
Liddy ED (1998) Enhanced text retrieval using natural language processing. Bulletin of the American Society for Information Science. 24(4)
Liddy ED and Myaeng SH (1993) DR-LINK's linguistic-conceptual approach to document detection. In: Proceedings of First Text Retrieval Conference (TREC-1). NIST.
Liddy ED and Myaeng SH (1994a) DR-LINK system: Phase I summary. In: Proceedings of the TIPSTER Phase I Final Report.
Liddy ED and Myaeng SH (1994b) DR-LINK: A system update for TREC-2. In: Proceedings of Second Text Retrieval Conference (TREC-2). National Institute of Standards and Technology.
Liddy ED, Paik W and Yu ES (1994c) Text categorization for multiple users based on semantic information from a MRD. ACM Transactions on Information Systems.
Liddy ED, Paik W, Yu ES and McKenna M (1994d) Document retrieval using linguistic knowledge. In: Proceedings of RIAO '94 Conference.
Liddy ED (1995) Development and implementation of a discourse model for newspaper texts. In: Proceedings of the Dagstuhl on Summarizing Text for Intelligent Communication. Saarbruken, Germany.
Miller G (1990) WordNet: An online lexical database. International Journal of Lexicography, 3(4), special issue.
Savoy J, Ndarugendawmo M and Vrajitoru D (1996). Report on the TREC-4 experiment: combining probabilistic and vector-space schemes. In: Harman D, Ed., The Fourth Text REtrieval Conference (TREC-4). National Institute of Standards and Technology Special Publications 500-236, Gaithersburg, MD, pp. 537-547.
Singhal A, Buckley C and Mitra M (1996) Pivoted document length normalization. In: Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. Zurich, pp. 21-29.
Strzalkowski T (Ed) (1999) Natural Language Information Retrieval. Kluwer Academic Publishers, Dordrecht, The Netherlands.
Vogt C and Cottrell GW (1998) Predicting the performance of linearly combined IR systems. In: Proceedings of the Twenty First Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval. pp. 190-196.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Liddy, E.D., Diamond, T. & McKenna, M. DR-LINK in TIPSTER III. Information Retrieval 3, 291–311 (2000). https://doi.org/10.1023/A:1009986331526
Issue Date:
DOI: https://doi.org/10.1023/A:1009986331526