Abstract
In most retrieval systems the answer to a query is a ranked list of documents. There is little information about the ranking and no support for exploring the relationships that may exist between the documents. In this paper we consider the use of clustering answers to better support users satisfying their information needs. We show how clustering reflects the nature of some information needs, and how the clustering can be used to find more relevant documents than would be the case using simple lists. This work contributes to our approach of building answers to information needs, rather than simply providing lists.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
N. J. Belkin, P. G. Marchetti, and C. CooL. Braque: Design of an interface to support user interaction in inform ation retrieval. Information Processing and Management, 29(3):325–344, 1993.
M.E. Brown. A general model of information-seeking behavior. Journal of the American Society for Information Science, 42(1):9–14, 1991.
C. Buckley, G. Salton, and J. Allan. The effect of adding relevance information in a relevance feedback environment. In Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 292–300, 1994.
W. B. Croft. A model of cluster searching based on classification. Information Systems, 5:189–195, 1980.
J.O. Cutting, D.R. Pedersen, D. Karger, and J.W. Tukey. Scatter/gather: A cluster-based approach to browsing large document collections. In Proceedings of the 15th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 318–329, 1992.
W.B. Frakes and Baeza-Yates. Information Retrieval: Data Structures and Algorithms. Prentice-Hall, 1992.
J.I.B. Gonzales. A theory of organization. In The 12th Annual International Conference on Systems Documentation, pages 145–155, Alberta, Canada, 1994. ACM.
D. Harman and E. Vorhees, editors. Proceedings of the Fifth Text Retrieval Conference, number 500–238 in NIST Special Publication, Gaithersburg, Maryland, 1996. Department of Commerce, National Institute of Standards and Technology.
R.M. Hayes. Mathematical Models in Information Retrieval. McGraw-Hill, New York, 1963.
M.A. Hearst and J.O. Pedersen. Re-examining the cluster hypothesis: Scatter/gather on retrieval results. In Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 1996.
M.A. Hearst, J.O. Pedersen, P. Pirolli, and H. Schutze. Xerox site report: Four TREC-4 tracks. In D. Harman, editor, Proceedings of the Fourth Text Retrieval Conference, 1995.
A.K. Jain and R.C. Dubes. Algorithms for Clustering Data. Prentice Hall, 1988.
G. Marchionini and B. Shneiderman. Finding facts vs. browsing knowledge in hypertext systems. IEEE Computer, 21(1):70–80, 1988.
Daniel E. Rose, Richard Mander, Tim Oren, Dulce B. Poncelon, Gitta Salo man, and Yin Yin Wong. Content awareness in a file system interface: Implementing the “pile” metaphor for organizing information. In Proceedings of the 16th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 260–269, 1993.
G. Salton. Cluster search strategies and the optimization of retrieval effectiveness. In G. Salton, editor, The SMART Retrieval System, pages 223–242. Prentice Hall, 1971.
G. Salton. Automatic Text Processing. Addison-Wesley, Reading, Massachusetts, 1989.
H. Schutze and C. Silverstein. Projections for efficient document clustering. In Proceedings of the 20th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 74–81, 1997.
C. Silverstein and J.O. Pedersen. Almost-constant-time clustering of arbitrary corpus subsets. In Proceedings of the 20th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 60–66, 1997.
R.S. Taylor. The process of asking questions. American Documentation, pages 391–396, October 1962.
C. J. van Rijsbergen. Information Retrieval. Butterworths, London, 1979.
R. Wilkinson and M. Fuller. Integrated information access via structure. In M. Agosti and A. Smeaton, editors, Hypertext and Information Retrieval, pages 257–271. Kluwer, Boston, U.S.A., 1996.
P. Willett. Recent trends in hierarchic document clustering: A critical review. Information Processing and Management, 24(5):577–591, 1988.
I.H. Witten, A. Moffat, and T.C. Bell. Managing Gigabytes: Compressing and Indexing Documents and Images. Van Nostrand Reinhold, New York, 1994.
M. Wu and M.S. Fuller. Supporting the answering process. In J. Thom, editor, Proceeding of the Second Australian Document Computing Symposium, pages 65–73, Melbourne, Australia, 1997.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1998 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wu, M., Wilkinson, R. (1998). Using Document Relationships for Better Answers. In: Munson, E.V., Nicholas, C., Wood, D. (eds) Principles of Digital Document Processing. PODDP 1998. Lecture Notes in Computer Science, vol 1481. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-49654-8_4
Download citation
DOI: https://doi.org/10.1007/3-540-49654-8_4
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-65086-7
Online ISBN: 978-3-540-49654-0
eBook Packages: Springer Book Archive