ABSTRACT
This paper describes a novel tool, SmartSkim, for content-based browsing or skimming of documents. The tool integrates concepts from passage retrieval and from interfaces, such as TileBars, which provide a compact overview of query term hits within a document. We base our tool on the concept of relevance profiling, in which a plot of retrieval status values at each word position of a document is generated. A major contribution of this paper is applying language modelling to the task of relevance profiling. We describe in detail the design of the SmartSkim tool, and provide a critique of the design. Possible applications of the tool are described, and we consider how an operational version of SmartSkim might be designed.
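As a rough illustration of relevance profiling with a language model, the sketch below scores every word position by the smoothed log-likelihood of the query under a unigram model of a small window centred on that position. It is not the SmartSkim implementation: the function name `relevance_profile`, the fixed window width, the interpolation weight `lam`, and the use of the whole document as the background model are all assumptions made for this example.

```python
import math
from collections import Counter


def relevance_profile(doc_tokens, query_tokens, window=5, lam=0.5):
    """Return one score per word position (hypothetical helper).

    Each score is the log-likelihood of the query under a unigram model of
    the window centred on that position, linearly interpolated with a
    background model estimated from the whole document.
    """
    n = len(doc_tokens)
    doc_counts = Counter(doc_tokens)   # background model (assumption: the document itself)
    vocab_size = len(doc_counts)
    half = window // 2
    profile = []
    for i in range(n):
        # Clip the window at the document boundaries.
        lo, hi = max(0, i - half), min(n, i + half + 1)
        win_counts = Counter(doc_tokens[lo:hi])
        win_len = hi - lo
        score = 0.0
        for q in query_tokens:
            p_win = win_counts[q] / win_len
            # Add-one smoothing keeps the background probability above zero
            # for query terms that never occur in the document.
            p_bg = (doc_counts[q] + 1) / (n + vocab_size + 1)
            score += math.log(lam * p_win + (1 - lam) * p_bg)
        profile.append(score)
    return profile


doc = "the cat sat on the mat while the dog chased the ball".split()
profile = relevance_profile(doc, ["dog"], window=3)
print([round(s, 2) for s in profile])
```

Plotting `profile` against word position gives the kind of curve the abstract describes: scores rise around positions whose surrounding window contains query terms and fall back to the background level elsewhere. A wider window smooths the curve at the cost of less precise localisation.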
Index Terms
- A language modelling approach to relevance profiling for document browsing