skip to main content
10.1145/544220.544234acmconferencesArticle/Chapter ViewAbstractPublication PagesjcdlConference Proceedingsconference-collections
Article

A language modelling approach to relevance profiling for document browsing

Authors Info & Claims
Published:14 July 2002Publication History

ABSTRACT

This paper describes a novel tool, SmartSkim, for content-based browsing or skimming of documents. The tool integrates concepts from passage retrieval and from interfaces, such as TileBars, which provide a compact overview of query term hits within a document. We base our tool on the concept of relevance profiling, in which a plot of retrieval status values at each word position of a document is generated. A major contribution of this paper is applying language modelling to the task of relevance profiling. We describe in detail the design of the SmartSkim tool, and provide a critique of the design. Possible applications of the tool are described, and we consider how an operational version of SmartSkim might be designed.

References

  1. Hearst, M. A.: TileBars: visualization of term distribution information in full text information access. Proc. CHI'95, (1995), 56--66 Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Whittaker, S., Hirschberg, J., Choi, J., Hindle, D., Pereira, F. and Singhal, A.: SCAN: Designing and evaluating user interfaces to support retrieval from speech archives. In Proceedings ACM SIGIR '99. ACM Press (1999) 26--33 Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Kaszkiel, M. and Zobel, J.: Passage Retrieval Revisited. In: Proceedings of the Twentieth International ACM-SIGIR Conference on Research and Development in Information Retrieval, Philadelphia, July 1997. ACM Press (1997) 178--185 Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Kaszkiel, M.: Indexing and Retrieval of Passages in Full-Text Databases, PhD thesis. RMIT Computer Science Technical Report (RT-17), May 2000 (2000)Google ScholarGoogle Scholar
  5. Kaszkiel, M., Zobel, J. and Sacks-Davis, R.: Efficient Passage Ranking for Document Databases. ACM Transactions on Information Systems, Vol 17, No. 4 (1999) 406--439 Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Landauer, T., Egan, D., Remde, J., Lesk, M., Lochbaum, C., and Ketchum, D.: Enhancing the usability of text through computer delivery and formative evaluation: The SuperBook project. In: McKnight, C., Dillon, A., and Richardson, J. (eds): Hypertext: A Psychological Perspective. Ellis Horwood (1993) 71--136Google ScholarGoogle Scholar
  7. Marchionini. G.: Information Seeking in Electronic Environments. Cambridge University Press, Cambridge (1995) Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Byrd, D.: A Scrollbar-based Visualization for Document Navigation. In Proceedings of ACM Digital Libraries 99. ACM Press (1999) Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. de Kretser, O. and Moffat, A.: Effective Document Presentation with a Locality-Based Similarity Heuristic. In: Proceedings of the Twenty Second International ACM-SIGIR Conference on Research and Development in Information Retrieval, Berkeley, August 1999. ACM Press (1999) 113--120 Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Tombros, A. and Sanderson, M.: Advantages of Query Biased Summaries in Information Retrieval. In: Proceedings of 1998 ACM SIGIR Conference on Research and Development in Information Retrieval (1998) 2--10 Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Ponte, J. and Croft, W. B.: A language modeling approach to information retrieval. In: Proceedings of the 1998 ACM SIGIR Conference on Research and Development in Information Retrieval (1998) 275--281 Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Song, F. and Croft, W.B.: A general language model for information retrieval in Proceedings of the 1999 ACM SIGIR Conference on Research and Development in Information Retrieval (1999) 279--280 Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Green, T.R.G.: Describing information artifacts with cognitive dimensions and structure maps. In: Diaper, D, and Hammond, N. V. (eds.): Proceedings of the HCI'91 Conference on People and Computers VI. Cambridge University Press, Cambridge (1991)Google ScholarGoogle Scholar
  14. Hendry, D. G. and Green, T.R.G.: Creating, comprehending and explaining spreadsheets: a cognitive interpretation of what discretionary users think of the spreadsheet model. Intl. J. of Human-Computer Studies, 40 (1994) 1033--1065 Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Nielson, J.: Hypertext '87 Trip Report. ACM SIGCHI Bulletin 10, (1998) 27--35 Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Schilit, B. N., Golovchinsky, G. and Price, M. N.: Beyond paper: Supporting Active Reading with free-form digital ink annotations. In: Proceedings of CHI98, ACM Press (1998) 149--156 Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. O'Day, V. L. and Jeffries, R. Orienteering in an Information Landscape: How Information Seekers get from here to there. In: Proceedings of INTERCHI '93, ACM Press (1993) 438--445 Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. A language modelling approach to relevance profiling for document browsing

              Recommendations

              Comments

              Login options

              Check if you have access through your login credentials or your institution to get full access on this article.

              Sign in
              • Published in

                cover image ACM Conferences
                JCDL '02: Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries
                July 2002
                448 pages
                ISBN:1581135130
                DOI:10.1145/544220

                Copyright © 2002 ACM

                Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

                Publisher

                Association for Computing Machinery

                New York, NY, United States

                Publication History

                • Published: 14 July 2002

                Permissions

                Request permissions about this article.

                Request Permissions

                Check for updates

                Qualifiers

                • Article

                Acceptance Rates

                JCDL '02 Paper Acceptance Rate69of240submissions,29%Overall Acceptance Rate415of1,482submissions,28%

              PDF Format

              View or Download as a PDF file.

              PDF

              eReader

              View online with eReader.

              eReader