Skip to main content
Log in

Exploiting citation overlaps for Information Retrieval: Generating a boomerang effect from the network of scientific papers

  • Published:
Scientometrics Aims and scope Submit manuscript

Abstract

A new citation search strategy is proposed for Information Retrieval (IR) based on the principle of polyrepresentation (Ingwersen, 1992, 1996). The strategy exploits logical overlaps between a range of cognitively different interpretations of the same documents in a structured manner, i.e. so-called cognitive overlaps of representations. The strategy is essentially a "cycling strategy" starting with documents retrieved by a subject search, wherefrom new documents are identified automatically by following the network of citations in scientific papers backwards and forwards in time. In contrast to earlier citation search strategies the proposed strategy does not require known relevant documents (seed documents) as a starting point, but may be based on a subject search. A pilot study is reported where the ability of the strategy to retrieve additional relevant documents is analysed. Results show that a very large amount of documents can be retrieved by the strategy, and that these may be segmented in a number of distinct "overlap levels". It is demonstrated that the combined core of the higher-level overlaps contains higher relevance density than found in the original retrieval results. Based on these results it is suggested that the documents be displayed in order of their presence in higher-level overlaps, so as to maximise the chances that as many relevant documents as possible will be presented first to a user.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

References

  • Borlund, P. (2000a), Experimental components for the evaluation of interactive information retrieval systems, Journal of Documentation, 56: 71–90.

    Article  Google Scholar 

  • Borlund, P. (2000b), Evaluation of Interactive Information Retrieval Systems, Åbo: Åbo Akademi University Press, 276 p. (PhD thesis).

    MATH  Google Scholar 

  • Case, D. O., Higgins, G. M. (2000), How can we investigate citation behaviour? A study of reasons for citing literature in communication. Journal of the American Society for Information Science, 51: 635–645.

    Article  Google Scholar 

  • Cawkell, A. E. (1968), Search strategies using the science citation index. In: Houghton, B. (Ed.), Computer Based Information Retrieval Systems, London: Clive Bingley, pp. 49-

    Google Scholar 

  • Cawkell, A. E. (1974), Search strategy, construction and use of citation networks, with a socio-scientific example: “Amorphus semi-conductors and S. R. Ovshinsky”, Journal of the American Society for Information Science, 35: 123–130.

    Google Scholar 

  • Cawkell, A. E. (1998), Checking research progress on 'image retrieval by shape-matching' using the Web of Science, Aslib Proceedings, 50 (2): 27–31.

    Article  Google Scholar 

  • Cawkell, A. E. (2000), Methods of information retrieval using Web of Science: pulmonary hypertension as a subject example, Journal of Information Science, 26: 66–70.

    Google Scholar 

  • Cleverdon, C. W., Mills, J., Keen, E. M. (1966), Factors Determining the Performance of Indexing Systems (Volume 1: Design), Cranfield: College of Aeronautics.

    Google Scholar 

  • Croft, W. B., Thompson, R. H. (1987), I 3 R: A new approach to the design of document retrieval systems, Journal of the American Society for Information Science, 38: 389–404.

    Article  Google Scholar 

  • Cronin, B. (1984), The Citation Process: The Role and Significance of Citations in Scientific Communication, London: Taylor Graham, 103 p.

    Google Scholar 

  • Egghe, L., Rousseau, R. (1990), Introduction to Informetrics: Quantitative Methods in Library, Documentation and Information Science, Amsterdam: Elsevier.

    Google Scholar 

  • Egghe, L., Rousseau, R. (2002), Co-citation, bibliographic coupling and a characterization of lattice citation networks, Scientometrics. (In press)

  • Ellis, D. (1989), A behavioural approach to information retrieval system design, Journal of Documentation, 45: 171–212.

    Google Scholar 

  • Garfield, E. (1955), Citation indexes for science: A new dimension in documentation through association of ideas, Science, 122, pp. 108–111. (http://www.garfield.library.upenn.edu/essays/v6p468y1983.pdf)

    Google Scholar 

  • Garfield, E. (1964), Science citation index-a new dimension in indexing, Science, 144, pp. 649–654.

    Google Scholar 

  • Garfield, E. (1965), Can citation indexing be automated? In: Stevens, M. E., Giuliano, V. E., Heilprin, L. B. (Eds): Statistical Association Methods for Mechanized Documentation, Symposium Proceedings, Washington, 1964. Washington: National Bureau of Standards, pp. 189–192. (National Bureau of Standards Miscellaneous Publication; 26

    Google Scholar 

  • Garfield, E. (1970), Trained scientists use Science Citation Index to complete bibliographic citations & update searches, Current Contents, (August 19): 119.

  • Garfield, E. (1979), Citation Indexing-Its Theory and Application in Science, Technology, and Humanities, New York: John Wiley & Sons, 274 p. (Information Science Series).

    Google Scholar 

  • Ingwersen, P. (1992), Information retrieval interaction. London: Taylor Graham 264 p.

    Google Scholar 

  • Ingwersen, P. (1996), Cognitive perspectives of Information Retrieval Interaction–elements of a cognitive IR theory, Journal of Documentation, 52: 3–50.

    Article  Google Scholar 

  • Ingwersen, P., Larsen, B., Wormell, I. (2001), The publication-citation matrix and its derived quantities. Chinese Science Bulletin, 46: 524–528.

    Article  Google Scholar 

  • Ingwersen, P., Christensen, F. H. (1997), Data set isolation for bibliometric online analyses of research publications: Fundamental methodological issues, Journal of the American Society for Information Science, 48: 205–217.

    Article  Google Scholar 

  • Katzer, J., McGill, M. J., Tessier, J. A., Frakes, W., Dasgupta, P. (1982), A study of the overlap among document representations, Information Technology: Research and Development, 1: 261–274.

    Google Scholar 

  • Kochen, M. (1974), Principles of Information Retrieval. Los Angeles: Melville Publishing Company, 203. p. (Information Science Series).

    MATH  Google Scholar 

  • Lancaster, F. W. (1991), Indexing and Abstracting in Theory and Practice, London: Library Association, 328 p.

    Google Scholar 

  • McC ain, K. W. (1989), Descriptor and citation retrieval in the medical behavioral sciences literature: Retrieval overlaps and novelty distribution. Journal of the American Society for Information Science, 40: 110–114.

    Article  Google Scholar 

  • Pao, M. L. (1993), Term and citation retrieval-a field-study. Information Processing & Management, 29: 95–112.

    Article  Google Scholar 

  • Spencer, C. C. (1967), Subject searching with Science Citation Index: Preparation of a drug bibliography using Chemical Abstracts, Index Medicus, and Science Citation Index 1962 and 1964, American Documentation, (April): 87–96.

  • Tenopir, C. (1985), Full text database retrieval performance. Online Review, 9, pp. 149–164. (Reprinted in: TENOPIR, C. (1989), Issues in Online Database Searching, New York: Greenwood Press, 188 p.).

    Google Scholar 

  • Thorne, F. C. (1977), The citation index: another case a spurious validity, Journal of Clinical Psychology, 33: 1157–1161.

    Google Scholar 

  • White, H. D., McCain, K. W. (1989), Bibliometrics, Annual Review of Information Science and Technology, 24: 119–186.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Larsen, B. Exploiting citation overlaps for Information Retrieval: Generating a boomerang effect from the network of scientific papers. Scientometrics 54, 155–178 (2002). https://doi.org/10.1023/A:1016011326300

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1023/A:1016011326300

Keywords

Navigation