Skip to main content

A decision theory approach to optimal automatic indexing

  • Conference paper
  • First Online:
Research and Development in Information Retrieval (SIGIR 1982)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 146))

Abstract

A decision theory approach to the development of retrieval systems is presented. Within this framework, optimal indexing is defined. Both the searching and the indexing problem turn out to have a common structure which is described using the concept of a ‘recognition problem’. A knowledge based approach to an approximately optimal indexing, strictly related to the information need of the user is outlined. The theory and the used approximation methods are illustrated by a brief description of the WAI/AIR projects and some of their results.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Literatur

  1. ROBERTSON, S.E., VAN RIJSBERGEN, C.V., HARPER, D.J. Probabilistic models in indexing and searching. In: (eds. ODDY, et al.) Information retrieval research. Butterworths London et al., 1981 35–56.

    Google Scholar 

  2. ROBERTSON, S.E., Progress in documentation. Theories and models in information retrieval. J. of Documentation 33, 1977, 126–48.

    Google Scholar 

  3. ROBERTSON, S.E., SPARCK JONES, K. Relevance Weighting of search terms. J.ASIS 27, 1976, 129–46.

    Google Scholar 

  4. BOOKSTEIN, A., SWANSON, D.R. Probabilistic models for automatic indexing. J.ASIS 25, 1974, 312–18.

    Google Scholar 

  5. HARTER, S.P. A probabilistic approach to automatic keyword indexing. Part 1 J.ASIS 26, 1975, 45–50; Part 2 J.ASIS 26, 1975, 280–9.

    Google Scholar 

  6. SALTON, G. A theory of indexing. Regional Conference Series in Applied Mathematics. Philadelphia: SIAM, 1975.

    Google Scholar 

  7. COOPER, W.S. Indexing documents by gedanken experimentations. J.ASIS 29, 1978, 107–119.

    Google Scholar 

  8. MARON, M.E., KUHNS, J.L. On relevance, probabilistic indexing, and information retrieval. J.ACM 7, 1960, 216–44.

    Google Scholar 

  9. COOPER, W.P., MARON, M.E. Foundations of probabilistic and utility-theoretic indexing. J.ACM 25, 1978, 67–80.

    Google Scholar 

  10. VAN RIJSBERGEN, C.J. A theoretical basis for the use of co-occurrence data in information retrieval. J.of Documentation 33, 1977, 106–19.

    Google Scholar 

  11. Bookstein, A., KRAFT, D. Operations research applied to document indexing and retrieval decisions. J.ACM 24, 1977, 418–27.

    Google Scholar 

  12. SPARCK JONES, K. A statistical interpretation of term specifity and its application in retrieval. J. of Documentation 28, 1972, 11–21.

    Google Scholar 

  13. YU, C.T., SALTON, G. Precision weighting — an effective automatic indexing method. J.ACM, 23, 1976, 76–88.

    Google Scholar 

  14. KNORZ, G. Recognizing abstract objects — a decision theory approach within natural language processing. To appear in the proceedings of Coling 82, North Holland Publishing House.

    Google Scholar 

  15. COOPER, W.S. Expected search length: a single measure of retrieval effectiveness based on the weak ordering action of retrieval systems. Amer.Doc. 9, 1968, 30–41.

    Google Scholar 

  16. BOOKSTEIN, A. Relevance. J.ASIS 30, 1979, 269–73.

    Google Scholar 

  17. ROBERTSON, S.E., BELKIN, N.J. Ranking in principle. J. of Documentation 34,2, 1978, 93–100.

    Google Scholar 

  18. ROBERTSON, S.E. The probabilistic character of relevance. Information Processing and Management 13, 1977, 247–51.

    Google Scholar 

  19. RADECKI, T. A new approach to the problem of information system effectiveness evaluation. Information Processing and Management, 12, 1976, 319–26.

    Google Scholar 

  20. MARON, M.E. On indexing, retieval, and the meaning of about. J.ASIS 28, 1977, 28–43.

    Google Scholar 

  21. SCHÜRMANN, J. Polynomklassifikatoren für die Zeichenerkennung — Ansatz, Adaption, Anwendung-. Oldenbourg Verlag München, 1977.

    Google Scholar 

  22. KNORZ, G. Automatic indexing as an application of pattern recognition methods to document-descriptor-relationship. Angewandte Informatik 1,1982, 1–10.

    Google Scholar 

  23. SWETS, J.A. Information retrieval systems. Science 141, 1963, 245–50.

    Google Scholar 

  24. SWETS, J.A. Effectiveness of information retrieval methods. Amer. Doc. 20, 1969, 72–89.

    Google Scholar 

  25. LUSTIG, G. Über die Entwicklung eines automatischen Indexierungs-systems. In: (ed. KRALLMANN, D.) Dialogsysteme und Textverarbeitung. LDV-Fitting, Essen, 1880, 1–16.

    Google Scholar 

  26. LUSTIG, G. Das Projekt WAI: Wörterbuchentwicklung für automatisches Indexing. In: Deutscher Dokumentartag 1981, K.G. SAUR Verlag München et al., 1982, 584–89.

    Google Scholar 

  27. KNORZ, G. Softwaresystem ALIBABA 3.0 Adaptives lernstichproben-orientiertes Indexierungssystem, basierend auf Beschreibungen abstrakter Objekte. (DV II 82-1), Techn. Hochsch. Darmstadt, FB 20, FG DVS II; 1982.

    Google Scholar 

  28. FIELD, B.J. Towards automatic indexing: Automatic assignement of controlled-language indexing and classification from free indexing. J. of Documentation 31, 1975, 246–65.

    Google Scholar 

  29. KNORZ, G. Mustererkennung im Bereich der inhaltlichen Erschliessung von Texten. In: Modelle und Strukturen, DAGM-Symposium Hamburg (ed. RADIG, B.), Springer Verlag Berlin Heidelberg New York, 1981, 31–39.

    Google Scholar 

  30. SCHANK, R.C., LEBOWITZ, M., BIRNBAUM, L. An integrated understander. AJCL 6, 1980, 13–30.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Gerard Salton Hans-Jochen Schneider

Rights and permissions

Reprints and permissions

Copyright information

© 1983 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Knorz, G. (1983). A decision theory approach to optimal automatic indexing. In: Salton, G., Schneider, HJ. (eds) Research and Development in Information Retrieval. SIGIR 1982. Lecture Notes in Computer Science, vol 146. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0036346

Download citation

  • DOI: https://doi.org/10.1007/BFb0036346

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-11978-4

  • Online ISBN: 978-3-540-39440-2

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics