Abstract
A decision theory approach to the development of retrieval systems is presented. Within this framework, optimal indexing is defined. The searching problem and the indexing problem turn out to share a common structure, which is described using the concept of a ‘recognition problem’. A knowledge-based approach to approximately optimal indexing, strictly related to the information need of the user, is outlined. The theory and the approximation methods used are illustrated by a brief description of the WAI/AIR projects and some of their results.
Copyright information
© 1983 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Knorz, G. (1983). A decision theory approach to optimal automatic indexing. In: Salton, G., Schneider, HJ. (eds) Research and Development in Information Retrieval. SIGIR 1982. Lecture Notes in Computer Science, vol 146. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0036346
DOI: https://doi.org/10.1007/BFb0036346
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-11978-4
Online ISBN: 978-3-540-39440-2
eBook Packages: Springer Book Archive