Skip to main content
Log in

Linguistically based functions in information retrieval: PADOK and the German Patent Information System

  • Published:
Computers and the Humanities Aims and scope Submit manuscript

Abstract

This paper reports on methodological considerations and the results of the Information Retrieval (IR) project PADOK I and II. PADOK has been carried out by the Linguistic Information Science Group of the University of Regensburg (LIR) since November 1984 and has been sponsored by the German Ministry for Research and Technology. The long term objective is to integrate artificial intelligence topics and the methods of information retrieval research without neglecting traditional IR methodology. In PADOK we consider a type of mass data IR system which indexes its documents rather shallowly (freetext or morphological components) and adds an intelligent information retrieval component to this kernel system. So far we have obtained, on the basis of two large-scale retrieval tests of the German Patent Information System results which show how the linguistically based functions of an indexing system contribute to its performance, and indicate what is the most reasonable basic content analysis program for a German Patent Information System. This paper focusses on the general principles and aims of PADOK I and PADOK R and on the statistical evaluation of the retrieval tests.

Christa Womser-Hacker has a Ph.D. in Linguistic Information Science. From 1985 until 1990 she was involved in several LIR-Projects concerning text processing, evaluation of the German Patent Information System, man-machine-interaction, intelligent interfaces for databases. Since May 1990 she has been an LIR staff member. She is interested in information retrieval, (statistical) evaluation methods of man-machine-interaction, intelligent interfaces. She has published Der PADOK-Retrieval-test (1989) and “Die statistische Auswertung des Retrievaltests” (1990).

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • ACM SIGIR. Proceedings of the ACM SIGIR. 11th International Conference on Research & Development in Information Retrieval. Grenoble, June 13–15 1988.

  • Badgett, T. “Tapping into On-line knowledge.” PC Magazine, 12, 5 (1987), 237–73.

    Google Scholar 

  • Bauer, G., Ch. Schneider and Ch. Womser-Hacker. Die Analyse der Texterschließung. PADOK-Arbeitsbericht 17. Regensburg, 1988.

  • Bauer, G. and Ch. Womser-Hacker. “Qualitative Analyse von Retrievalprozessen. Untersuchungen zur Trunkierung als Ersatz für morphologische Reduktionsalgorithmen.” In Deutsche Gesellschaft für Dokumentation e. V., Deutscher Dokumentartag 1988. 1989.

  • Belkin, N. J., C. L. Borgman, H. M. Brooks and T. Bylander. “Distributed Expert-based Information Systems: An Interdisciplinary Approach.” Information Processing & Management, 23, 5 (1987), 395–409.

    Google Scholar 

  • Brooks, H. M. ”Expert Systems and Intelligent Information Retrieval.“ Information Processing & Management, 23, 4 (1987), 367–82.

    Google Scholar 

  • Cleverdon, C. W. ”On the Inverse Relationship of Recall and Precision.“ Journal of Documentation, 28 (1972), 195–201.

    Google Scholar 

  • Cooper, W. S. “A Definition of Relevance for Information Retrieval.” Information Storage & Retrieval, 7 (1971), 19–37.

    Google Scholar 

  • Croft, W. B. “Approaches to Intelligent Information Retrieval.” Information Processing & Management, 23, 4 (1987), 249–54.

    Google Scholar 

  • Davies, R. “Outlines of the Emerging Paradigm in Cataloguing.” Information Processing & Management, 23, 2 (1987), 89–98.

    Google Scholar 

  • Deutsches Patentamt. Jahresbericht 1986. München, 1986.

  • Gräbnitz, V. “PASSAT: Programm zur Automatischen Selektion von Stichwörtern aus Texten.” In Inhaltserschließung von Massendaten. Zur Wirksamkeit informationslinguistischer Verfahren am Beispiel des Deutschen Patentinformationssystems. Ed. J. Krause. Hildesheim et al., 1987.

  • Hawkins, D. T. Applications of Artificial Intelligence (AI) and Expert Systems for Online Searching. Online, 1988.

  • Information Processing & Management. Special Issue on Artificial Intelligence for Information Retrieval, 23, 4 (1987).

  • Jacobs, P. S. and L. F. Rau. “Natural Language Techniques for Intelligent Information Retrieval.” In ACM SIGIR: Proceedings of the ACM SIGIR. 11th International Conference on Research & Development in Information Retrieval. Grenoble, June 13–15, 1988, pp. 85–99.

  • Krause, J. “Linguistic Components in (Office) Information Systems and a General Evaluation Strategy for Automatic Indexing.” Journal of Information & Optimization Sciences (JIOS), 5 (1984), 227–59.

    Google Scholar 

  • Krause, J. (ed.). Inhaltserschließung von Massendaten. Zur Wirksamkeit informationslinguistischer Verfahren am Beispiel des Deutschen Patentinformationssystems. Hildesheim et al., 1987a.

  • Krause, J. “Was leisten informationslinguistische Komponenten von Referenz-Retrievalsystemen far Massendaten? Von der Pragmatik im Computer zur Pragmatikanalyse als Designgrundlage.” In Deutsche Gesellschaft für Dokumentation e. V. Deutscher Dokumentartag 1986. München et al., 1987b, pp. 283–93

  • Krause, J. and Ch. Womser-Hacker. Das Deutsche Patentinformationssystem. Entwicklungstendenzen, Retrievaltests and Bewertungen. Köln et al., 1990.

  • Lancaster, F. W. Information Retrieval Systems: Characteristics, Testing and Evaluation. New York et al., 1979.

  • Saracevic, T. “Relevance: A Review of a Framework for the Thinking on the Notion in Information Science.” Journal of the ASIS, 26 (1975), 321–43.

    Google Scholar 

  • Schneider, Ch. “Analyse der Texterschließung.” In Inhaltserschließung von Massendaten. Zur Wirksamkeit informatinslinguistischer Verfahren am Beispiel des Deutschen Patentinformationssystems. Ed. J. Krause. Hildesheim et al., 1987.

  • Schneider, Ch. and Ch. Womser-Hacker. “Inhaltserschließungssysteme für Patenttexte. Test and Systemvergleich im Projekt PADOK.” In Deutsche Gesellschaft für Dokumentation e. V. Deutscher Dokumentartag 1986. München et al., 1987, pp. 251–69. Smeaton, A. F. and C. J. Van Rijsbergen. “Experiments on Incorporating Syntactic Processing of User Queries into a Document Retrieval Strategy.” In ACM SIGIR: Proceedings of the ACM SIGIR. 11th International Conference on Research & Development in Information Retrieval. Grenoble, June 13–15, 1988, pp. 31–51.

  • Sparck Jones, K. (ed.). Information Retrieval Experiment. London et al., 1981.

  • Spettel, G. and Ch. Womser-Hacker. “Statistische Auswertung des Retrievaltests auf der Grundlage von recall and precision.” In Inhaltserschließung von Massendaten. Zur Wirksamkeit informationslinguistischer Verfahren am Beispiel des Deutschen Patentinformationssystems. Ed. J. Krause. Hildesheim et al., 1987.

  • Van Rijsbergen, C. J. “Foundation of Evaluation.” Journal of Documentation, 30 (1974),365–73.

    Google Scholar 

  • Wahlster, W. and A. Kobsa. User Models in Dialog Systems. XTRA-Bericht Nr. 30, Saarbrücken, 1988.

  • Womser-Hacker, Ch. Der PADOK-Retrievaltest. ZurMethode und Verwendung statistischer Verfahren bei der Bewertung von Information-Retrieval-Systemen. Hildesheim et al., 1989.

  • Womser-Hacker, Ch. “Die statistische Auswertung des Retrievaltests.” In Das Deutsche Patentinformationssystem. Entwicklungstendenzen, Retrievaltests and Bewertungen. Ed. J. Krause and Ch. Womser-Hacker. Kö1n et al., 1990.

  • Zimmermann, H., E. Kroupa and G. Keil. CTX — Ein Verfahren zur computergesaitzten Texterschließung. BMFTForschungsbericht D 83-006, 1983.

Download references

Author information

Authors and Affiliations

Authors

Additional information

Jürgen Krause is professor of Linguistic Information Science at the University of Regensburg. He is a member of the editorial boards of the periodicals Computer and the Humanities and GLDV-Forum, and co-editor of Sprache and Computer. His research interests include office automation, artificial intelligence help system, information retrieval, evaluation of natural language systems. He is co-editor (with Christa Womser-Hacker) of Das Deutsche Patentinformationssystem, Entwicklungstendenzen, Retrievaltests and Bewertungen (1990) and co-editor of Computer Talk (1991).

Rights and permissions

Reprints and permissions

About this article

Cite this article

Krause, J., Womser-Hacker, C. Linguistically based functions in information retrieval: PADOK and the German Patent Information System. Comput Hum 25, 103–114 (1991). https://doi.org/10.1007/BF00124147

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF00124147

Key Words

Navigation