Automatic indexing based on Bayesian inference networks

SIGIR '93: Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval, July 1993, Pages 22–35. https://doi.org/10.1145/160688.160691

Published: 01 July 1993

ABSTRACT

In this paper, a Bayesian inference network model for automatic indexing with index terms (descriptors) from a prescribed vocabulary is presented. It requires an indexing dictionary with rules mapping terms of the respective subject field onto descriptors, as well as inverted lists for the terms occurring in a set of documents of the subject field and for the descriptors manually assigned to these documents. The indexing dictionary can be derived automatically from a set of manually indexed documents. An application of the network model is described, followed by an indexing example and some experimental results on the indexing performance of the network model.
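As a rough illustration of how an indexing dictionary might be derived from manually indexed documents, the sketch below builds inverted lists for terms and descriptors and scores each term-to-descriptor rule by a simple relative frequency. This is an assumption made for illustration only: the abstract does not state the estimation formula or the network structure, and the function names, the `min_support` parameter, and the toy documents are all hypothetical.

```python
from collections import defaultdict


def build_inverted_lists(docs):
    """Build inverted lists from {doc_id: (terms, descriptors)}.

    Returns two mappings: term -> set of doc ids containing the term,
    and descriptor -> set of doc ids manually indexed with it.
    """
    term_index = defaultdict(set)
    desc_index = defaultdict(set)
    for doc_id, (terms, descriptors) in docs.items():
        for term in terms:
            term_index[term].add(doc_id)
        for desc in descriptors:
            desc_index[desc].add(doc_id)
    return term_index, desc_index


def derive_dictionary(term_index, desc_index, min_support=1):
    """Score each (term, descriptor) rule by relative frequency.

    The weight is the share of documents containing the term that were
    also assigned the descriptor. This estimator is an illustrative
    assumption, not the formula used in the paper.
    """
    rules = {}
    for term, term_docs in term_index.items():
        for desc, desc_docs in desc_index.items():
            both = len(term_docs & desc_docs)
            if both >= min_support:
                rules[(term, desc)] = both / len(term_docs)
    return rules


# Hypothetical toy collection: doc id -> (terms, manually assigned descriptors)
docs = {
    1: ({"laser", "beam"}, {"LASERS"}),
    2: ({"laser", "plasma"}, {"LASERS", "PLASMA"}),
    3: ({"plasma"}, {"PLASMA"}),
}
term_index, desc_index = build_inverted_lists(docs)
rules = derive_dictionary(term_index, desc_index)
```

In a network model, weights like these could serve as the strengths of links from term nodes to descriptor nodes; how the paper actually combines evidence from several terms is not described in the abstract.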

Published in
SIGIR '93: Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval
July 1993
361 pages
ISBN: 0897916050
DOI: 10.1145/160688
Editors: Robert Korfhage, Edie Rasmussen, Peter Willett
Copyright © 1993 ACM
Publisher
Association for Computing Machinery
New York, NY, United States