Abstract
It is often undesirable or impossible to provide redundant indices for all domains of a file existing on a secondary storage device. The problem considered in this paper is the selection of a limited number of indices which best facilitate interaction with a file. A probabilistic model of interaction activity encompassing queries and updates is presented, and a parametric description of the storage medium is assumed. Significant results which are independent of many file and storage characteristics are found concerning the best choice of indices in two cases. The first is the choice of domains to include in a partial inversion. Here it is desired to find the best possible subset of domains for which to provide indices. The second case concerns the choice of combined indices. In this situation the best way of grouping domains is sought in order to provide one index for each group.
Similar content being viewed by others
References
E. F. Codd, “A relational model of data for large shared data banks,”CACM 13(6) (1970).
R. Blier and A. Vorhaus, “File organization in the SDC time shared data management (TDMS) system,” inProc. 1968 IFIP Congress.
V. Y. Lum, “Multiattribute retrieval with combined indices,”CACM 13(11) (1970).
F. Palermo, “A quantitative approach to the selection of secondary indexes,”Formatted File Organization Techniques (IBM Research, San Jose, March 1970).
J. Mullin, “Retrieval-update speed tradeoffs using combined indices,”CACM 14(12) (1971).
V. Y. Lum and H. Ling, “An optimization problem on the selection of secondary keys,” inProc. ACM Nat. Conf., 1971.
E. Wong and T. Chiang, “Canonical structure in attribute based file organization,”CACM 14(9) (1971).
CODASYL Systems Committee, “Feature analysis of generalized data base management systems,” May 1971.
E. F. Codd, “A data base sublanguage founded on the relational calculus,” inProc. 1971 ACM-SIGFIDET Workshop on Data Description, Access and Control, San Diego.
Author information
Authors and Affiliations
Additional information
Research sponsored in part by the General Foods Corporation, New York, New York.
Rights and permissions
About this article
Cite this article
Stonebraker, M. The choice of partial inversions and combined indices. International Journal of Computer and Information Sciences 3, 167–188 (1974). https://doi.org/10.1007/BF00976642
Received:
Revised:
Issue Date:
DOI: https://doi.org/10.1007/BF00976642