Skip to main content
Log in

The order of merging operations for queries in inverted file systems

  • Published:
International Journal of Computer & Information Sciences Aims and scope Submit manuscript

Abstract

In inverted file systems, queries can be written as Boolean expressions of inverted attributes. In response to a query, the system accesses address lists associated with the attributes in the query, merges them, and selects those records that satisfy the search logic. In this paper we consider the minimization of the CPU time needed for the merging operation. The time can possibly be reduced by taking address lists that occur in several product terms as a common factor of these products. This means that the union operation must be performed before the intersection operation. We present formulas which can be used to decide whether the above method is advantageous. The time can also be reduced by choosing the order of intersection operations so that it takes into consideration the occurrences of the address lists in the products and the lengths of the address lists. For choosing the order of intersection operations we give a heuristic algorithm that minimizes the total time needed for intersections.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

References

  1. D. H. Anderson and P. B. Berra, “Minimum cost selection of secondary indexes for formated files,”ACM Trans. Database Systems 2(1): 68–90 (1977).

    Google Scholar 

  2. A. F. Cárdenas, “Evaluation and selection of file organization—A model and system,”Comm. ACM 16(9): 540–548 (1973).

    Google Scholar 

  3. A. F. Cárdenas, “Analysis and performance of inverted database structures,”Comm. ACM 18(5): 253–263 (1975).

    Google Scholar 

  4. D. A. Huffman, “A method for the construction of minimum redundancy codes,”Proc. IRE 40:1098–1101 (1952).

    Google Scholar 

  5. F. K. Hwang and S. Lin, “A simple algorithm for merging two disjoint linearly ordered sets,”Siam J. Comp. 1(1): 31–39 (1972).

    Google Scholar 

  6. E. E. Knuth,The Art of Computer Programming (Addison-Wesley, Reading, Massachusetts, 1975).

    Google Scholar 

  7. D. Lefkovitz,File Structures for On-Line Systems (Spartan Books, New York, 1969).

    Google Scholar 

  8. J. W. S. Liu, “Algorithms for parsing search queries in systems with inverted file organization,”ACM Trans. Database Systems 1(4): 299–316 (1976).

    Google Scholar 

  9. H. Wedekind,Datenbanksysteme II (B. I.-Wissenschaftsverlag, Zürich, 1976).

    Google Scholar 

  10. S. B. Yao, “An attribute based model for database access cost analysis,”ACM Trans. Database Systems 2(1): 45–67 (1977).

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Putkonen, A. The order of merging operations for queries in inverted file systems. International Journal of Computer and Information Sciences 9, 351–369 (1980). https://doi.org/10.1007/BF00978519

Download citation

  • Received:

  • Revised:

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF00978519

Key words

Navigation