Abstract
One way to save memory space in inverted file organizations is to map each address list to a bit-vector and compress it by a suitable compression technique. Eight such techniques are discussed for nonuniformly distributed bit-vectors in this study. Occurrences of clusters with high 1-bit densities are simulated using a n-state bit-vector generating process. Experiments with a real life file are also reported.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
L.R. Bahl, H. Kobayashi: Image Data Compression by Predictive Coding II, IBM J. Res. Develop., Vol 18, pp 172–179, (1974).
S.D. Bradley: Optimizing a Scheme for Run Length Encoding, Proc. IEEE, Vol 57, pp 108–109, (1969).
A.F. Cardenas: Analysis and Performance of Inverted Data Base Management Systems, Comm. ACM, Vol 18, pp 253–263, (1975).
S.W. Golomb: Run-Length Encodings, IEEE Trans. Inf. Theory, IT-12, pp 399–401, (1966).
R.A. Howard: Dynamic Probabilistic Systems, Vol I: Markov Models, John Wiley & Sons, Inc., New York, (1971).
M. Jakobsson, O. Nevalainen: On the Compression of Inverted Files, Rept. B 14, Dept. of Comp. Sci., Univ. of Turku, Finland, (1977).
D.R. King: The Binary Vector as a Basis of an Inverted Index File, J. Libr. Autom., Vol 7, pp 307–315, (1974).
D. Lefkowitz: File Structures for On-line Systems, Spartan Books, New York, (1969).
E.J. Schuegraf: Compression of Large Inverted Files with Hyperbolic Term Distribution, Inf. Proc. Manag., Vol 12, pp 377–384, (1976).
L.H. Thiel, H.S. Heaps: Program Design for Retrospective Searches on Large Data Bases, Inf. Stor. Retr., Vol 8, No 1, (1972).
H. Wedekind, T. Härder: Datenbanksysteme II, B.I.-Wissenschaftswerlag Mannheim/Wien/Zürich, (1976).
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1978 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Nevalainen, O., Jakobsson, M., Berg, R. (1978). Compression of clustered inverted files. In: Winkowski, J. (eds) Mathematical Foundations of Computer Science 1978. MFCS 1978. Lecture Notes in Computer Science, vol 64. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-08921-7_87
Download citation
DOI: https://doi.org/10.1007/3-540-08921-7_87
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-08921-6
Online ISBN: 978-3-540-35757-5
eBook Packages: Springer Book Archive