Parallelism for high performance query processing

Winters, Vincent G.

doi:10.1007/BFb0032441

Vincent G. Winters¹

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 580))

Included in the following conference series:

International Conference on Extending Database Technology

141 Accesses

Abstract

We present a new method for a type of processing required in data base management systems. The method efficiently determin relevance of a given query value to each of many (target) sets of data. By using a new type of data structure, the method allows complete parallelism both for operations on different target sets and for those within each target set. The method never generates a false drop (i.e. indicates that an irrelevant target set is relevant to the query) and always identifies all relevant target sets. This eliminates the the overhead of reading each selected target set to ensure that the selection was not a false drop. A good deterministic bound on the system's performance is established.

With O(ln N _v+ln ln M) processors, the relevance of any target set can be completely determined in O(1) time against a query consisting of a subset of N _v vocabulary items. The space complexity is O(N _i(ln N _v+ln lnN _v)) bits, where N _i is the number of items relevant to target set i. As a concrete example, for a database using 64 byte keys, having a 100,000 word vocabulary (potentially valid keys) and in which a target set can have up to 64 distinct relevant elements, the relevance of a target set can be determined in 2 parallel operations using 6 processors. In other words, with 64K processors a database of one million target sets can be processed in 184 parallel operations. No probability distribution assumptions are necessary.

This article was processed using the LaT_EX macro package with LMAMULT style

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

S. R. Ahuja and C. S. Roberts, “An Associative/Parallel Processor for Partial Match Retrieval Using Superimposed Codes,” in Annual Symposium on Computer Architecture, 1980, pp. 218–227.
Google Scholar
J. Bentley, “A Spelling Checker,” Communications of the ACM,” Vol. 28, no. 5, pp. 456–462, 1985.
Google Scholar
C. Faloutsos, “Access Methods for Text,” Computing Surveys, vol. 17, no. 1, pp. 49–74, 1985.
Google Scholar
M. Fredman, J. Komlos and E. Szemeredi, “Storing a Sparse Table with O(1) Worst Case Access Time,” Journal of the ACM, vol. 31, no. 3, pp. 538–544, 1984.
Google Scholar
L. L. Gremillion, “Designing a Bloom Filter for Differential Access,” Communications of the ACM, vol. 25, no. 7, pp. 600–604, 1980.
Google Scholar
D. E. Knuth, 1973. The Art of Computer Programming, vol. S: Sorting and Searching. Reading, Mass.: Addison-Wesley, 1973.
Google Scholar
J. W. Lloyd, “Optimal Partial Match Retrieval,” BIT, vol. 20, pp. 406–413, 1980.
Google Scholar
P. E. McKenney, “High Speed Event Counting and Classification Using a Dictionary Hash Technique,” in Proceedings of the International Conference on Parallel Processing, pp. 218–227, 1989.
Google Scholar
H. N. Shapiro, Introduction to the Theory of Numbers. New York: John Wiley and Sons, 1983.
Google Scholar
D. Tsichritzis D. Christodoulakis and S. Christodoulakis, “Message Files,” ACM Trans. Office Inf. Systems, vol. 1, no. 1, pp. 88–98, 1983.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, University of Cincinnati, Cincinnati, Ohio, USA
Vincent G. Winters

Authors

Vincent G. Winters
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Alain Pirotte Claude Delobel Goerg Gottlob

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Winters, V.G. (1992). Parallelism for high performance query processing. In: Pirotte, A., Delobel, C., Gottlob, G. (eds) Advances in Database Technology — EDBT '92. EDBT 1992. Lecture Notes in Computer Science, vol 580. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0032441

Download citation

DOI: https://doi.org/10.1007/BFb0032441
Published: 26 June 2005
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-55270-3
Online ISBN: 978-3-540-47003-8
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics