Skip to main content
Log in

Application of principal component analysis to multi-disk concurrent accessing

  • Part I Computer Science
  • Published:
BIT Numerical Mathematics Aims and scope Submit manuscript

Abstract

This paper is concerned with the allocation of multi-attribute records on several disks so as to achieve high degree of concurrency of disk access when responding to partial match queries.

An algorithm to distribute a set of multi-attribute records onto different disks is presented. Since our allocation method will use the principal component analysis, this concept is first introduced. We then use it to generate a set of real numbers which are the projections on the first principal component direction and can be viewed as hashing addresses.

Then we propose an algorithm based upon these hashing addresses to allocate multi-attribute records onto different disks. Some experimental results show that our method can indeed be used to solve the multi-disk data allocation problem for concurrent accessing.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

References

  1. M. Y. Chan,Multidisk file design: An analysis of folding buckets to disks, BIT, Vol. 24, 1984, pp. 262–268.

    Google Scholar 

  2. M. Y. Chan,A note on redundant disk modulo allocation, Information Processing Letters, Vol. 20, April 1985, pp. 121–123.

    Google Scholar 

  3. C. C. Chang and C. Y. Chen,Lower bounds of using disk modulo allocation method to allocate cartesian product files in a two-disk system, Proc. ICS 1986 Conference, Tainan, Taiwan, Dec. 1986, pp. 770–774.

  4. C. C. Chang and C. Y. Chen,Performance analysis of the generalized disk modulo allocation method for multiple key hashing files on multi-disk systems, to appear in the Computer Journal, United Kingdom.

  5. C. C. Chang and C. Y. Chen,Performance of two-disk partition data allocations, BIT, Vol. 27, 1987, pp. 306–314.

    Google Scholar 

  6. C. C. Chang, M. D. Hsiao, and C. H. Lin,Algorithms to allocate a file for concurrent disk accessing, Proceedings of the 6th Advanced Database System Symposium, Tokyo, Japan, August 1986, pp. 201–205.

  7. C. C. Chang and L. S. Liang,On strict optimality property of allocating binary Cartesian product files on multiple disk systems, Proceedings of the International Conference on Foundation of Data Organization, Kyoto, Japan, May 1985, pp. 104–112.

  8. C. C. Chang and Y. L. Lu,The complexity of multi-disk data allocation problem, Proc. NCS 1985 Conference, Kaoshung, Taiwan, Dec. 1985, pp. 468–471.

  9. C. C. Chang and J. J. Shen,tPerformance analysis of the disk modulo allocation method for concurrent accessing on multiple disk systems, Journal of the Chinese Institute of Engineers, Vol. 8, No. 3, pp. 271–283.

  10. C. C. Chang and J. J. Shen,Consecutive retrieval organization as a file allocation scheme on multiple disk systems, the Proceedings of the International Conference on Foundations of Data Organization, Kyoto, Japan, May 1985, pp. 74–80.

  11. Y. T. Chien and K. S. Fu,On the generalized Karhunen-Loeve expansion, IEEE Transactions on Information Theory, Vol. IT-13, 1967, pp. 518–520.

    Google Scholar 

  12. H. C. Du,Disk allocation methods for binary Cartesian product files, BIT, Vol. 26, 1986, pp. 138–147.

    Google Scholar 

  13. H. C. Du and J. S. Sobolewski,Disk allocation for Cartesian product files on multiple disk systems, ACM Trans. Database Systems, Vol. 7, March 1982, pp. 82–101.

    Google Scholar 

  14. M. T. Fang, R. C. T. Lee, and C. C. Chang,The idea of de-clustering and its applications, 12th International Conference on Very Large Data Bases, Kyoto, Japan, August 1986, pp. 181–188.

  15. K. S. Fu,Sequential Methods in Pattern Recognition and Machine Learning, Academic Press, Reading, New York, 1968.

    Google Scholar 

  16. S. P. Ghosh,Data Base Organization for Data Management, Academic Press, Reading, New York, 1977.

    Google Scholar 

  17. R. C. T. Lee, Y. H. Chin, and S. C. Chang,Application of principal component analysis to multikey searching, IEEE Transactions on Software Engineering, Vol. SE-2, No. 3, September 1976, pp. 185–193.

    Google Scholar 

  18. T. S. Yuen and H. C. Du,Dynamic file structure for partial match retrieval based on overflow bucket sharing, IEEE Transactions on Software Engineering, Vol. SE-12, No. 8, August 1986, pp. 801–810.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Chang, C.C. Application of principal component analysis to multi-disk concurrent accessing. BIT 28, 205–214 (1988). https://doi.org/10.1007/BF01934086

Download citation

  • Received:

  • Revised:

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF01934086

C.R. categories

Navigation