Skip to main content

Comparison of Clustering Algorithms in a Single User Environment through OO7 Benchmark

  • Conference paper
East/West Database Workshop

Part of the book series: Workshops in Computing ((WORKSHOPS COMP.))

  • 34 Accesses

Abstract

In this paper, we present a new clustering algorithm called the High Fan Out algorithm and then give the performance comparison of the High Fan Out (HFO) algorithm, Kemighan-Lin based algorithms, and the Probability Ranking Partitioning algorithm for a persistent C++(C**) implementation in a single user environment where the global request stream follows a pattern most of the time. The global request stream is obtained through OO7 Benchmark. It is shown than HFO algorithm performs the best when object sizes are uniform and the cache sizes are relatively large. We conclude with a table that indicates the best clustering algorithm to be used depending on the characteristics of the database application at hand and the restrictions imposed by the computer system. It is also indicated that, the performance of a clustering algorithm can not be based solely on the communication cost, or on the amount of internal fragmentation. On the contrary both of the measures should be taken into account to predict the number of cache misses.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. V. Benzaken and C. Delobel, “Enhancing Performance in Persistent Object Store: Clustering Strategies in 02 ”, Proceedings of the fourth workshop on persistent object systems, pages 403–412, July 17, 1990.

    Google Scholar 

  2. M. J. Carey, D. J. DeWitt, and J. F. Naughton, “The OO7 Benchmark”, University of Wisconsin,Technical Report, April 1993.

    Google Scholar 

  3. M. J. Carey, D. J. DeWitt, and J. F. Naughton, “The OO7 Benchmark”, in Proc. of the ACM SIGMOD Int. Conf. on Management of Data, June 1993.

    Google Scholar 

  4. P. Drew, and R. King, “The Performance and Utility of the Cactis Implementation Algorithms”, Proceedings of the 16th VLDB Conference, pages 135–147, Australia 1990.

    Google Scholar 

  5. Evrendilek, C., Dogac, A., Gesli, T., “ A Preprocessor A pproach to Persistent C++”, Submitted for publication.

    Google Scholar 

  6. C. M. Fiduccia and R. M. Mattheyses, “A Linear-Time Heuristic for Improving Network Partitions”, 19th design Automation Conference, 1982.

    Google Scholar 

  7. J. G. Hughes, Object-Oriented Databases, Prentice Hall, 1991.

    Google Scholar 

  8. B. W. Kernighan and S. Lin, “An efficient Heuristic Procedure for Partitioning Graphs”, Bell System Technical Journal, 49 (2), pages 291–307, February 1970.

    MATH  Google Scholar 

  9. J. W. Stamos, “Static Grouping of Small Objects to Enhance Performance of a Paged Virtual Memory”, ACM Transactions on Computer Systems, 2 (2): 155–180, May 1984.

    Article  Google Scholar 

  10. M. M. Tsangaris and J. F. Naughton, “A stochastic Approach for Clustering in Object Stores”, In Proceedings of the SIGMOD International Conference on Management of Data, pages 12–21, Denver, Colorado, May 1991.

    Google Scholar 

  11. M. M. Tsangaris and J. F. Naughton, “On the Performance of Object Clustering Techniques”, In Proceedings of the SIGMOD International Conference on Management of Data, pages 144–153, San Diego, California, June 1992.

    Google Scholar 

  12. P. C. Yue and C. K. Wong, “On the Optimality of the Probability Ranking Scheme in Storage Applications”, JACM, 20 (4), October 1973.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1995 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Koc, K., Dogac, A., Evrendilek, C. (1995). Comparison of Clustering Algorithms in a Single User Environment through OO7 Benchmark. In: Eder, J., Kalinichenko, L.A. (eds) East/West Database Workshop. Workshops in Computing. Springer, London. https://doi.org/10.1007/978-1-4471-3577-7_6

Download citation

  • DOI: https://doi.org/10.1007/978-1-4471-3577-7_6

  • Publisher Name: Springer, London

  • Print ISBN: 978-3-540-19946-5

  • Online ISBN: 978-1-4471-3577-7

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics