Skip to main content

Performance Debugging of Parallel Compression on Multicore Machines

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 6068))

Abstract

The power of contemporary processors is based more and more on multicore architectures. This kind of power is accessible only to parallel applications, which are able to provide work for each core. Creating a scalable parallel/multithreaded application efficiently using available cores is a difficult task, especially if I/O performance must be considered as well. We consider a multithreaded database loader with a compressing function. The performance of the loader is examined from a number of perspectives. Because compression is a computationally intensive task, parallel execution can potentially provide a big advantage in this case. A list of performance related areas we encountered is presented and discussed. We identify and verify tools allowing us to deal with specific performance areas. We find out, that only an orchestrated employment of several tools can bring the desired effect. The discussion provides a general procedure one can follow when improving the performance of multithreaded programs. Key performance areas specific to the database loader are pointed out. A special interest is directed towards performance variations observed when many parallel threads are active on a multicore CPU. A significant slowdown of computations is observed if many threads are computing simultaneously. The slowdown is related mainly to memory access and cache behavior and it is much larger for Core2 Quad system than a dual Xeon machine.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. AMD, http://multicore.amd.com/us-en/AMD-multi-core/multi-core-advantage.aspx

  2. Intel, http://www.intel.com/technology/architecture/downloads/quad-core-06.pdf

  3. Gepner, P., Fraser, D.L., Kowalik, M.F.: Performance evolution and power benefits of cluster system utilizing quad-core and dual-core intel xeon processors. In: Wyrzykowski, R., Dongarra, J., Karczewski, K., Wasniewski, J. (eds.) PPAM 2007. LNCS, vol. 4967, pp. 20–28. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  4. Tao, J., Kunze, M., Karl, W.: Evaluating the cache architecture of multicore processors. In: Proc. of the 16th Euromicro Conference on Parallel, Distributed and Network-Based Processing, PDP 2008, pp. 12–19. IEEE, Los Alamitos (2008)

    Chapter  Google Scholar 

  5. Papadopoulos, K., Stavrou, K., Trancoso, P.: HelperCoreDB: Exploiting multicore technology for databases. In: 16th International Conference on Parallel Architecture and Compilation Techniques PaCT 2007 (2007)

    Google Scholar 

  6. Hardavellas, N., Pandis, I., Johnson, R., Mancheril, N.G., Ailamaki, A., Falsafi, B.: Database servers on chip multiprocessors: Limitations and opportunities. In: Proceedings of the Biennial Conference on Innovative Data Systems Research (2007), http://www.cidrdb.org/

  7. Poess, M., Potapov, D.: Compression in oracle. In: VLDB 2003, pp. 937–947 (2003)

    Google Scholar 

  8. Holloway, L., Raman, V., Swart, G., DeWitt, D.J.: How to barter bits for chronons: Compression and bandwidth trade offs for database scans. In: SIGMOD Conference 2007, pp. 937–947 (2007)

    Google Scholar 

  9. Infobright: http://www.infobright.org , www.infobright.com

  10. OProfile - a system profiler for linux, http://oprofile.sourceforge.net

  11. Valgrind, http://valgrind.org/

  12. Slezak, D., Wroblewski, J., Eastwood, V., Synak, P.: Brighthouse: An analytic data warehouse for ad-hoc queries. In: Proceedings of the VLDB Endowment, vol. 1(2), pp. 1337–1345 (2008)

    Google Scholar 

  13. Transaction Processing Performance Council: http://www.tpc.org/tpch/

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Borkowski, J. (2010). Performance Debugging of Parallel Compression on Multicore Machines. In: Wyrzykowski, R., Dongarra, J., Karczewski, K., Wasniewski, J. (eds) Parallel Processing and Applied Mathematics. PPAM 2009. Lecture Notes in Computer Science, vol 6068. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-14403-5_10

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-14403-5_10

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-14402-8

  • Online ISBN: 978-3-642-14403-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics