Regular Article
Essential Misses and Data Traffic in Coherence Protocols

https://doi.org/10.1006/jpdc.1995.1112Get rights and content

Abstract

In this paper, we introduce a classification of misses and of components of the data traffic in shared-memory multiprocessors based on interprocessor communication. We consider protocols with invalidations, updates, and prefetches in systems with infinite and finite caches. We identify the set of essential misses and the essential traffic, i.e., the smallest set of misses and the smallest amount of traffic necessary for correct execution. The rest of the misses and of the data traffic is nonessential and could be ignored without affecting the correctness of program execution. To illustrate the classification of misses and traffic, we apply it to a set of parallel scientific programs and observe the overhead created by different hardware mechanisms when block sizes and cache sizes are varied.

References (0)

Cited by (13)

  • Transactional memory

    2010, Journal of Parallel and Distributed Computing
  • A comparative evaluation of hardware-only and software-only directory protocols in shared-memory multiprocessors

    2004, Journal of Systems Architecture
    Citation Excerpt :

    For the other applications the best block size, as indicated by our results, seems to be 64 (Radix) or 128 bytes (Water-Nsquared, Water-Spatial, and LU). Larger block sizes increase the execution time as a result of false sharing [10] for those applications. By looking at the execution times for the software-only directory protocol, similar effects as for the hardware-only protocol can be observed but they are more pronounced.

  • A metric for the temporal characterization of parallel programs

    1997, Journal of Parallel and Distributed Computing
  • Edge chasing delayed consistency: Pushing the limits of weak memory models

    2012, SPLASH 2012: RACES 2012 - Proceedings of the 2012 ACM Workshop on Relaxing Synchronization for Multicore and Manycore Scalability
  • Constraint graph analysis of multithreaded programs

    2004, Journal of Instruction-Level Parallelism
View all citing articles on Scopus
View full text