@gron

Fast, Contention-Free Combining Tree Barriers for Shared-Memory Multiprocessors

, and . International Journal of Parallel Programming, 22 (4): 449--481 (August 1994)

Abstract

Abstract  In a previous article,(1) Gupta and Hill introduced anadaptive combining tree algorithm for busy-wait barrier synchronization on shared-memory multiprocessors. The intent of the algorithm was to achieve a barrier in logarithmic time when processes arrive simultaneously, and in constant time after the last arrival when arrivaltimes are skewed. Afuzzy(2) version of the algorithm allows a process to perform useful work between the point at which it notifies other processes ofits arrival and the point at which it waits for all other processes to arrive. Unfortunately, adaptive combining tree barriersas originally devised perform a large amount of work at each node of the tree, including the acquisition and release of locks.They also perform an unbounded number of accesses to nonlocal locations, inducing large amounts of memory and interconnectcontention. We present new adaptive combining tree barriers that eliminate these problems. We compare the performance of thenew algorithms to that of other fast barriers on a 64-node BBN Butterfly 1 multiprocessor, a 35-node BBN TC2000, and a 126-nodeKSR 1. The results reveal scenarios in which our algorithms outperform all known alternatives, and suggest that both adaptationand the combination of fuzziness with tree-style synchronization will be of increasing importance on future generations ofshared-memory multiprocessors.

Description

SpringerLink - Journal Article

Links and resources

Tags

community

  • @gron
  • @dblp
@gron's tags highlighted