Abstract
The recent advent of parallel machines with tens of thousands of processors is presenting new challenges for obtaining scalability. A particular challenge for large-scale scientific software is determining the inter-processor communications required by the computation when a global description of the data is unavailable or too costly to store. We present a type of rendezvous algorithm that determines communication partners in a scalable manner by assuming the global distribution of the data. We analyze the algorithm theoretically and demonstrate its scaling properties on up to 32,768 processors of BlueGene/L in the context of determining communication patterns for a matrix-vector multiply in the hypre software library. Our algorithm is very general and is applicable to a variety of situations in parallel computing.
| Original language | English |
|---|---|
| Pages (from-to) | 394-414 |
| Number of pages | 21 |
| Journal | Parallel Computing |
| Volume | 32 |
| Issue number | 5-6 |
| DOIs | |
| State | Published - Jun 2006 |
Keywords
- Distributed directory
- Global partition
- Processor inter-communication
- Rendezvous algorithm
- Scalability