
Band distribution

The initial implementation of the band-parallelism gave each node a contiguous block of bands. Whilst this allowed greater optimisation on each node, the load balancing was problematic. The two greatest problems for load-balancing were:

There is a further issue with a distribution by contiguous blocks of bands that is not related to load-balancing: traditional optimisation methods are not band-local. This means that the nodes cannot optimise their own local set of bands independently of the other nodes; in particular, higher bands (i.e. more energetic bands) should not be optimised until the lower bands have converged.

For all of these reasons, the decision was made to switch the band-distribution to a round-robin scheme, whereby each of $n$ nodes gets every $n$th band. This improves the load-balancing greatly, and also allows the existing optimisation algorithm to be used with few changes. This distribution is not 'hard-wired', and it is trivial to change to a different scheme.
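
The difference between the two distributions can be illustrated with a minimal sketch. This is not the project's code: the function names `block_bands` and `round_robin_bands`, the zero-based band and node numbering, and the way leftover bands are shared out in the block scheme are all assumptions made purely for illustration.

```python
def block_bands(n_bands, n_nodes, rank):
    """Contiguous-block scheme: node `rank` owns one consecutive run of bands.

    Hypothetical helper; any remainder bands are given to the lowest ranks.
    """
    base, extra = divmod(n_bands, n_nodes)
    start = rank * base + min(rank, extra)
    count = base + (1 if rank < extra else 0)
    return list(range(start, start + count))


def round_robin_bands(n_bands, n_nodes, rank):
    """Round-robin scheme: node `rank` owns every n_nodes-th band.

    Hypothetical helper; bands and nodes are numbered from zero.
    """
    return list(range(rank, n_bands, n_nodes))


if __name__ == "__main__":
    n_bands, n_nodes = 8, 3
    for rank in range(n_nodes):
        print(rank,
              block_bands(n_bands, n_nodes, rank),
              round_robin_bands(n_bands, n_nodes, rank))
```

With the round-robin scheme every node holds a mixture of low and high bands, so the lowest bands are spread across all nodes rather than being concentrated on the first one.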

The optimisation scheme proposed in Work Package 3 (see chapter 6) of this project is band-local, and the detail of the band-distribution may be revisited in that stage of the project.


Sarfraz A Nadeem 2008-09-01