Test Case 2
The simulated system is a unit cell of litharge (α-PbO), 4 atoms in total, sampled with 108 k-points.
Table: Scaling of Test Case 2.

| Test Case 2 | Cores | Time (secs) | Speedup |
|---|---|---|---|
| VASP 5.2.2 | 144 | 2141.206 | 1 |
| KPAR=2 | 288 | 1097.981 | 1.950 |
| KPAR=3 | 432 | 766.196 | 2.794 |
| KPAR=4 | 576 | 607.69 | 3.524 |
| KPAR=6 | 864 | 410.018 | 5.222 |
| KPAR=9 | 1296 | 278.585 | 7.686 |
| KPAR=12 | 1728 | 215.757 | 9.924 |
| KPAR=18 | 2592 | 150.177 | 14.258 |
| KPAR=27 | 3888 | 105.563 | 20.284 |
| KPAR=36 | 5184 | 87.569 | 24.452 |
| KPAR=54 | 7776 | 65.332 | 32.774 |
| KPAR=108 | 15552 | 41.055 | 52.154 |
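The scaling table above shows that the k-point parallelized code scales rather satisfactorily up to 3888 cores, where it reaches a speedup of 20.3 on 27 times as many cores as the 144-core reference run (about 75% parallel efficiency). At 1728 cores it is about 7 times faster than the original code on the same number of cores (see the comparison table below).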
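The speedup and efficiency figures quoted here can be recomputed from the timings in the scaling table. The following is a minimal sketch rather than part of the original benchmark scripts: the names `RUNS` and `scaling` are illustrative, and parallel efficiency is assumed to mean speedup divided by the increase in core count relative to the 144-core run.

```python
# (KPAR, cores, wall time in seconds) copied from the scaling table.
# KPAR=1 stands for the original VASP 5.2.2 run on 144 cores.
RUNS = [
    (1, 144, 2141.206),
    (2, 288, 1097.981),
    (3, 432, 766.196),
    (4, 576, 607.69),
    (6, 864, 410.018),
    (9, 1296, 278.585),
    (12, 1728, 215.757),
    (18, 2592, 150.177),
    (27, 3888, 105.563),
    (36, 5184, 87.569),
    (54, 7776, 65.332),
    (108, 15552, 41.055),
]


def scaling(runs):
    """Return (cores, speedup, efficiency) for each run, relative to the first run."""
    _, base_cores, base_time = runs[0]
    rows = []
    for _, cores, time in runs:
        speedup = base_time / time
        # Efficiency: achieved speedup over the ideal (linear) speedup.
        efficiency = speedup / (cores / base_cores)
        rows.append((cores, speedup, efficiency))
    return rows


for cores, speedup, efficiency in scaling(RUNS):
    print(f"{cores:6d} cores  speedup {speedup:7.3f}  efficiency {efficiency:6.1%}")
```

Under this definition the efficiency falls gradually as KPAR grows, which is one way to read "scales rather satisfactorily to 3888 cores".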
Table: The optimal NPAR for the original code is 18, 18, 18, 36, 36, 36 and 36 for 144, 288, 432, 576, 864, 1296 and 1728 cores respectively. For the k-point parallelized code the optimal NPAR is 18 for any number of cores, provided one k-group consists of 144 cores.

| Test Case 2 | Cores | Time (secs) | Speedup |
|---|---|---|---|
| VASP 5.2.2 | 288 | 1530.244 | 1 |
| KPAR=2 | 288 | 1097.981 | 1.394 |
| VASP 5.2.2 | 432 | 1348.436 | 1 |
| KPAR=3 | 432 | 766.196 | 1.76 |
| VASP 5.2.2 | 576 | 1473.4 | 1 |
| KPAR=4 | 576 | 607.69 | 2.425 |
| VASP 5.2.2 | 864 | 1562.76 | 1 |
| KPAR=6 | 864 | 410.018 | 3.811 |
| VASP 5.2.2 | 1296 | 1899.499 | 1 |
| KPAR=9 | 1296 | 278.585 | 6.818 |
| VASP 5.2.2 | 1728 | 1532.32 | 1 |
| KPAR=12 | 1728 | 215.757 | 7.1021 |
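The core counts in both tables follow from keeping each k-group at 144 cores, so the total is 144 times KPAR. The sketch below lists the resulting layouts. It assumes, which the text does not state explicitly, that KPAR is chosen to divide the 108 k-points evenly. The KPAR values benchmarked above are consistent with that assumption. The function name `kgroup_layouts` is illustrative only.

```python
N_KPOINTS = 108        # k-points in Test Case 2 (from the text)
CORES_PER_GROUP = 144  # cores in one k-group (from the table caption)


def kgroup_layouts(n_kpoints, cores_per_group):
    """List (KPAR, total cores, k-points per group) for every KPAR
    that divides the number of k-points evenly."""
    return [
        (kpar, kpar * cores_per_group, n_kpoints // kpar)
        for kpar in range(1, n_kpoints + 1)
        if n_kpoints % kpar == 0
    ]


for kpar, cores, per_group in kgroup_layouts(N_KPOINTS, CORES_PER_GROUP):
    print(f"KPAR={kpar:3d}  cores={cores:5d}  k-points per group={per_group:3d}")
```

With KPAR=108 each k-group handles a single k-point, so this is the largest layout the scheme allows for this test case.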
Figure 2: Speedup for Test Case 2 (where Speedup is taken to be 1 for 144 cores).
![Speedup for Test Case 2](img7.png)
Asimina Maniopoulou 2011-07-09