Using device 2 (rank 6, local rank 6, local size 9) : Tesla P100-PCIE-16GB
Using device 2 (rank 7, local rank 7, local size 9) : Tesla P100-PCIE-16GB
Using device 2 (rank 8, local rank 8, local size 9) : Tesla P100-PCIE-16GB
Using device 1 (rank 4, local rank 4, local size 9) : Quadro GP100
Using device 1 (rank 5, local rank 5, local size 9) : Quadro GP100
Using device 0 (rank 1, local rank 1, local size 9) : Tesla P100-PCIE-16GB
Using device 1 (rank 3, local rank 3, local size 9) : Quadro GP100
Using device 0 (rank 0, local rank 0, local size 9) : Tesla P100-PCIE-16GB
Using device 0 (rank 2, local rank 2, local size 9) : Tesla P100-PCIE-16GB
running on 9 total cores
distrk: each k-point on 9 cores, 1 groups
distr: one band on 1 cores, 9 groups

*******************************************************************************
 You are running the GPU port of VASP! When publishing results obtained with
 this version, please cite:
  - M. Hacene et al., http://dx.doi.org/10.1002/jcc.23096
  - M. Hutchinson and M. Widom, http://dx.doi.org/10.1016/j.cpc.2012.02.017
 in addition to the usual required citations (see manual).
 GPU developers: A. Anciaux-Sedrakian, C. Angerer, and M. Hutchinson.
*******************************************************************************

 -----------------------------------------------------------------------------
|                                                                             |
|           W    W    AA    RRRRR   N    N  II  N    N   GGGG   !!!           |
|           W    W   A  A   R    R  NN   N  II  NN   N  G    G  !!!           |
|           W    W  A    A  R    R  N N  N  II  N N  N  G       !!!           |
|           W WW W  AAAAAA  RRRRR   N  N N  II  N  N N  G  GGG   !            |
|           WW  WW  A    A  R   R   N   NN  II  N   NN  G    G                |
|           W    W  A    A  R    R  N    N  II  N    N   GGGG   !!!           |
|                                                                             |
|     Please note that VASP has recently been ported to GPU by means of       |
|     OpenACC. You are running the CUDA-C GPU-port of VASP, which is          |
|     deprecated and no longer actively developed, maintained, or             |
|     supported. In the near future, the CUDA-C GPU-port of VASP will be      |
|     dropped completely. We encourage you to switch to the OpenACC           |
|     GPU-port of VASP as soon as possible.                                   |
|                                                                             |
 -----------------------------------------------------------------------------

vasp.6.2.1 16May21 (build Apr 11 2022 11:03:26) complex
MD_VERSION_INFO: Compiled 2022-04-11T18:25:55-UTC in devlin.sd.materialsdesign.com:/home/medea2/data/build/vasp6.2.1/16685/x86_64/src/src/build/gpu from svn 16685
This VASP executable licensed from Materials Design, Inc.
POSCAR found type information on POSCAR C Ru
POSCAR found : 2 types and 217 ions
NWRITE = 1
NWRITE = 1
NWRITE = 1
NWRITE = 1
NWRITE = 1
NWRITE = 1
NWRITE = 1
NWRITE = 1
NWRITE = 1

 -----------------------------------------------------------------------------
|                                                                             |
|           W    W    AA    RRRRR   N    N  II  N    N   GGGG   !!!           |
|           W    W   A  A   R    R  NN   N  II  NN   N  G    G  !!!           |
|           W    W  A    A  R    R  N N  N  II  N N  N  G       !!!           |
|           W WW W  AAAAAA  RRRRR   N  N N  II  N  N N  G  GGG   !            |
|           WW  WW  A    A  R   R   N   NN  II  N   NN  G    G                |
|           W    W  A    A  R    R  N    N  II  N    N   GGGG   !!!           |
|                                                                             |
|     For optimal performance we recommend to set                             |
|       NCORE = 2 up to number-of-cores-per-socket                            |
|     NCORE specifies how many cores store one orbital (NPAR=cpu/NCORE).      |
|     This setting can greatly improve the performance of VASP for DFT.       |
|     The default, NCORE=1 might be grossly inefficient on modern             |
|     multi-core architectures or massively parallel machines. Do your        |
|     own testing! More info at https://www.vasp.at/wiki/index.php/NCORE      |
|     Unfortunately you need to use the default for GW and RPA                |
|     calculations (for HF NCORE is supported but not extensively tested      |
|     yet).                                                                   |
|                                                                             |
 -----------------------------------------------------------------------------

LDA part: xc-table for Pade appr. of Perdew
WARNING: The GPU port of VASP has been extensively
tested for: ALGO=Normal, Fast, and VeryFast.
Other algorithms may produce incorrect results or
yield suboptimal performance. Handle with care!
POSCAR, INCAR and KPOINTS ok, starting setup
creating 32 CUDA streams...
creating 32 CUDA streams...
creating 32 CUDA streams...
creating 32 CUDA streams...
creating 32 CUDA streams...
creating 32 CUDA streams...
creating 32 CUDA streams...
creating 32 CUDA streams...
creating 32 CUDA streams...
creating 32 CUFFT plans with grid size 108 x 150 x 84...
creating 32 CUFFT plans with grid size 108 x 150 x 84...
creating 32 CUFFT plans with grid size 108 x 150 x 84...
creating 32 CUFFT plans with grid size 108 x 150 x 84...
creating 32 CUFFT plans with grid size 108 x 150 x 84...
creating 32 CUFFT plans with grid size 108 x 150 x 84...
creating 32 CUFFT plans with grid size 108 x 150 x 84...
creating 32 CUFFT plans with grid size 108 x 150 x 84...
creating 32 CUFFT plans with grid size 108 x 150 x 84...
FFT: planning ...
WAVECAR not read
entering main loop
       N       E                     dE             d eps       ncg     rms          rms(c)
DAV:   1     0.780821537900E+04    0.78082E+04   -0.34058E+05  1476   0.981E+02
DAV:   2    -0.532282165716E+03   -0.83405E+04   -0.79456E+04  1692   0.268E+02
DAV:   3    -0.205886911588E+04   -0.15266E+04   -0.15008E+04  1440   0.118E+02
DAV:   4    -0.212473077806E+04   -0.65862E+02   -0.64340E+02  1458   0.294E+01
DAV:   5    -0.212699135291E+04   -0.22606E+01   -0.22262E+01  1575   0.450E+00    0.824E+01
DAV:   6    -0.227763433963E+04   -0.15064E+03   -0.52026E+02  1467   0.480E+01    0.541E+02
DAV:   7    -0.226783236348E+04    0.98020E+01   -0.22074E+02  1422   0.148E+01    0.151E+03
DAV:   8    -0.226629806393E+04    0.15343E+01   -0.37181E+01  1818   0.356E+00    0.171E+03
DAV:   9    -0.226359076585E+04    0.27073E+01   -0.37558E+00  1647   0.822E-01    0.183E+03
DAV:  10    -0.225222275301E+04    0.11368E+02   -0.67590E-01  1521   0.139E+00    0.169E+03
DAV:  11    -0.225222781565E+04   -0.50626E-02   -0.44245E-02  1431   0.272E-01    0.170E+03
DAV:  12    -0.225504641781E+04   -0.28186E+01   -0.27535E-01  1449   0.989E-01    0.169E+03
DAV:  13    -0.222953105148E+04    0.25515E+02   -0.23985E+00  1341   0.373E+00    0.169E+03
DAV:  14    -0.227986677702E+04   -0.50336E+02   -0.28908E+01  1431   0.142E+01    0.171E+03
DAV:  15    -0.220029257448E+04    0.79574E+02   -0.67030E+01  1332   0.160E+01    0.173E+03
DAV:  16    -0.223203701575E+04   -0.31744E+02   -0.20222E+01  1368   0.109E+01    0.177E+03
DAV:  17    -0.222323930865E+04    0.87977E+01   -0.26001E+01  1350   0.982E+00    0.181E+03
DAV:  18    -0.223700860742E+04   -0.13769E+02   -0.29051E+01  1638   0.416E+00    0.186E+03