Using device 2 (rank 6, local rank 6, local size 9) : Tesla P100-PCIE-16GB
Using device 2 (rank 7, local rank 7, local size 9) : Tesla P100-PCIE-16GB
Using device 2 (rank 8, local rank 8, local size 9) : Tesla P100-PCIE-16GB
Using device 1 (rank 4, local rank 4, local size 9) : Quadro GP100
Using device 1 (rank 5, local rank 5, local size 9) : Quadro GP100
Using device 0 (rank 1, local rank 1, local size 9) : Tesla P100-PCIE-16GB
Using device 1 (rank 3, local rank 3, local size 9) : Quadro GP100
Using device 0 (rank 0, local rank 0, local size 9) : Tesla P100-PCIE-16GB
Using device 0 (rank 2, local rank 2, local size 9) : Tesla P100-PCIE-16GB
 running on    9 total cores
 distrk:  each k-point on    9 cores,    1 groups
 distr:  one band on    1 cores,    9 groups
  
 *******************************************************************************
  You are running the GPU port of VASP! When publishing results obtained with
  this version, please cite:
   - M. Hacene et al., http://dx.doi.org/10.1002/jcc.23096
   - M. Hutchinson and M. Widom, http://dx.doi.org/10.1016/j.cpc.2012.02.017
  
  in addition to the usual required citations (see manual).
  
  GPU developers: A. Anciaux-Sedrakian, C. Angerer, and M. Hutchinson.
 *******************************************************************************
  
 -----------------------------------------------------------------------------
|                                                                             |
|           W    W    AA    RRRRR   N    N  II  N    N   GGGG   !!!           |
|           W    W   A  A   R    R  NN   N  II  NN   N  G    G  !!!           |
|           W    W  A    A  R    R  N N  N  II  N N  N  G       !!!           |
|           W WW W  AAAAAA  RRRRR   N  N N  II  N  N N  G  GGG   !            |
|           WW  WW  A    A  R   R   N   NN  II  N   NN  G    G                |
|           W    W  A    A  R    R  N    N  II  N    N   GGGG   !!!           |
|                                                                             |
|     Please note that VASP has recently been ported to GPU by means of       |
|     OpenACC. You are running the CUDA-C GPU-port of VASP, which is          |
|     deprecated and no longer actively developed, maintained, or             |
|     supported. In the near future, the CUDA-C GPU-port of VASP will be      |
|     dropped completely. We encourage you to switch to the OpenACC           |
|     GPU-port of VASP as soon as possible.                                   |
|                                                                             |
 -----------------------------------------------------------------------------

 vasp.6.2.1 16May21 (build Apr 11 2022 11:03:26) complex                        
  
 MD_VERSION_INFO: Compiled 2022-04-11T18:25:55-UTC in devlin.sd.materialsdesign.com:/home/medea2/data/build/vasp6.2.1/16685/x86_64/src/src/build/gpu from svn 16685
 
 This VASP executable licensed from Materials Design, Inc.
 
 POSCAR found type information on POSCAR C Ru
 POSCAR found :  2 types and     217 ions
 NWRITE =            1
 NWRITE =            1
 NWRITE =            1
 NWRITE =            1
 NWRITE =            1
 NWRITE =            1
 NWRITE =            1
 NWRITE =            1
 NWRITE =            1
 -----------------------------------------------------------------------------
|                                                                             |
|           W    W    AA    RRRRR   N    N  II  N    N   GGGG   !!!           |
|           W    W   A  A   R    R  NN   N  II  NN   N  G    G  !!!           |
|           W    W  A    A  R    R  N N  N  II  N N  N  G       !!!           |
|           W WW W  AAAAAA  RRRRR   N  N N  II  N  N N  G  GGG   !            |
|           WW  WW  A    A  R   R   N   NN  II  N   NN  G    G                |
|           W    W  A    A  R    R  N    N  II  N    N   GGGG   !!!           |
|                                                                             |
|     For optimal performance we recommend to set                             |
|       NCORE = 2 up to number-of-cores-per-socket                            |
|     NCORE specifies how many cores store one orbital (NPAR=cpu/NCORE).      |
|     This setting can greatly improve the performance of VASP for DFT.       |
|     The default, NCORE=1 might be grossly inefficient on modern             |
|     multi-core architectures or massively parallel machines. Do your        |
|     own testing! More info at https://www.vasp.at/wiki/index.php/NCORE      |
|     Unfortunately you need to use the default for GW and RPA                |
|     calculations (for HF NCORE is supported but not extensively tested      |
|     yet).                                                                   |
|                                                                             |
 -----------------------------------------------------------------------------

 LDA part: xc-table for Pade appr. of Perdew
  
 WARNING: The GPU port of VASP has been extensively
 tested for: ALGO=Normal, Fast, and VeryFast.
 Other algorithms may produce incorrect results or
 yield suboptimal performance. Handle with care!
  
 POSCAR, INCAR and KPOINTS ok, starting setup
creating 32 CUDA streams...
creating 32 CUDA streams...
creating 32 CUDA streams...
creating 32 CUDA streams...
creating 32 CUDA streams...
creating 32 CUDA streams...
creating 32 CUDA streams...
creating 32 CUDA streams...
creating 32 CUDA streams...
creating 32 CUFFT plans with grid size 108 x 150 x 84...
creating 32 CUFFT plans with grid size 108 x 150 x 84...
creating 32 CUFFT plans with grid size 108 x 150 x 84...
creating 32 CUFFT plans with grid size 108 x 150 x 84...
creating 32 CUFFT plans with grid size 108 x 150 x 84...
creating 32 CUFFT plans with grid size 108 x 150 x 84...
creating 32 CUFFT plans with grid size 108 x 150 x 84...
creating 32 CUFFT plans with grid size 108 x 150 x 84...
creating 32 CUFFT plans with grid size 108 x 150 x 84...
 FFT: planning ...
 WAVECAR not read
 entering main loop
       N       E                     dE             d eps       ncg     rms          rms(c)
DAV:   1     0.780821537900E+04    0.78082E+04   -0.34058E+05  1476   0.981E+02 
DAV:   2    -0.532282165716E+03   -0.83405E+04   -0.79456E+04  1692   0.268E+02 
DAV:   3    -0.205886911588E+04   -0.15266E+04   -0.15008E+04  1440   0.118E+02 
DAV:   4    -0.212473077806E+04   -0.65862E+02   -0.64340E+02  1458   0.294E+01 
DAV:   5    -0.212699135291E+04   -0.22606E+01   -0.22262E+01  1575   0.450E+00    0.824E+01
DAV:   6    -0.227763433963E+04   -0.15064E+03   -0.52026E+02  1467   0.480E+01    0.541E+02
DAV:   7    -0.226783236348E+04    0.98020E+01   -0.22074E+02  1422   0.148E+01    0.151E+03
DAV:   8    -0.226629806393E+04    0.15343E+01   -0.37181E+01  1818   0.356E+00    0.171E+03
DAV:   9    -0.226359076585E+04    0.27073E+01   -0.37558E+00  1647   0.822E-01    0.183E+03
DAV:  10    -0.225222275301E+04    0.11368E+02   -0.67590E-01  1521   0.139E+00    0.169E+03
DAV:  11    -0.225222781565E+04   -0.50626E-02   -0.44245E-02  1431   0.272E-01    0.170E+03
DAV:  12    -0.225504641781E+04   -0.28186E+01   -0.27535E-01  1449   0.989E-01    0.169E+03
DAV:  13    -0.222953105148E+04    0.25515E+02   -0.23985E+00  1341   0.373E+00    0.169E+03
DAV:  14    -0.227986677702E+04   -0.50336E+02   -0.28908E+01  1431   0.142E+01    0.171E+03
DAV:  15    -0.220029257448E+04    0.79574E+02   -0.67030E+01  1332   0.160E+01    0.173E+03
DAV:  16    -0.223203701575E+04   -0.31744E+02   -0.20222E+01  1368   0.109E+01    0.177E+03
DAV:  17    -0.222323930865E+04    0.87977E+01   -0.26001E+01  1350   0.982E+00    0.181E+03
DAV:  18    -0.223700860742E+04   -0.13769E+02   -0.29051E+01  1638   0.416E+00    0.186E+03