Using device 0 (rank 0, local rank 0, local size 3) : Tesla P100-PCIE-16GB
Using device 1 (rank 1, local rank 1, local size 3) : Quadro GP100
Using device 2 (rank 2, local rank 2, local size 3) : Tesla P100-PCIE-16GB
 running on    3 total cores
 distrk:  each k-point on    3 cores,    1 groups
 distr:  one band on    1 cores,    3 groups
  
 *******************************************************************************
  You are running the GPU port of VASP! When publishing results obtained with
  this version, please cite:
   - M. Hacene et al., http://dx.doi.org/10.1002/jcc.23096
   - M. Hutchinson and M. Widom, http://dx.doi.org/10.1016/j.cpc.2012.02.017
  
  in addition to the usual required citations (see manual).
  
  GPU developers: A. Anciaux-Sedrakian, C. Angerer, and M. Hutchinson.
 *******************************************************************************
  
 -----------------------------------------------------------------------------
|                                                                             |
|           W    W    AA    RRRRR   N    N  II  N    N   GGGG   !!!           |
|           W    W   A  A   R    R  NN   N  II  NN   N  G    G  !!!           |
|           W    W  A    A  R    R  N N  N  II  N N  N  G       !!!           |
|           W WW W  AAAAAA  RRRRR   N  N N  II  N  N N  G  GGG   !            |
|           WW  WW  A    A  R   R   N   NN  II  N   NN  G    G                |
|           W    W  A    A  R    R  N    N  II  N    N   GGGG   !!!           |
|                                                                             |
|     Please note that VASP has recently been ported to GPU by means of       |
|     OpenACC. You are running the CUDA-C GPU-port of VASP, which is          |
|     deprecated and no longer actively developed, maintained, or             |
|     supported. In the near future, the CUDA-C GPU-port of VASP will be      |
|     dropped completely. We encourage you to switch to the OpenACC           |
|     GPU-port of VASP as soon as possible.                                   |
|                                                                             |
 -----------------------------------------------------------------------------

 vasp.6.2.1 16May21 (build Apr 11 2022 11:03:26) complex                        
  
 MD_VERSION_INFO: Compiled 2022-04-11T18:25:55-UTC in devlin.sd.materialsdesign.
 com:/home/medea2/data/build/vasp6.2.1/16685/x86_64/src/src/build/gpu from svn 1
 6685
 
 This VASP executable licensed from Materials Design, Inc.
 
 POSCAR found type information on POSCAR Si
 POSCAR found :  1 types and       2 ions
 NWRITE =            1
 NWRITE =            1
 NWRITE =            1
 -----------------------------------------------------------------------------
|                                                                             |
|               ----> ADVICE to this user running VASP <----                  |
|                                                                             |
|     You have a (more or less) 'small supercell' and for smaller cells       |
|     it is recommended to use the reciprocal-space projection scheme!        |
|     The real-space optimization is not efficient for small cells and it     |
|     is also less accurate ...                                               |
|     Therefore, set LREAL=.FALSE. in the INCAR file.                         |
|                                                                             |
 -----------------------------------------------------------------------------

 LDA part: xc-table for Pade appr. of Perdew
  
 WARNING: The GPU port of VASP has been extensively
 tested for: ALGO=Normal, Fast, and VeryFast.
 Other algorithms may produce incorrect results or
 yield suboptimal performance. Handle with care!
  
 POSCAR, INCAR and KPOINTS ok, starting setup
creating 32 CUDA streams...
creating 32 CUDA streams...
creating 32 CUDA streams...
creating 32 CUFFT plans with grid size 16 x 16 x 16...
creating 32 CUFFT plans with grid size 16 x 16 x 16...
creating 32 CUFFT plans with grid size 16 x 16 x 16...
 FFT: planning ...
 WAVECAR not read
 entering main loop
       N       E                     dE             d eps       ncg     rms          rms(c)
DAV:   1    -0.935903349511E+00   -0.93590E+00   -0.24478E+03   189   0.408E+02 
DAV:   2    -0.108732527131E+02   -0.99373E+01   -0.98103E+01   306   0.561E+01 
DAV:   3    -0.109470777163E+02   -0.73825E-01   -0.73822E-01   225   0.604E+00 
DAV:   4    -0.109473176190E+02   -0.23990E-03   -0.23990E-03   288   0.340E-01 
DAV:   5    -0.109473179801E+02   -0.36110E-06   -0.36040E-06   279   0.777E-03    0.546E+00
DAV:   6    -0.108137475351E+02    0.13357E+00   -0.86857E-02   207   0.137E+00    0.335E+00
DAV:   7    -0.107532466002E+02    0.60501E-01   -0.16599E-01   243   0.202E+00    0.204E-01
DAV:   8    -0.107543903406E+02   -0.11437E-02   -0.34969E-03   198   0.378E-01    0.744E-02
DAV:   9    -0.107546156274E+02   -0.22529E-03   -0.21649E-04   261   0.100E-01    0.613E-02
DAV:  10    -0.107546391002E+02   -0.23473E-04   -0.24256E-05   234   0.332E-02    0.961E-03
DAV:  11    -0.107546388269E+02    0.27335E-06   -0.90009E-07   198   0.553E-03 
   1 F= -.10754639E+02 E0= -.10754914E+02  d E =-.107546E+02
 curvature:   0.00 expect dE= 0.000E+00 dE for cont linesearch  0.000E+00
 trial: gam= 0.00000 g(F)=  0.296E-43 g(S)=  0.302E-01 ort = 0.000E+00 (trialstep = 0.100E+01)
 search vector abs. value=  0.302E-01
 ######################################################################
 bond charge predicted
       N       E                     dE             d eps       ncg     rms          rms(c)
DAV:   1    -0.107712907908E+02   -0.16652E-01   -0.45323E-01   180   0.332E+00    0.604E-01
DAV:   2    -0.107636040220E+02    0.76868E-02   -0.60539E-03   198   0.447E-01    0.360E-01
DAV:   3    -0.107596242401E+02    0.39798E-02   -0.81628E-03   189   0.407E-01    0.134E-01
DAV:   4    -0.107596269951E+02   -0.27550E-05   -0.68240E-05   207   0.643E-02 
   2 F= -.10759627E+02 E0= -.10759767E+02  d E =-.498817E-02
 trial-energy change:   -0.004988  1 .order   -0.005229   -0.030161    0.019703
 step:   0.5979(harm=  0.6049)  dis= 0.00000  next Energy=   -10.763604 (dE=-0.897E-02)
 ######################################################################
 bond charge predicted
       N       E                     dE             d eps       ncg     rms          rms(c)
DAV:   1    -0.107655526096E+02   -0.59284E-02   -0.74386E-02   180   0.136E+00    0.224E-01
DAV:   2    -0.107643238086E+02    0.12288E-02   -0.94940E-04   198   0.176E-01    0.130E-01
DAV:   3    -0.107636706142E+02    0.65319E-03   -0.13320E-03   189   0.165E-01    0.583E-02
DAV:   4    -0.107636720816E+02   -0.14673E-05   -0.11244E-05   153   0.264E-02 
   3 F= -.10763672E+02 E0= -.10763850E+02  d E =-.903325E-02
 curvature:  -0.30 expect dE=-0.479E-06 dE for cont linesearch -0.479E-06
 trial: gam= 0.00000 g(F)=  0.329E-43 g(S)=  0.158E-05 ort =-0.219E-03 (trialstep = 0.920E+00)
 search vector abs. value=  0.158E-05
 reached required accuracy - stopping structural energy minimisation