6. HPL-MXP rules and validation

The HPL Mixed Precision (HPL-MXP) benchmark is described at https://hpl-mxp.org/.

Different vendors provide optimized implementation of the HPL-MXP benchmarks and teams are encouraged to use these vendor-optimized implementations to get the maximum performance on their clusters.

Nvidia HPL-MXP implementation: https://docs.nvidia.com/nvidia-hpc-benchmarks/HPL_MxP_benchmark.html

AMD HPL-MXP implementation: https://www.amd.com/en/developer/resources/infinity-hub/hpl-mxp.html

An example of the HPL-MXP result output is shown below:

 ****** HPL MxP Result    ******

   EPS           . . . . . . . . . . . . . . . . .          =    2.000000E-16
   Threshold     . . . . . . . . . . . . . . . . .          =    1.600000E+01
 ||Ax-b||_oo     . . . . . . . . . . . . . . . . .          =    8.737455E-14
 ||A   ||_oo     . . . . . . . . . . . . . . . . .          =    2.594294E+05
 ||x   ||_oo     . . . . . . . . . . . . . . . . .          =    7.799867E-06
 ||b   ||_oo     . . . . . . . . . . . . . . . . .          =    9.999990E-01
 ||Ax-b||_oo / (EPS * (||A||_oo * ||x||_oo + ||b||_oo) * N) =    5.599413E-04 ...... PASSED

   N = 258048, NB = 1024, NPROW = 4, NPCOL = 2, SLOPPY-TYPE = 2
      GFLOPS = 4.3468e+05, per GPU =   54335.58 ------ The HPL-MxP performance to report
   LU GFLOPS = 2.4940e+06, per GPU =  311748.98 ------ The performance excluding the iterative solver part

****** HPL MxP Result    ******

The residual that is used to assess the validity of the run is given on this line:

||Ax-b||_oo / (EPS * (||A||_oo * ||x||_oo + ||b||_oo) * N) =    5.599413E-04 ...... PASSED

Files to submit

  • HPL-MXP output saved to a file cert-{N}-mxphpl.rslts

  • HPL-MXP input parameters saved to a file cert-{N}-mxphpl.input

  • The script (or commands) used to run HPL-MXP saved to a file cert-{N}-mxphpl.run

  • HPL-MXP run timestamps saved to a file cert-{N}-mxphpl.tstamps