Performance attribute | Our submission |
---|---|
Category of achievement | Time-to-solution, scalability |
Type of method used | Machine learning |
Results reported for | Whole application with and without I/O |
Precision reported | Mixed precision (FP16 and FP32) |
System scale | Measured on full-scale system (Summit) |
Measurement mechanism | Internal timers, DeepSpeed FLOPS profiler |