return to table of content
Fp8 runs ~100 tflops faster when the kernel name has "cutlass" in it
166 comments