Published: July 9, 2025
25
30
296
> fp8 is 100 tflops faster when the kernel name has "cutlass" in it kms https://github.com/triton-lang...
anon, you mean to tell me you're not autotuning the kernel name?
> fp8 is 100 tflops faster when the kernel name has "cutlass" in it kms https://github.com/triton-lang...
anon, you mean to tell me you're not autotuning the kernel name?