Skip to content

[Inference] use fp8 cuda core gemm kernel when M<=4 #15713

[Inference] use fp8 cuda core gemm kernel when M<=4

[Inference] use fp8 cuda core gemm kernel when M<=4 #15713

Annotations

1 warning

Test

succeeded Nov 26, 2024 in 31m 3s