[Inference] use fp8 cuda core gemm kernel when M<=4 #15778

Annotations

1 warning

Lint succeeded Nov 26, 2024 in 2m 32s