[FP16] Improved performance by fusing dequantize with compute in kernels: 20-30% Inference Speedup #148
