[FP16] Improved performance by fusing dequantize with compute in kernels: 20-30% Inference Speedup #141
build-and-run.yml
on: pull_request
code-quality
14s
Matrix: build-and-run