
[FP16] Improved performance by fusing dequantize with compute in kernels: 20-30% Inference Speedup #137


Triggered via pull request December 4, 2025 18:21
Status Failure
Total duration 5m 21s

build-and-run.yml

on: pull_request
code-quality
13s
Matrix: build-and-run

Annotations

2 errors
build-and-run (ptx)
Process completed with exit code 1.
build-and-run (opencl)
The strategy configuration was canceled because "build-and-run.ptx" failed