[WIP] Reduce the number of fa rows for Intel #18138

mmerecki · 2025-12-17T12:43:25Z

Reduce the number of flash attention (fa) rows for Intel to reduce register usage.

jeffbolznv · 2025-12-17T15:14:04Z

Should this depend on head size? Some models have small head sizes like 64 or even 40, and 2 rows seems pretty small for those. But if 2 is best, I don't object.

mmerecki · 2025-12-18T15:05:41Z

Thanks Jeff. I will verify this change with more models and potentially update the value for small head sizes.
I will also add information about the test results before I make the PR ready for review.

Reduce the number of fa rows for Intel

4e35585

loci-dev mentioned this pull request Dec 17, 2025

UPSTREAM PR #18138: Reduce the number of fa rows for Intel auroralabs-loci/llama.cpp#606

Open

mmerecki changed the title ~~Reduce the number of fa rows for Intel~~ [WIP] Reduce the number of fa rows for Intel Dec 17, 2025

github-actions bot added the Vulkan (Issues specific to the Vulkan backend) and ggml (changes relating to the ggml tensor library for machine learning) labels Dec 17, 2025
