@stduhpf
Contributor

@stduhpf stduhpf commented Dec 12, 2025

Fixes #984

@stduhpf
Contributor Author

stduhpf commented Dec 12, 2025

@leejet It looks like ggml_ext_chunk() is a serious bottleneck, probably because of its repeated use of ggml_permute() and ggml_cont(). Rewriting it so that it no longer relies on permutations might be even better than merging this PR. Let me know if you'd like me to give it a shot.
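
For illustration only, here is a minimal standalone sketch of what "chunking without permutations" could look like: each chunk is a strided view into the parent buffer, so no data is moved or copied. This is not the code from this PR or from #1080, and it does not use ggml. `View`, `make_contiguous` and `chunk_views` are hypothetical names, and strides are counted in elements with dimension 0 fastest. In ggml itself the closest analogue would presumably be a view op such as `ggml_view_4d` with a byte offset, though whether later ops accept non-contiguous chunks is an assumption that would need checking.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for a tensor view (not a ggml type).
// Sizes and strides are in elements; dimension 0 is the fastest-varying one.
struct View {
    const float* data;               // points into the parent's buffer, no ownership
    std::array<int64_t, 4> ne;       // number of elements per dimension
    std::array<int64_t, 4> stride;   // stride per dimension, in elements

    float at(int64_t i0, int64_t i1, int64_t i2, int64_t i3) const {
        return data[i0 * stride[0] + i1 * stride[1] + i2 * stride[2] + i3 * stride[3]];
    }
};

// Build a view over a densely packed buffer with the given shape.
inline View make_contiguous(const float* data, std::array<int64_t, 4> ne) {
    return View{data, ne,
                std::array<int64_t, 4>{1, ne[0], ne[0] * ne[1], ne[0] * ne[1] * ne[2]}};
}

// Split `src` into `n` equal chunks along `dim`.
// Only the base pointer and the extent along `dim` change; strides are inherited,
// so there is no permutation and no contiguous copy.
inline std::vector<View> chunk_views(const View& src, int n, int dim) {
    assert(dim >= 0 && dim < 4);
    assert(n > 0 && src.ne[dim] % n == 0);
    const int64_t len = src.ne[dim] / n;
    std::vector<View> out;
    out.reserve(static_cast<std::size_t>(n));
    for (int i = 0; i < n; ++i) {
        View v = src;
        v.ne[dim] = len;
        v.data = src.data + i * len * src.stride[dim];
        out.push_back(v);
    }
    return out;
}
```

The trade-off is that chunks taken along any dimension other than the outermost one are not contiguous in memory, so a consumer that requires contiguous input would still need a copy at that point.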

@leejet
Owner

leejet commented Dec 12, 2025

Perhaps #1080 is a simpler fix.

@stduhpf stduhpf closed this Dec 12, 2025
@wbruna wbruna mentioned this pull request Dec 12, 2025

Development

Successfully merging this pull request may close these issues.

[Bug] Performance regression

2 participants