Very slow performance of mtmd model on Android devices #18247
Unanswered
kimminsu38oo
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Hello,
I tested the LLaVA 1.5 model on my mobile device (Galaxy S24 Ultra), and unfortunately the performance was extremely slow, to the point where it is barely usable.
https://github.com/ggml-org/llama.cpp/blob/master/docs/multimodal/llava.md#llava-15
Below are the profiling results from my test:
image decoding stage alone takes over 40 seconds..
I would like to ask:
Thank you very much for your work on llama.cpp, and I would really appreciate any insights or guidance on this issue.
Beta Was this translation helpful? Give feedback.
All reactions