Open
Conversation
…beddedllm into szeyu-benchmark-2
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Benchmark
Allow users to test on themselves to get the benchmark of model(s) on different backend. It will analyse the Token In / Out throughput for you in a statistical manner
Benchmark a Model
To benchmark a model, run this
cpu|ipex|openvino|directmlName of the ModelPath to Model|Model Repo IDNumber of Input Tokens (Max 2048)Number of Output TokensLoop to benchmark the models
Customise your benchmarking config
Generate a Report (
XLSX) of a Model's BenchmarkTo Generate report for a model, run this
Name of the ModelGenerate Reports (
XLSX) of Models' BenchmarkList out the models that you want to have report of benchmarking