[Speculator training] Update benchmark_speculator_logical.py to support gpt_bigcode/granite #62
base: specu-benchmark
Conversation
Thanks @sahilsuneja1 for putting this together! I don't think we need the caller script - it's ultimately just a simple python call with a bunch of arguments, right? I'd just add a comment with a sample call or two (Llama 7b / granite 20b?) to the top of the script itself.
# This example script measures the logical speedup of running a speculator atop a base model. Run as:
# export CUDA_VISIBLE_DEVICES=1
# e.g., #1: torchrun --nproc_per_node=1 benchmark_speculator_logical.py --architecture=paged_llama --variant=7b --model_path=~/models/7B-F --tokenizer=~/models/tokenizer.model --model_source=hf --speculator_path=~/models/speculator_7B_F.pth --compile
# e.g., #2: torchrun --nproc_per_node=1 benchmark_speculator_logical.py --architecture=paged_gpt_bigcode --variant=ibm.20b --model_path=~/models/granite-20b-instruct --tokenizer=~/models/granite-20b-instruct --model_source=hf --speculator_path=~/models/speculator_granite20B.pth --n_predict=4 --threshes=[6,4,3,3]
Needs a --data_path and --subdata!
Thanks, fixed with fake values
Looks good!
(Black formatter is requesting changes though)
Updated formatting
Further CI complaints. I'll try and figure out how to get them running automatically for you rather than having to wait for my explicit go-ahead.
Fixed isort issues
Dunno how to fix the mypy errors
There are some suggestions here on how to fix some of these, possibly via a fully qualified import? @afrittoli may also have suggestions. Pasting the mypy errors here, for easier reference:
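As an illustrative sketch of the fully qualified import fix suggested above (using a stdlib module here, since the actual failing imports aren't shown): mypy can only resolve an attribute chain like `xml.etree.ElementTree` when the submodule is imported explicitly.

```python
# A bare "import xml" binds only the top-level package; mypy (and, in a fresh
# interpreter, the runtime too) cannot resolve xml.etree.ElementTree through it.
# Importing the fully qualified submodule fixes that class of error.
import xml.etree.ElementTree

root = xml.etree.ElementTree.fromstring("<model name='granite'/>")
print(root.get("name"))  # granite
```

The same pattern applies to project-local packages: replace `import pkg` plus a deep attribute access with `import pkg.sub.module` (or `from pkg.sub import module`).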
Looks like a bunch of that is because this is relying on the paged attention branch, which hasn't fully landed in
We can either point directly at the GitHub URL within requirements.txt (see some examples here: https://stackoverflow.com/questions/16584552/how-to-state-in-requirements-txt-a-direct-github-source) or we could publish fms-extras to PyPI. Probably the first option is simpler/easier.
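For reference, a direct git source in requirements.txt would look something like the line below (PEP 508 direct-reference syntax). The repository URL and branch here are placeholders, not confirmed by this thread:

```
# requirements.txt: install fms-extras straight from a git ref until it is on PyPI
# NOTE: repository URL and ref below are assumed; adjust to the real repo/branch
fms-extras @ git+https://github.com/foundation-model-stack/fms-extras.git@main
```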
@daviswer: should we also add the caller script?