Commit 2427825
Update lm-eval set-up to address regression (#2142)
SUMMARY:
- Switching the collator from truncation to default and enabling shuffling
appears to address the regression we're seeing in lm-eval
- The recovery values in these tests were determined using these settings,
so I think we should evaluate our lm-eval tests with them for the time
being
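
The second point can be illustrated with a minimal sketch. It assumes "recovery" means the compressed model's score divided by the baseline score, and that a recovery threshold is only meaningful under the settings it was measured with. `CalibrationSettings`, its field names, and `check_recovery` are hypothetical names for illustration, not identifiers from this repository.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CalibrationSettings:
    # Hypothetical field names; the real test config keys may differ.
    collator: str = "default"
    shuffle: bool = True


def recovery(compressed_score: float, baseline_score: float) -> float:
    """Fraction of the baseline metric retained by the compressed model."""
    if baseline_score <= 0:
        raise ValueError("baseline_score must be positive")
    return compressed_score / baseline_score


def check_recovery(
    compressed_score: float,
    baseline_score: float,
    threshold: float,
    settings: CalibrationSettings,
    reference_settings: CalibrationSettings,
) -> bool:
    """Compare recovery to a threshold, but only under matching settings."""
    if settings != reference_settings:
        # A threshold measured under other settings is not comparable.
        raise ValueError("settings differ from those used to set the threshold")
    return recovery(compressed_score, baseline_score) >= threshold
```

Under these assumptions, a run with a truncation collator would be rejected instead of being compared against a threshold measured with the default collator and shuffling.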
---------
Signed-off-by: Dipika Sikka <ds3822@columbia.edu>
Co-authored-by: Brian Dellabetta <brian-dellabetta@users.noreply.github.com>
1 parent 5f6c8db commit 2427825
1 file changed, +7 -1 lines changed
Diff hunks (line content omitted):
- Original lines 1-8 (new lines 1-10): two lines added at new lines 1-2; original line 5 replaced by new line 7
- Original lines 34-42 (new lines 36-47): lines added at new lines 39-40 and 44
- Original lines 74-79 (new lines 79-85): one line added at new line 82