Resnet only use contiguous in training mode. #12977
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
sayakpaul
left a comment
Thanks!
The failing test passes on But with this PR, that fails. Could you please check?
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
The behaviour change has little impact on some models' precision; just changing the tolerance from
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
But we need the tests to pass on our CI.
I will check these failed tests.
Hi @sayakpaul. The failed test seems to be a network issue. Please check it. Thanks!
Hi @sayakpaul. Please retrigger the CI. Thanks!
It's so weird. The CI already passed in commit aa65c5c, but failed after the branch was updated. @sayakpaul, do you have any clue about it?
Hi @sayakpaul. Would you please re-run the CI to see if the error disappears? Thanks!
I see the LoRA tests also failed in #13003.
@require_hf_hub_version_greater("0.26.5")
@require_transformers_version_greater("4.47.1")
- def test_save_load_dduf(self, atol=1e-4, rtol=1e-4):
+ def test_save_load_dduf(self, atol=1e-3, rtol=1e-3):
We shouldn't change it at the main test level. If a model's test is failing, we should override the method in the corresponding test class and relax the tolerance there.
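For illustration, the suggestion above could look roughly like the sketch below. It is not the actual diffusers test code: `PipelineTesterMixin` here is a simplified stand-in, and `get_outputs`, `_allclose`, and `SomeModelPipelineTests` are hypothetical names. Only the method name `test_save_load_dduf` and the `atol`/`rtol` defaults come from the diff.

```python
import unittest


def _allclose(a, b, atol, rtol):
    # Scalar version of the usual allclose rule: |a - b| <= atol + rtol * |b|.
    return abs(a - b) <= atol + rtol * abs(b)


class PipelineTesterMixin:
    """Simplified stand-in for a shared test mixin (hypothetical)."""

    def get_outputs(self):
        # Hypothetical hook: returns (output_before_save, output_after_load).
        return (1.0, 1.0)

    def test_save_load_dduf(self, atol=1e-4, rtol=1e-4):
        # The shared defaults stay strict for every model.
        before, after = self.get_outputs()
        assert _allclose(before, after, atol, rtol), "outputs differ too much"


class SomeModelPipelineTests(PipelineTesterMixin, unittest.TestCase):
    """Hypothetical test class for one model that needs a looser tolerance."""

    def get_outputs(self):
        # Pretend this model drifts slightly after a save/load round trip.
        return (1.0, 1.0005)

    def test_save_load_dduf(self, atol=1e-3, rtol=1e-3):
        # Override only here, so other models keep the strict defaults.
        super().test_save_load_dduf(atol=atol, rtol=rtol)
```

With this layout the relaxed tolerance is scoped to the one model that needs it, and a regression in any other model would still be caught by the stricter shared defaults.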


Because calling contiguous() may have a negative impact on CPU performance, ResNet should call it only in training mode.
Fix #12975
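
A minimal sketch of the idea in the description, assuming PyTorch is installed. `TinyResnetBlock` and `_maybe_contiguous` are hypothetical names made up for this example; this is not the actual diffusers ResNet block or the exact change in this PR. It only shows the pattern of gating the `contiguous()` call on `self.training`.

```python
import torch
from torch import nn
import torch.nn.functional as F


class TinyResnetBlock(nn.Module):
    """Toy residual block (hypothetical) that copies to contiguous memory only when training."""

    def __init__(self, channels: int = 4):
        super().__init__()
        self.norm = nn.GroupNorm(2, channels)
        self.conv = nn.Conv2d(channels, channels, kernel_size=3, padding=1)

    def _maybe_contiguous(self, tensor: torch.Tensor) -> torch.Tensor:
        # In training mode, force a contiguous layout; in eval mode,
        # return the tensor untouched to avoid an extra copy.
        return tensor.contiguous() if self.training else tensor

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        hidden = self.conv(F.silu(self.norm(x)))
        x = self._maybe_contiguous(x)
        hidden = self._maybe_contiguous(hidden)
        return x + hidden


block = TinyResnetBlock()
# A permuted view is a simple way to get a non-contiguous input.
x = torch.randn(2, 4, 8, 6).permute(0, 1, 3, 2)

block.train()
train_out = block(x)  # contiguous() is applied inside forward

block.eval()
with torch.no_grad():
    eval_out = block(x)  # no contiguous() call, so no extra copy
```

`nn.Module.train()` and `nn.Module.eval()` toggle `self.training`, so the same module skips the copy at inference time without any extra configuration.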