(reference-only) Multi backend refactor -> main (full diff of all already merged PRs)#1220
(reference-only) Multi backend refactor -> main (full diff of all already merged PRs)#1220Titus-von-Koeller wants to merge 283 commits intomainfrom
Conversation
|
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update. |
* fix xpu dtypoe Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * fix nf4 dtype Signed-off-by: jiqing-feng <jiqing.feng@intel.com> --------- Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix version Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * fix setup version Signed-off-by: jiqing-feng <jiqing.feng@intel.com> --------- Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* enable benchmark script Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * Small fixes to non_cuda_backends.mdx --------- Signed-off-by: jiqing-feng <jiqing.feng@intel.com> Co-authored-by: Titus <9048635+Titus-von-Koeller@users.noreply.github.com>
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* enable quant storage Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * fix to numpy Signed-off-by: jiqing-feng <jiqing.feng@intel.com> --------- Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix 4bit XPU dequant 4bit Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * fix default value Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * fix ipex linear set Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * fix ipex linear set to false when calling state dict Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * fix Int8Param device patch Signed-off-by: jiqing-feng <jiqing.feng@intel.com> --------- Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix xpu to cpu Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * fix xpu cpu data device Signed-off-by: jiqing-feng <jiqing.feng@intel.com> --------- Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix intel cpu/xpu warning Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * fix error log Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * fix lib Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * rm return Nonr Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * error log only without ipex Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * fix import eerror Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * fix format Signed-off-by: jiqing-feng <jiqing.feng@intel.com> --------- Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
|
Could someone post about the status/progress of this PR? Like a list of checked and unchecked known items to do. |
* enable xpu 8bit optim * add deqaunt_blockwise * dequantize_blockwise * add bakcend synchronize * refine code * ipex dep * ipex dep too * ipex version check --------- Co-authored-by: jiqing-feng <jiqing.feng@intel.com>
Authored by: Chetan Kumar Verma <chetan.kumar.verma@intel.com> Co-authored-by: Ruheena Suhani Shaik <ruheena.suhani.shaik@intel.com> Co-authored-by: Bhargav Eede <bhargav.eede@intel.com> Co-authored-by: Vivek Goel <vivek.goel@intel.com> Co-authored-by: Ruheena Suhani Shaik <rsshaik@habana.ai>
|
Please see this short update about the multi-backend refactor #1596. cc @anadon |
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
|
We're closing this PR, because the functionality contained therein has been merged to In the mean-time the already merged code can be used by downloading the wheel from CI: |
This PR to
mainserves the purpose to keep an overview of all the extensive changes that have been introduced tomulti-backend-refactorto the iterative PRs around this topic.This will not be merged into master and instead the changes will be ported to the new custom_ops API that's already merged to main. Future backend PRs should be addressed directly at
main.