-
Notifications
You must be signed in to change notification settings - Fork 37
Closed
Description
I am encountering a compatibility issue when running this project on NPU hardware (Huawei Ascend). The project currently depends on torch_memory_saver, which seems to be heavily optimized for CUDA memory management and fragmentation reduction.
Context
- Dependency: torch_memory_saver (likely introduced via sglang or vllm).
- Hardware: Huawei Ascend NPU (CANN).
- Current Behavior: torch_memory_saver specifically hooks into CUDA-related memory APIs (cudaMalloc, etc.). When running on NPU, it either fails to initializ
Question
Does the roadmap include support for non-CUDA backends like NPU?
Metadata
Metadata
Assignees
Labels
No labels