[Feature Request] Support for NPU

I am encountering a compatibility issue when running this project on NPU hardware (Huawei Ascend). The project currently depends on torch_memory_saver, which seems to be heavily optimized for CUDA memory management and fragmentation reduction.

Context

- Dependency: torch_memory_saver (likely introduced via sglang or vllm).
- Hardware: Huawei Ascend NPU (CANN).
- Current Behavior: torch_memory_saver specifically hooks into CUDA-related memory APIs (cudaMalloc, etc.). When running on NPU, it either fails to initializ

Question
Does the roadmap include support for non-CUDA backends like NPU?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Feature Request] Support for NPU #27

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Feature Request] Support for NPU #27

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions