-
Notifications
You must be signed in to change notification settings - Fork 14
Open
Description
This issue tracks progress on running Bamba models with FSDP2.
Top level goals:
- FSDP2 + torch kernel impl
- FSDP2 +
mamba_ssmkernel impl - FSDP2 + TP
- FSDP2 + CP
- FSDP2 + FP8
- FSDP2 + PP
Known Issues/Open Questions
- Requires custom op registration for FSDP2 compatibility
DTensorcurrently only has minimal/experimental support for convolutions. Because of the depthwise convolution in the mamba layers, this is a blocker for TP/CP support. @garrett361 is working on more robust conv support.
Metadata
Metadata
Assignees
Labels
No labels