Collaborator

@linamy85 linamy85 commented Feb 4, 2026

The new utility will use jnp.finfo and jnp.iinfo to determine the accurate bit width of any dtype, ensuring correct bandwidth metrics for current and future sub-byte types (like int4 or float4).
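A minimal sketch of what such a utility might look like, assuming it dispatches on the dtype kind and reads the `bits` field from `jnp.finfo` or `jnp.iinfo`. The names `dtype_bits` and `dtype_bytes` are hypothetical and are not taken from the merged change:

```python
import jax.numpy as jnp


def dtype_bits(dtype) -> int:
    """Hypothetical helper: bit width of a dtype, including sub-byte types."""
    dt = jnp.dtype(dtype)
    if jnp.issubdtype(dt, jnp.floating):
        # finfo reports the true width, e.g. 16 for bfloat16.
        return int(jnp.finfo(dt).bits)
    if jnp.issubdtype(dt, jnp.integer):
        # iinfo reports 4 for int4, even though storage may be padded.
        return int(jnp.iinfo(dt).bits)
    # Assumed fallback for other dtypes (e.g. bool): use the storage size.
    return dt.itemsize * 8


def dtype_bytes(dtype) -> float:
    """Hypothetical helper: fractional byte width, e.g. 0.5 for a 4-bit type."""
    return dtype_bits(dtype) / 8
```

Returning a float lets a 4-bit type report 0.5 bytes rather than being rounded up to a whole byte, which is what a size based on `itemsize` alone would give.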

Collaborator

chishuen commented Feb 4, 2026

Did you test your change? Can you share the new collective test results for FP4? Otherwise, LGTM.

Collaborator Author

linamy85 commented Feb 5, 2026

> Did you test your change? Can you share the new collective test results for FP4? Otherwise, LGTM.

Yes, the all-gather results on 2x2x2 are as follows:

| dtype | dtype_bytes | achieved_bw (GB/s)_p50 |
|-------|-------------|------------------------|
| fp4   | 0.5         | 183.5038978            |
| fp8   | 1           | 184.8115224            |
| bf16  | 2           | 185.708862             |
| f16   | 2           | 185.7897159            |
| f32   | 4           | 186.1313207            |
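To illustrate why the byte width matters for these numbers, here is a rough sketch that assumes achieved bandwidth is simply bytes moved divided by elapsed time. The function name, element count, and timing are hypothetical and are not taken from the benchmark:

```python
def achieved_bw_gb_per_s(num_elements: int, bits_per_element: int, seconds: float) -> float:
    """Hypothetical metric: bytes moved per second, in GB/s."""
    total_bytes = num_elements * bits_per_element / 8
    return total_bytes / seconds / 1e9


num_elements = 1 << 30  # hypothetical payload size
elapsed_s = 0.004       # hypothetical measured time

# With the true 4-bit width, each element contributes 0.5 bytes.
correct = achieved_bw_gb_per_s(num_elements, 4, elapsed_s)

# If a 4-bit type were rounded up to one byte, the same run would
# report twice the bandwidth.
overstated = achieved_bw_gb_per_s(num_elements, 8, elapsed_s)
```

Under this assumption, a rounded-up width would make fp4 look about twice as fast as the other dtypes, whereas the table shows fp4 in line with them.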

Collaborator

@hylin2002 hylin2002 left a comment


LGTM

Collaborator

@chishuen chishuen left a comment


LGTM

@linamy85 linamy85 merged commit 7bfaa74 into AI-Hypercomputer:main Feb 5, 2026
2 checks passed
linamy85 added a commit that referenced this pull request Feb 5, 2026