GPU-friendly truncation implementations #349

lkdvos · 2026-01-08T14:41:19Z

This is an attempt to get rid of the scalar-indexing oriented approach, and instead do more global operations.
Definitely still WIP, and on CPU there are definitely various optimizations that can be applied if needed.
I do wonder about the performance a bit, as I would actually expect that for a large number of sectors this might just be faster.

Some possible optimizations:

for UniqueFusion, finding the nth value is simply partialsortperm(values, n; by, rev), avoiding the need to allocate the full permutation vector
for CPU, cumsum + findlast can be replaced by a loop to avoid some intermediate allocations

codecov · 2026-01-08T16:34:15Z

Codecov Report

❌ Patch coverage is 0% with 19 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
src/factorizations/truncation.jl	0.00%	19 Missing ⚠️

Files with missing lines	Coverage Δ
src/factorizations/truncation.jl	`17.04% <0.00%> (-70.32%)`	⬇️

... and 29 files with indirect coverage changes

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

Jutho · 2026-01-12T10:16:29Z

I think that implementation is very clean. Not sure I can explain all the errors. Some seem to originate from the eigenvectors of a DiagonalTensorMap also being diagonal, which is being changed, right? But other errors I cannot directly explain without running the code locally.

lkdvos · 2026-01-12T11:11:19Z

I will have a look later, I have a local branch to fix the diagonal implementations already, we'll see what remains after.
I'll also try if it actually works on GPU, and get a similar implementation going for the truncerror version.

kshyatt force-pushed the ld-truncation branch from a9bb7f6 to 228fdcf Compare January 8, 2026 19:51

try to make truncation GPU-friendly

dd38bfb

lkdvos force-pushed the ld-truncation branch from 228fdcf to dd38bfb Compare January 10, 2026 12:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

GPU-friendly truncation implementations #349

GPU-friendly truncation implementations #349

lkdvos commented Jan 8, 2026

Uh oh!

codecov bot commented Jan 8, 2026 •

edited

Loading

Uh oh!

Jutho commented Jan 12, 2026

Uh oh!

lkdvos commented Jan 12, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

GPU-friendly truncation implementations #349

Are you sure you want to change the base?

GPU-friendly truncation implementations #349

Conversation

lkdvos commented Jan 8, 2026

Uh oh!

codecov bot commented Jan 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Jutho commented Jan 12, 2026

Uh oh!

lkdvos commented Jan 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

codecov bot commented Jan 8, 2026 •

edited

Loading

lkdvos commented Jan 12, 2026 •

edited

Loading