Skip to content

Conversation

@LucaMarconato
Copy link
Member

@LucaMarconato LucaMarconato commented Feb 3, 2026

Minor change

The latest dask distributed cannot be used with the second latest dask (the latest dask is not supported because it's not supported in ome-zarr-py). This PR adds an upper bound.

Main change

Improves performance of points parsing by eliminating superfluous conversions to categorical for columns that are not the feature_key column. Also warns the user that converting a column to categorical prior to parsing (by preserving the known status of the categories), improves performance.

Tested with Xenium data (performance of point parsing improves ~20x for ~100 million transcripts): .

@LucaMarconato LucaMarconato changed the title Improve handling of categoricals for feature_key in points Pins distributed; better categorical handing for points parser Feb 3, 2026
@codecov
Copy link

codecov bot commented Feb 3, 2026

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 92.24%. Comparing base (db34783) to head (9b60a23).
⚠️ Report is 1 commits behind head on main.

Additional details and impacted files
@@            Coverage Diff             @@
##             main    #1061      +/-   ##
==========================================
+ Coverage   92.21%   92.24%   +0.02%     
==========================================
  Files          49       49              
  Lines        7593     7594       +1     
==========================================
+ Hits         7002     7005       +3     
+ Misses        591      589       -2     
Files with missing lines Coverage Δ
src/spatialdata/models/models.py 88.99% <100.00%> (+0.38%) ⬆️
🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

@LucaMarconato LucaMarconato merged commit 3cdf3d8 into main Feb 3, 2026
9 checks passed
@LucaMarconato LucaMarconato deleted the performance/points-parsing branch February 3, 2026 17:54
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants