Skip to content

Conversation

@geruh
Copy link
Contributor

@geruh geruh commented Nov 21, 2025

Closes #2776

Rationale for this change

This PR removes the unused start and length fields from the FileScanTask. While working on the models for rest scanning in #2775 we noticed that these fields were initialized but never accessed or used anywhere in the code base.

Are these changes tested?

> make test
...
3023 passed, 1392 deselected in 30.39s

Are there any user-facing changes?

no

@geruh geruh changed the title fix: Remove unnecessary fields from FileScanTask fix: Remove unused fields from FileScanTask Nov 21, 2025
Copy link
Contributor

@kevinjqliu kevinjqliu left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM these aren't used at all

@kevinjqliu
Copy link
Contributor

cc @Fokko for another pair of eyes 😄

Copy link
Contributor

@Fokko Fokko left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This changes the public API, but I think we can remove this.

The idea was to scale on a row-group level, rather than a file level. But for PyIceberg, I don't think that makes a lot of sense. I'm fine with simplifying this 👍

@Fokko Fokko merged commit 59dc8d1 into apache:main Nov 21, 2025
8 checks passed
@Fokko
Copy link
Contributor

Fokko commented Nov 21, 2025

Thanks @geruh for picking this up, and thanks @kevinjqliu for working on this 🙌

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

refactor FileScanTask, remove start and length

3 participants