Skip to content

Conversation

@yingjianwu98
Copy link
Contributor

@yingjianwu98 yingjianwu98 commented Apr 21, 2025

Closes #1779

Rationale for this change

See issue

Are these changes tested?

Tested locally

Are there any user-facing changes?

No


def set_partition_summary_limit(self, limit: int) -> None:
self.max_changed_partitions_for_summaries = limit
self.max_changed_partitions_for_summaries = partition_summary_limit
Copy link
Contributor

@Fokko Fokko Apr 22, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Let's also keep the method below so people that actually use this, won't get any errors:

Suggested change
self.max_changed_partitions_for_summaries = partition_summary_limit
self.max_changed_partitions_for_summaries = partition_summary_limit
def set_partition_summary_limit(self, limit: int) -> None:
self.max_changed_partitions_for_summaries = limit

@Fokko
Copy link
Contributor

Fokko commented Apr 22, 2025

Thanks @stevie9868 for working on this 🙌 I left one comment 👍

@yingjianwu98
Copy link
Contributor Author

yingjianwu98 commented Apr 23, 2025

Thanks @Fokko !
I agree and have updated based on the comment.

Copy link
Contributor

@kevinjqliu kevinjqliu left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm!

self.max_changed_partitions_for_summaries = 0
self.max_changed_partitions_for_summaries = partition_summary_limit

def set_partition_summary_limit(self, limit: int) -> None:
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

maybe we can deprecate this in favor of the init function

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

ah, I was thinking about this.
And I think @Fokko makes a good point that people might be using this function

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Deprecation means that we first signal the user, see: https://py.iceberg.apache.org/contributing/#api-compatibility, and in the next release, we can remove it. I don't have strong feelings around removing this one.

@yingjianwu98 yingjianwu98 force-pushed the yingjianw/addPartitionSummaryLimitIntoSCCInit branch from 2542c5c to e321bc4 Compare April 26, 2025 14:56
@yingjianwu98
Copy link
Contributor Author

I have rebased and fixed the test, thanks!

@kevinjqliu kevinjqliu merged commit e8bdc87 into apache:main Apr 28, 2025
7 checks passed
@kevinjqliu
Copy link
Contributor

Thanks @stevie9868 for the PR and @Fokko for the review :)

gabeiglio pushed a commit to Netflix/iceberg-python that referenced this pull request Aug 13, 2025
apache#1940)

<!--
Thanks for opening a pull request!
-->

<!-- In the case this PR will resolve an issue, please replace
${GITHUB_ISSUE_ID} below with the actual Github issue id. -->
Closes apache#1779

# Rationale for this change
See
[issue](apache#1779 (comment))
# Are these changes tested?
Tested locally
# Are there any user-facing changes?
No

<!-- In the case of user-facing changes, please add the changelog label.
-->

---------

Co-authored-by: Yingjian Wu <yingjianw@netflix.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Refactor setting the max_changed_partitions_for_summaries

3 participants