V3: Fix invalid downcasting for nanos #2397

Fokko · 2025-08-28T09:44:26Z

Rationale for this change

It looks like we always downcast Arrow nanosecond types to microseconds.
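To illustrate why an unconditional downcast matters, here is a minimal sketch in plain Python (no PyArrow involved) of what is lost when a nanosecond epoch value is truncated to microseconds. The helper `ns_to_us` and the sample value are hypothetical and only serve the illustration; they are not part of the PyIceberg code base.

```python
NANOS_PER_MICRO = 1_000


def ns_to_us(epoch_ns: int) -> int:
    """Truncate an epoch timestamp in nanoseconds to whole microseconds."""
    return epoch_ns // NANOS_PER_MICRO


# Hypothetical timestamp whose last three digits carry sub-microsecond detail.
original_ns = 1_756_374_266_123_456_789

downcast_us = ns_to_us(original_ns)
restored_ns = downcast_us * NANOS_PER_MICRO

# The sub-microsecond part cannot be recovered after the downcast.
lost_ns = original_ns - restored_ns
print(f"lost {lost_ns} ns of precision")
```

For a table format that can store nanoseconds, this lost remainder is exactly the information that should have been preserved.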

Are these changes tested?

Are there any user-facing changes?

raulcd · 2025-08-28T13:07:04Z

tests/io/test_pyarrow.py

+
+@pytest.mark.parametrize("format_version", [1, 2, 3])
+def test_task_to_record_batches_nanos(format_version: TableVersion, tmpdir: str) -> None:
+    from datetime import datetime


This import seems unnecessary now.

Good one, I expected the linter to clean that up 🚀

rambleraptor

One small question, but looks good to me otherwise.

rambleraptor · 2025-08-28T15:28:10Z

pyiceberg/io/pyarrow.py

        self._bound_row_filter = bind(table_metadata.schema(), row_filter, case_sensitive=case_sensitive)
        self._case_sensitive = case_sensitive
        self._limit = limit
+        self._downcast_ns_timestamp_to_us = Config().get_bool(DOWNCAST_NS_TIMESTAMP_TO_US_ON_WRITE)


Do we need to check the format version for downcasting? (We have the table_metadata already, so we have access to it)
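One way to picture the check being asked about is the sketch below. It assumes that only format version 3 and later can store nanosecond timestamps natively, so downcasting would only apply to older versions. The function `should_downcast_ns_to_us` is hypothetical and is not the implementation in this PR; what happens when downcasting is disabled on an older table is deliberately left out.

```python
def should_downcast_ns_to_us(format_version: int, downcast_enabled: bool) -> bool:
    """Decide whether nanosecond timestamps should be downcast to microseconds.

    Assumption: tables with format version >= 3 keep nanosecond precision,
    so the downcast flag is only honoured for versions 1 and 2.
    """
    supports_nanos = format_version >= 3
    return downcast_enabled and not supports_nanos


# Walk through the same format versions the parametrized test covers.
for version in (1, 2, 3):
    print(version, should_downcast_ns_to_us(version, downcast_enabled=True))
```

Keeping the decision in one small predicate would make it straightforward to cover each format version in a parametrized test.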

kevinjqliu

LGTM!

raulcd

Approving as my previous comment has been fixed

kevinjqliu · 2025-08-28T17:45:33Z

It would be great to add an integration test showing Spark writing V3 with nanos and PyIceberg reading it. We can do that as a follow-up.

kevinjqliu · 2025-08-28T17:46:01Z

Thanks for the PR @Fokko and thanks @rambleraptor and @raulcd for the review

V3: Fix invalid downcasting for nanos

e1c9c4b

raulcd reviewed Aug 28, 2025


rambleraptor approved these changes Aug 28, 2025


kevinjqliu approved these changes Aug 28, 2025


Thanks Raúl

c948b33

raulcd approved these changes Aug 28, 2025


kevinjqliu merged commit 52ff684 into apache:main Aug 28, 2025
10 checks passed

4 participants
