[python] Fix filter on data evolution table not working issue #7211

XiaoHongbo-Hope · 2026-02-04T10:06:54Z

Purpose/Problem

Filter not working on data evolution read: when a predicate is provided, all rows are returned.

Tests

API and Format

Documentation

JingsongLi · 2026-02-11T22:47:56Z

paimon-python/pypaimon/read/reader/filter_record_batch_reader.py

+        simple_null = self._filter_batch_simple_null(batch)
+        if simple_null is not None:
+            return simple_null
+        if not self.predicate.has_null_check():


We can just use Predicate.to_arrow, it is same to other table modes.

JingsongLi · 2026-02-11T22:48:06Z

paimon-python/pypaimon/read/reader/filter_record_batch_reader.py

+                    schema=result.schema,
+                )
+            except (TypeError, ValueError, pa.ArrowInvalid) as e:
+                logger.debug(


What exception here?

JingsongLi · 2026-02-11T22:49:35Z

paimon-python/pypaimon/read/scanner/file_scanner.py

            return False
        if self.partition_key_predicate and not self.partition_key_predicate.test(entry.partition):
            return False
        if self.deletion_vectors_enabled and entry.file.level == 0:  # do not read level 0 file


We can refactor this, this if should be in is_primary_key_table.

JingsongLi · 2026-02-11T22:50:34Z

paimon-python/pypaimon/read/scanner/file_scanner.py

            if not self.predicate:
                return True
-            if self.predicate_for_stats is None:
+            if self.predicate_for_stats is None or self.data_evolution:


Create a separate if. And we should add comments to this if, explain why there is no filtering done here.

JingsongLi · 2026-02-11T22:52:37Z

paimon-python/pypaimon/read/split_read.py

        super().__init__(table, predicate, read_type, actual_split, row_tracking_enabled)

+    def _push_down_predicate(self) -> Optional[Predicate]:
+        # Do not push predicate to file readers;


Detailed comments, why not push predicate.

JingsongLi · 2026-02-11T22:54:31Z

paimon-python/pypaimon/read/split_read.py

-        return ConcatBatchReader(suppliers)
+        merge_reader = ConcatBatchReader(suppliers)
+        if self.predicate is not None:
+            # Only apply filter when all predicate columns are in read_type (e.g. projected schema).


What we are returning here is complete row, right? So this check should be applicable to all table modes? That shouldn't be added here, it should be verified in SplitRead.init.

XiaoHongbo-Hope marked this pull request as ready for review February 4, 2026 10:12

XiaoHongbo-Hope marked this pull request as draft February 4, 2026 11:06

XiaoHongbo-Hope marked this pull request as ready for review February 8, 2026 07:54

XiaoHongbo-Hope marked this pull request as draft February 8, 2026 09:25

XiaoHongbo-Hope force-pushed the normal_filter_support branch 2 times, most recently from 4354588 to b8e709d Compare February 10, 2026 11:33

XiaoHongbo-Hope marked this pull request as ready for review February 10, 2026 14:12

XiaoHongbo-Hope force-pushed the normal_filter_support branch from c694ad0 to 867ca01 Compare February 11, 2026 04:11

fix filter not working issue on data evolution

653f23e

XiaoHongbo-Hope force-pushed the normal_filter_support branch from 867ca01 to 653f23e Compare February 11, 2026 04:12

fix merge issue during rebase

101be4e

JingsongLi reviewed Feb 11, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[python] Fix filter on data evolution table not working issue #7211

[python] Fix filter on data evolution table not working issue #7211

XiaoHongbo-Hope commented Feb 4, 2026 •

edited

Loading

Uh oh!

JingsongLi Feb 11, 2026

Uh oh!

JingsongLi Feb 11, 2026

Uh oh!

JingsongLi Feb 11, 2026

Uh oh!

JingsongLi Feb 11, 2026

Uh oh!

JingsongLi Feb 11, 2026

Uh oh!

JingsongLi Feb 11, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[python] Fix filter on data evolution table not working issue #7211

Are you sure you want to change the base?

[python] Fix filter on data evolution table not working issue #7211

Conversation

XiaoHongbo-Hope commented Feb 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose/Problem

Tests

API and Format

Documentation

Uh oh!

JingsongLi Feb 11, 2026

Choose a reason for hiding this comment

Uh oh!

JingsongLi Feb 11, 2026

Choose a reason for hiding this comment

Uh oh!

JingsongLi Feb 11, 2026

Choose a reason for hiding this comment

Uh oh!

JingsongLi Feb 11, 2026

Choose a reason for hiding this comment

Uh oh!

JingsongLi Feb 11, 2026

Choose a reason for hiding this comment

Uh oh!

JingsongLi Feb 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

XiaoHongbo-Hope commented Feb 4, 2026 •

edited

Loading

JingsongLi Feb 11, 2026 •

edited

Loading