
Conversation

@TrevorBergeron (Contributor)

Thank you for opening a Pull Request! Before submitting your PR, there are a few things you can do to make sure it goes smoothly:

  • Make sure to open an issue as a bug/issue before writing your code! That way we can discuss the change, evaluate designs, and agree on the general idea
  • Ensure the tests and linter pass
  • Code coverage does not decrease (if any source code was changed)
  • Appropriate docs were updated (if necessary)

Fixes #<issue_number_goes_here> 🦕

product-auto-label bot added labels on Oct 9, 2025: size: l (Pull request size is large), api: bigquery (Issues related to the googleapis/python-bigquery-dataframes API), samples (Issues that are directly related to samples).
product-auto-label bot added the size: xl label (Pull request size is extra large) and removed size: l on Oct 16, 2025.
@TrevorBergeron requested a review from tswast on October 16, 2025 17:45
@TrevorBergeron marked this pull request as ready for review on October 16, 2025 17:45
@TrevorBergeron requested review from a team as code owners on October 16, 2025 17:45
@tswast (Collaborator) left a comment

At a high level, I think this makes a lot of sense. My main worry would be the case of ExecuteResult with local data. If we are keeping these around more, is there a risk of memory leaks?

dfs = iter(map(self._copy_index_to_pandas, dfs))

- total_rows = execute_result.total_rows
+ total_rows = result_batches.approx_total_rows
Collaborator

As we discussed offline, I think it'd be helpful to have cases where we know (or at least have very high confidence) that this count is exact, such as when reading the destination table of a query.

Contributor Author


We do insert the row count directly into the tree when it is known exactly (or at least some of the time we know it exactly; we might be missing some opportunities). At a certain point we intermingle known and approximate counts before surfacing them, though. We could keep them separate if really needed, but that adds a bit more complexity.
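
For illustration, a minimal sketch of the bookkeeping described above, where exact counts get demoted to approximate once they are combined with estimates; the names here (`RowCountEstimate`, `concat`, `filtered`) are hypothetical and not the actual bigframes internals:

```python
from __future__ import annotations

import dataclasses
from typing import Optional


@dataclasses.dataclass(frozen=True)
class RowCountEstimate:
    """Row count for a plan node, tagged with whether it is known exactly."""

    count: Optional[int]  # None when there is no estimate at all
    exact: bool = False

    def concat(self, other: "RowCountEstimate") -> "RowCountEstimate":
        # Concatenation adds counts; the result is exact only if both inputs are.
        if self.count is None or other.count is None:
            return RowCountEstimate(None, False)
        return RowCountEstimate(self.count + other.count, self.exact and other.exact)

    def filtered(self, selectivity: float) -> "RowCountEstimate":
        # Any filter turns an exact count into an approximation.
        if self.count is None:
            return self
        return RowCountEstimate(int(self.count * selectivity), exact=False)


# An exact count (e.g. from a query's destination table) combined with an
# estimated one surfaces as approximate.
exact_part = RowCountEstimate(1_000, exact=True)
approx_part = RowCountEstimate(250, exact=False)
print(exact_part.concat(approx_part))  # RowCountEstimate(count=1250, exact=False)
```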

@TrevorBergeron (Contributor Author)

> At a high level, I think this makes a lot of sense. My main worry would be the case of ExecuteResult with local data. If we are keeping these around more, is there a risk of memory leaks?

Yeah, this refactor is intended to help with caching, but doesn't actually add any caching behavior for now. So, these data handles will be garbage collected just fine.
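
A small sketch of why the memory concern doesn't bite yet: as long as no cache layer keeps a strong reference to the result handle, dropping the caller's reference is enough for the local data to be collected. `LocalExecuteResult` here is a hypothetical stand-in, not the real ExecuteResult class, and the example assumes pyarrow is installed:

```python
import gc
import weakref

import pyarrow as pa


class LocalExecuteResult:
    """Hypothetical stand-in for a result handle that owns local data."""

    def __init__(self, table: pa.Table):
        self._table = table

    def batches(self):
        return self._table.to_batches()


# No global cache holds a strong reference, so releasing the caller's
# reference lets the handle (and the data it owns) be garbage collected.
result = LocalExecuteResult(pa.table({"a": [1, 2, 3]}))
probe = weakref.ref(result)
del result
gc.collect()
assert probe() is None  # the handle and its local data are gone
```

If caching is added later, a weak-value registry would preserve this behavior while still allowing live results to be reused.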

@TrevorBergeron requested a review from tswast on October 29, 2025 00:13
  for scan_item in node.scan_list.items:
      if (
-         scan_item.dtype == dtypes.JSON_DTYPE
+         node.source.schema.get_type(scan_item.source_id) == dtypes.JSON_DTYPE
Collaborator

I'm curious what the purpose of this extra layer of indirection is compared to the previous? I guess it's partially undoing some of the changes Shenyang and Chelsea made to plumb type info into the tree?

Contributor Author

It's mostly from moving the schema to the source definition rather than the scan definition, which brings it more in line with the local source, where a source object is not just data but also its interpretation (this mostly matters for virtual types).
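
Loosely mirroring the diff above, a hypothetical sketch of what moving the schema onto the source definition looks like; `SourceDefinition`, `SourceSchema`, and `ScanItem` are illustrative names, not the real bigframes classes:

```python
import dataclasses
from typing import Mapping, Tuple


@dataclasses.dataclass(frozen=True)
class SourceSchema:
    # Maps source column ids to dtype names (e.g. "JSON", "INT64").
    types: Mapping[str, str]

    def get_type(self, source_id: str) -> str:
        return self.types[source_id]


@dataclasses.dataclass(frozen=True)
class SourceDefinition:
    """The source carries both where the data lives and how to interpret it."""

    table_id: str
    schema: SourceSchema


@dataclasses.dataclass(frozen=True)
class ScanItem:
    # The scan only records which source column it reads and the output id;
    # the dtype is resolved through the source's schema rather than copied here.
    source_id: str
    output_id: str


def has_json_columns(source: SourceDefinition, scan_items: Tuple[ScanItem, ...]) -> bool:
    return any(
        source.schema.get_type(item.source_id) == "JSON" for item in scan_items
    )


src = SourceDefinition(
    table_id="proj.dataset.table",
    schema=SourceSchema(types={"col_a": "JSON", "col_b": "INT64"}),
)
assert has_json_columns(src, (ScanItem("col_a", "out_a"), ScanItem("col_b", "out_b")))
```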

@tswast (Collaborator) commented on Oct 31, 2025

The e2e failures are "Blob"-related. Probably flakes.

@TrevorBergeron merged commit b23cf83 into main on Oct 31, 2025
24 of 25 checks passed
@TrevorBergeron deleted the new_execute_result branch on October 31, 2025 17:44