[Do not merge] Iterative `bind` with a stack instead of recursion #1783

smaheshwar-pltr · 2025-03-10T23:32:05Z

// Do not merge.

For fun, writing out the call stack to avoid the recursion that caused #1759.

smaheshwar-pltr · 2025-03-10T23:35:02Z

tests/table/test_upsert.py

    assert upd.rows_inserted == 4


+def test_large_upsert_into_empty_table(catalog: Catalog) -> None:


This now-passing test fails on main with

def __getitem__(self, key): > return self.data[ref(key)] E RecursionError: maximum recursion depth exceeded in comparison /<PATH>/weakref.py:416: RecursionError !!! Recursion error detected, but an error occurred locating the origin of recursion. The following exception happened when comparing locals in the stack frame: RecursionError: maximum recursion depth exceeded Displaying first and last 10 stack frames out of 962.

smaheshwar-pltr · 2025-03-10T23:35:32Z

tests/table/test_upsert.py

+    num_columns = 50
+    num_rows = 10000


Actually, 20 and 1000 is enough to make main fail this test with (default) recursion depth exceeded

kevinjqliu · 2025-03-11T18:30:50Z

changing the visitor to an iterative approach seems like a sound solution. are there any reasons we dont want to do this?

Fokko · 2025-03-11T19:20:50Z

I like the solution!

changing the visitor to an iterative approach seems like a sound solution. are there any reasons we dont want to do this?

My only concern is performance. We probably want to check what the impact is, since we bind to schemas all over the codebase. It would be good to see if we can get some numbers on the impact on the performance.

smaheshwar-pltr · 2025-03-23T13:08:17Z

Thanks for taking a look folks, and apologies for the delayed response.

are there any reasons we don't want to do this?

This approach serves only to circumvent recursion. When it's said that "iteration is faster than recursion", I don't think it refers to just concretising the frames / call-stack in memory - I believe this aligns with @Fokko mentioning that performance might be worsened, which I agree with. If we do want to go with this PR's approach, then I think we should consider it for the other visitors in visitors.py (after benchmarking performance). I also wonder if (a) some decorator magic is possible for the conversion, because IMO these simple recursive visitors read nicer and feel less error-prone (we can rewrite to be tail-recursive and use tco but I think the rewrite introduces complexity similar to this PR), or if (b) there's some way to un-intrusively keep the current recursive approach.

I see some recent activity / PRs on the original issue so happy to wait for that discussion to conclude?

Iterative binding with a stack instead of recursion

4caba1c

smaheshwar-pltr changed the title ~~Iterative bind with a stack instead of recursion~~ [Do not merge] Iterative bind with a stack instead of recursion Mar 10, 2025

smaheshwar-pltr commented Mar 10, 2025

View reviewed changes

smaheshwar-pltr mentioned this pull request Mar 10, 2025

Issue during Upsert #1759

Closed

kevinjqliu mentioned this pull request Mar 11, 2025

[bug] bind visitor causes RecursionError: maximum recursion depth exceeded #1785

Closed

3 tasks

Fokko mentioned this pull request Mar 24, 2025

Use a balanced tree instead of unbalanced one to prevent recursion error in create_match_filter #1830

Merged

smaheshwar-pltr closed this Mar 31, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Do not merge] Iterative `bind` with a stack instead of recursion #1783

[Do not merge] Iterative `bind` with a stack instead of recursion #1783

Uh oh!

smaheshwar-pltr commented Mar 10, 2025

Uh oh!

smaheshwar-pltr Mar 10, 2025

Uh oh!

smaheshwar-pltr Mar 10, 2025

Uh oh!

kevinjqliu commented Mar 11, 2025

Uh oh!

Fokko commented Mar 11, 2025

Uh oh!

smaheshwar-pltr commented Mar 23, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		assert upd.rows_inserted == 4


		def test_large_upsert_into_empty_table(catalog: Catalog) -> None:

[Do not merge] Iterative bind with a stack instead of recursion #1783

[Do not merge] Iterative bind with a stack instead of recursion #1783

Uh oh!

Conversation

smaheshwar-pltr commented Mar 10, 2025

Uh oh!

smaheshwar-pltr Mar 10, 2025

Choose a reason for hiding this comment

Uh oh!

smaheshwar-pltr Mar 10, 2025

Choose a reason for hiding this comment

Uh oh!

kevinjqliu commented Mar 11, 2025

Uh oh!

Fokko commented Mar 11, 2025

Uh oh!

smaheshwar-pltr commented Mar 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[Do not merge] Iterative `bind` with a stack instead of recursion #1783

[Do not merge] Iterative `bind` with a stack instead of recursion #1783

smaheshwar-pltr commented Mar 23, 2025 •

edited

Loading