Skip to content

Conversation

@chelsea-lin
Copy link
Contributor

Fixes internal issue 445774480 🦕

@product-auto-label product-auto-label bot added size: m Pull request size is medium. api: bigquery Issues related to the googleapis/python-bigquery-dataframes API. labels Sep 22, 2025
@chelsea-lin chelsea-lin marked this pull request as ready for review September 23, 2025 19:03
@chelsea-lin chelsea-lin requested review from a team as code owners September 23, 2025 19:03
Comment on lines 96 to 104
excluded_cols=[
"bytes_col",
"date_col",
"datetime_col",
"time_col",
"timestamp_col",
"string_col",
],
).node
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can we modify the MedianOp definition itself to exclude these then?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Updated the codes so that the Ibis can allow the non-numeric dtypes for the MedianOp now.

Comment on lines 65 to 67
# TODO(swast): Allow switching between exact and approximate median.
# For now, the best we can do is an approximate median when we're doing
# an aggregation, as PERCENTILE_CONT is only an analytic function.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Exact and approximate median work a lot differently mechanically, so I probably wouldn't handle this far down in the stack, we would probably dispatch a different operator close to the API surface.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Removed the comments. We are using the QuantileOp for the exact median now.

@chelsea-lin chelsea-lin force-pushed the main_chelsealin_median branch from 714bcb3 to 1edee54 Compare September 23, 2025 20:34
@chelsea-lin chelsea-lin merged commit ca1e44c into main Sep 23, 2025
19 of 25 checks passed
@chelsea-lin chelsea-lin deleted the main_chelsealin_median branch September 23, 2025 21:12
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

api: bigquery Issues related to the googleapis/python-bigquery-dataframes API. size: m Pull request size is medium.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants