Skip to content
Closed
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
453 commits
Select commit Hold shift + click to select a range
d22e293
Include data types in logical plans of inferred prepare statements (#…
brayanjuls May 21, 2025
39063f6
docs: Fix typos and minor grammatical issues in Architecture docs (#1…
patrickcsullivan May 22, 2025
cb45f1f
add top-memory-consumers option in cli (#16081)
ding-young May 22, 2025
67a2173
fix ci extended test (#16144)
2010YOUY01 May 22, 2025
e5f596b
Fix: handle column name collisions when combining UNION logical input…
LiaCastaneda May 22, 2025
5293b70
adding support for Min/Max over LargeList and FixedSizeList (#16071)
logan-keede May 22, 2025
dc8161e
Move prepare/parameter handling tests into `params.rs` (#16141)
liamzwbao May 22, 2025
ce835da
Add `StateFieldsArgs::return_field` (#16112)
alamb May 22, 2025
17fe504
Support filtering specific sqllogictests identified by line number (#…
gabotechs May 23, 2025
e305353
Enrich GroupedHashAggregateStream name to ease debugging Resources ex…
ahmed-mez May 24, 2025
a256eb4
chore(deps): bump uuid from 1.16.0 to 1.17.0 (#16162)
dependabot[bot] May 24, 2025
2afa3aa
Minor: Fix links in substrait readme (#16156)
alamb May 24, 2025
2199e50
Remove Filter::having field (#16154)
findepi May 24, 2025
9031d6b
Clarify docs and names in parquet predicate pushdown tests (#16155)
alamb May 24, 2025
d4218fd
Minor: Fix name() for FilterPushdown physical optimizer rule (#16175)
adriangb May 24, 2025
2add813
migrate tests in `pool.rs` to use insta (#16145)
lifan-ake May 24, 2025
af67caa
refactor(optimizer): add `.with_schema` for defining test tables (#16…
atahanyorganci May 24, 2025
dacdda2
[Minor] Speedup TPC-H benchmark run with memtable option (#16159)
Dandandan May 24, 2025
34f250a
Fast path for joins with distinct values in build side (#16153)
Dandandan May 24, 2025
3b551e9
chore: Reduce repetition in the parameter type inference tests (#16079)
jsai28 May 25, 2025
605ccbd
feat: array_length for fixed size list (#16167)
chenkovsky May 25, 2025
c5df6ee
fix: remove trailing whitespace in `Display` for `LogicalPlan::Projec…
atahanyorganci May 26, 2025
f3aed4a
chore(deps): bump tokio from 1.45.0 to 1.45.1 (#16190)
dependabot[bot] May 26, 2025
16c7939
Improve `unproject_sort_expr` to handle arbitrary expressions (#16127)
phillipleblanc May 27, 2025
2d12bf6
chore(deps): bump rustyline from 15.0.0 to 16.0.0 (#16194)
dependabot[bot] May 27, 2025
260a28a
feat: ADD sha2 spark function (#16168)
getChan May 27, 2025
db0ab74
Add macro for creating DataFrame (#16090) (#16104)
cj-zhukov May 27, 2025
68e26f1
migrate `logical_plan` tests to insta (#16184)
lifan-ake May 27, 2025
aaae4d7
doc: Move `dataframe!` example into dedicated example (#16197)
comphead May 27, 2025
a92c5e9
chore(deps): bump clap from 4.5.38 to 4.5.39 (#16204)
dependabot[bot] May 28, 2025
7d3c7d8
implement `AggregateExec.partition_statistics` (#15954)
UBarney May 28, 2025
081e95c
doc: add diagram to describe how DataSource, FileSource, and DataSour…
onlyjackfrost May 28, 2025
33a2531
Clarify documentation about gathering statistics for parquet files (#…
alamb May 28, 2025
56a2af7
Propagate .execute() calls immediately in `RepartitionExec` (#16093)
gabotechs May 28, 2025
00132da
Shift from Field to FieldRef for all user defined functions (#16122)
timsaucer May 28, 2025
7002a00
Set aggregation hash seed (#16165)
ctsk May 28, 2025
5944e8b
feat: create builder for disk manager (#16191)
jdrouet May 29, 2025
72f4eab
Fix ScalarStructBuilder::build() for an empty struct (#16205)
Blizzara May 29, 2025
2c2f225
Return an error on overflow in `do_append_val_inner` (#16201)
liamzwbao May 29, 2025
21248fb
Change default SQL mapping for `VARCAHR` from `Utf8` to `Utf8View` (…
zhuqi-lucas May 30, 2025
795988d
chore(deps): bump testcontainers-modules from 0.12.0 to 0.12.1 (#16212)
dependabot[bot] May 30, 2025
9a5c61d
Substrait: handle identical grouping expressions (#16189)
cht42 May 30, 2025
c6e5c91
Add new stats pruning helpers to allow combining partition values in …
adriangb May 30, 2025
900279c
Implement schema adapter support for FileSource and add integration t…
kosiew May 30, 2025
376eb03
Minor: update documentation for PrunableStatistics (#16213)
alamb May 30, 2025
6844e56
Minor: Remove dead code (#16215)
alamb May 31, 2025
5a86142
Add change to VARCHAR in the upgrade guide (#16216)
alamb Jun 1, 2025
7248259
Reduce size of `Expr` struct (#16207)
hendrikmakait Jun 2, 2025
6f1ca29
fix: metadata of join schema (#16221)
chenkovsky Jun 2, 2025
8b9b2fc
fix: add missing row count limits to TPC-H queries (#16230)
0ax1 Jun 3, 2025
5b08b84
Remove use of deprecated dict_ordered in datafusion-proto (#16218) (#…
cj-zhukov Jun 3, 2025
deeff88
debug cargo command in bench script (#16236)
2010YOUY01 Jun 3, 2025
d68111d
Add iceberg-rust to user list (#16246)
jonathanc-n Jun 4, 2025
3236cc0
Simplify FileSource / SchemaAdapterFactory API (#16214)
alamb Jun 4, 2025
992d156
Prepare for 48.0.0 release: Version and Changelog (#16238)
xudong963 Jun 4, 2025
f513e2c
Add dicts to aggregation fuzz testing (#16232)
blaginin Jun 4, 2025
ffbc3a7
chore(deps): bump sysinfo from 0.35.1 to 0.35.2 (#16247)
dependabot[bot] Jun 4, 2025
0c30374
Improve performance of constant aggregate window expression (#16234)
suibianwanwank Jun 4, 2025
abbf73d
Support compound identifier when parsing tuples (#16225)
hozan23 Jun 4, 2025
bf7859e
Schema adapter helper (#16108)
kosiew Jun 4, 2025
9ae41b1
Update tpch, clickbench, sort_tpch to mark failed queries (#16182)
ding-young Jun 5, 2025
448c985
Adjust slttest to pass without RUST_BACKTRACE enabled (#16251)
alamb Jun 5, 2025
ab8cd8c
fix: NaN semantics in GROUP BY (#16256)
chenkovsky Jun 5, 2025
5e307b3
Handle dicts for distinct count (#15871)
blaginin Jun 5, 2025
eeee6b0
Add `--substrait-round-trip` option in sqllogictests (#16183)
gabotechs Jun 5, 2025
25727d4
Minor: fix upgrade papercut where structure was moved (#16264)
alamb Jun 5, 2025
5d3ed9c
feat: Add Aggregate UDF to FFI crate (#14775)
timsaucer Jun 5, 2025
2c8241a
feat(small): Add `BaselineMetrics` to `generate_series()` table funct…
2010YOUY01 Jun 5, 2025
2a7f64a
feat: Add Window UDFs to FFI Crate (#16261)
timsaucer Jun 5, 2025
85f6621
Chore: update DF48 changelog (#16269)
xudong963 Jun 6, 2025
1e20563
chore(deps): bump sqllogictest from 0.28.2 to 0.28.3 (#16286)
dependabot[bot] Jun 6, 2025
adf07bb
chore(deps-dev): bump webpack-dev-server (#16253)
dependabot[bot] Jun 6, 2025
367331e
Improve DataFusion subcrate readme files (#16263)
alamb Jun 6, 2025
4cf1148
Fix intermittent SQL logic test failure in limit.slt by adding ORDER …
kosiew Jun 6, 2025
f198a01
Extend benchmark comparison script with more detailed statistics (#16…
pepijnve Jun 6, 2025
fbafea4
feat: Support defining custom MetricValues in PhysicalPlans (#16195)
sfluor Jun 6, 2025
0f83c1d
feat: add metadata to literal expressions (#16170)
timsaucer Jun 6, 2025
1daa5ed
[MAJOR] Equivalence System Overhaul (#16217)
ozankabak Jun 7, 2025
95771da
Minor: Add upgrade guide for `Expr::WindowFunction` (#16313)
alamb Jun 8, 2025
39c100a
Fix `array_position` on empty list (#16292)
Blizzara Jun 8, 2025
59c6a02
chore(deps): bump flate2 from 1.1.1 to 1.1.2 (#16338)
dependabot[bot] Jun 9, 2025
8fe1b57
chore(deps): bump petgraph from 0.8.1 to 0.8.2 (#16337)
dependabot[bot] Jun 9, 2025
54d85fc
chore(deps): bump substrait from 0.56.0 to 0.57.0 (#16143)
dependabot[bot] Jun 9, 2025
78e4202
feat: Allow cancelling of grouping operations which are CPU bound (#1…
zhuqi-lucas Jun 9, 2025
287b2a9
Add test for ordering of predicate pushdown into parquet (#16169)
adriangb Jun 9, 2025
bd85bed
Fix distinct count for DictionaryArray to correctly account for nulls…
kosiew Jun 9, 2025
2986415
Fix inconsistent schema projection in ListingTable even when schema i…
kosiew Jun 10, 2025
1d73c59
Fix: mark "Spilling (to disk) Joins" as supported in features (#16343)
kosiew Jun 10, 2025
7477aa6
Add late pruning of Parquet files based on file level statistics (#16…
adriangb Jun 10, 2025
b41acf3
tpch: move reading of SQL queries out of timed span. (#16357)
pepijnve Jun 10, 2025
49983df
chore(deps): bump clap from 4.5.39 to 4.5.40 (#16354)
dependabot[bot] Jun 11, 2025
25a2d9f
chore(deps): bump syn from 2.0.101 to 2.0.102 (#16355)
dependabot[bot] Jun 11, 2025
f35416e
Fix cp_solver doc format (#16352)
xudong963 Jun 11, 2025
3a312a9
docs: Expand `MemoryPool` docs with related structs (#16289)
2010YOUY01 Jun 11, 2025
e4166b3
Encapsulate metadata for literals on to a `FieldMetadata` structure (…
alamb Jun 11, 2025
1d61f31
Add support `UInt64` and other integer data types for `to_hex` (#16335)
tlm365 Jun 11, 2025
a91e042
Support datafusion-cli access to public S3 buckets that do not requir…
alamb Jun 11, 2025
06ccae2
Document `copy_array_data` function with example (#16361)
alamb Jun 11, 2025
8a2d618
Fix array_agg memory over use (#16346)
gabotechs Jun 11, 2025
55f1ea6
Update publish command (#16377)
xudong963 Jun 11, 2025
f8aad68
Add more context to error message for datafusion-cli config failure (…
alamb Jun 11, 2025
9c98b01
Fix: datafusion-sqllogictest 48.0.0 can't be published (#16376)
xudong963 Jun 11, 2025
df49f9f
fix: preserve null_equals_null flag in eliminate_cross_join rule (#16…
waynexia Jun 11, 2025
e1716f9
feat: support FixedSizeList for array_has (#16333)
chenkovsky Jun 11, 2025
4a857f0
bug: remove busy-wait while sort is ongoing (#16322)
pepijnve Jun 12, 2025
0e84041
Document Table Constraint Enforcement Behavior in Custom Table Provid…
kosiew Jun 12, 2025
4fa8232
chore: refactor Substrait consumer's "rename_field" and implement the…
Blizzara Jun 12, 2025
31c570e
feat: Support tpch and tpch10 csv format (#16373)
zhuqi-lucas Jun 12, 2025
3fe4640
chore(deps): bump object_store from 0.12.1 to 0.12.2 (#16368)
dependabot[bot] Jun 12, 2025
42e55bd
Disable `datafusion-cli tests in hash (#16382)
alamb Jun 12, 2025
3764fe7
Fix array_concat with NULL arrays (#16348)
alexanderbianchi Jun 12, 2025
3a371ed
doc: Add SQL examples for SEMI + ANTI Joins (#16316)
jonathanc-n Jun 12, 2025
6cd03e2
Minor: add testing case for add YieldStreamExec and polish docs (#16369)
zhuqi-lucas Jun 13, 2025
79f5c8d
chore(deps): bump aws-config from 1.6.3 to 1.8.0 (#16394)
dependabot[bot] Jun 13, 2025
4dd6923
fix: Fix SparkSha2 to be compliant with Spark response and add suppor…
rishvin Jun 13, 2025
5a2933e
fix typo in test file name (#16403)
adriangb Jun 13, 2025
94fd86a
Add topk_tpch benchmark (#16410)
Dandandan Jun 14, 2025
6cfdd59
Remove some clones (#16404)
simonvandel Jun 14, 2025
ef56e7b
chore(deps): bump syn from 2.0.102 to 2.0.103 (#16393)
dependabot[bot] Jun 15, 2025
8e8c116
[datafusion-spark] Example of using Spark compatible function library…
alamb Jun 15, 2025
869acf9
Simplify expressions passed to table functions (#16388)
simonvandel Jun 15, 2025
42fe050
Add fast paths for try_process_unnest (#16389)
simonvandel Jun 15, 2025
ca0b760
cleanup bench.sh usage message (#16416)
2010YOUY01 Jun 15, 2025
36e823f
Add note in upgrade guide about changes to `Expr::Scalar` in 48.0.0 (…
alamb Jun 16, 2025
dd936cb
Update PMC management instructions to follow new ASF process (#16417)
alamb Jun 16, 2025
7d16764
feat: Support RightMark join for NestedLoop and Hash join (#16083)
jonathanc-n Jun 16, 2025
ad0c21f
fix: Fixed error handling for `generate_series/range` (#16391)
jonathanc-n Jun 16, 2025
24f1bb5
Add design process section to the docs (#16397)
alamb Jun 16, 2025
324a271
chore(deps): bump rust_decimal from 1.37.1 to 1.37.2 (#16422)
dependabot[bot] Jun 16, 2025
06631c2
feat: mapping sql Char/Text/String default to Utf8View (#16290)
zhuqi-lucas Jun 17, 2025
d4033ee
Migrate core test to insta, part1 (#16324)
Chen-Yuan-Lai Jun 17, 2025
aa359c5
chore(deps): bump mimalloc from 0.1.46 to 0.1.47 (#16426)
dependabot[bot] Jun 17, 2025
2c5b9e8
chore(deps): bump libc from 0.2.172 to 0.2.173 (#16421)
dependabot[bot] Jun 17, 2025
0c3bb78
Unify Metadata Handing: use `FieldMetadata` in `Expr::Alias` and `Exp…
alamb Jun 17, 2025
1429c92
Dynamic filter pushdown for TopK sorts (#15770)
adriangb Jun 17, 2025
87218a1
fix: Enable WASM compilation by making sqlparser's recursive-protecti…
jonmmease Jun 17, 2025
11fc52d
Use dedicated NullEquality enum instead of null_equals_null boolean (…
tobixdev Jun 17, 2025
056f546
chore: generate basic spark function tests (#16409)
shehabgamin Jun 17, 2025
630aa7b
replace false with NullEqualsNothing (#16437)
ding-young Jun 18, 2025
20de40f
chore(deps): bump bzip2 from 0.5.2 to 0.6.0 (#16441)
dependabot[bot] Jun 18, 2025
accd225
Update Roadmap documentation (#16399)
alamb Jun 18, 2025
9083fc8
chore(deps): bump libc from 0.2.173 to 0.2.174 (#16440)
dependabot[bot] Jun 18, 2025
5ebc49d
feat: support fixed size list for array reverse (#16423)
chenkovsky Jun 18, 2025
4c3b847
feat: add SchemaProvider::table_type(table_name: &str) (#16401)
epgif Jun 18, 2025
4084894
fix: create file for empty stream (#16342)
chenkovsky Jun 18, 2025
7e0f623
Remove redundant license-header-check CI job (#16451)
alamb Jun 19, 2025
e6df27c
. (#16449)
AdamGS Jun 19, 2025
266daf8
doc: Add comments to clarify algorithm for `MarkJoin`s (#16436)
jonathanc-n Jun 19, 2025
43ba6f2
remove unused functions in SortExec (#16457)
adriangb Jun 19, 2025
2d7ae09
Set the default value of `datafusion.execution.collect_statistics` to…
AdamGS Jun 19, 2025
5ca4ff0
Temporarily fix bug in dynamic top-k optimization (#16465)
AdamGS Jun 19, 2025
3c4e39a
Add compression option to SpillManager (#16268)
ding-young Jun 20, 2025
aa1e6da
Ignore `sort_query_fuzzer_runner` (#16462)
blaginin Jun 20, 2025
d8ee8d8
feat: derive `Debug` and `Clone` for `ScalarFunctionArgs` (#16471)
crepererum Jun 20, 2025
5443c67
Redirect user defined webpags (#16475)
alamb Jun 20, 2025
a0eaf51
Revert "Ignore `sort_query_fuzzer_runner` (#16462)" (#16470)
2010YOUY01 Jun 20, 2025
7914624
Reapply "Ignore `sort_query_fuzzer_runner` (#16462)" (#16470) (#16485)
alamb Jun 20, 2025
8b03e5e
Use Tokio's task budget consistently, better APIs to support task can…
pepijnve Jun 20, 2025
a4f4b17
Fix constant window for evaluate stateful (#16430)
suibianwanwank Jun 20, 2025
7cecc18
doc: upgrade guide for new compression option for spill files (#16472)
2010YOUY01 Jun 21, 2025
85eebcd
Use UDTF name in logical plan table scan (#16468)
Jeadie Jun 21, 2025
e737d3f
refactor reassign_predicate_columns to accept an &Schema instead of &…
adriangb Jun 22, 2025
2bf8441
re-enable `sort_query_fuzzer_runner` (#16491)
adriangb Jun 22, 2025
fb01049
Example for using a separate threadpool for CPU bound work (try 3) (#…
alamb Jun 23, 2025
78433a7
fix: document and fix macro hygiene for `config_field!` (#16473)
crepererum Jun 23, 2025
2f1063e
chore(deps): bump syn from 2.0.103 to 2.0.104 (#16507)
dependabot[bot] Jun 23, 2025
b6c8cc5
use 'lit' as the field name for literal values (#16498)
adriangb Jun 23, 2025
cdaaef7
Introduce Async User Defined Functions (#14837)
goldmedal Jun 23, 2025
9556bcd
[datafusion-spark] Implement `factorical` function (#16125)
tlm365 Jun 23, 2025
c37b851
Minor: Add more links to cooperative / scheduling docs (#16484)
alamb Jun 23, 2025
26e27b0
doc: Document DESCRIBE comman in ddl.md (#16524)
krikera Jun 23, 2025
d6e8b07
Add DESC alias for DESCRIBE command (#16514)
lucqui Jun 23, 2025
a862f7a
Split clickbench query set into one file per query (#16476)
pepijnve Jun 23, 2025
1bfd888
Add more doc for physical filter pushdown (#16504)
xudong963 Jun 24, 2025
15a8738
Perf: Optimize CursorValues compare performance for StringViewArray (…
zhuqi-lucas Jun 24, 2025
c4d2cd7
chore: fix CI failures on `ddl.md` (#16526)
comphead Jun 24, 2025
ec92ed3
Support query filter on all benchmarks (#16477)
pepijnve Jun 24, 2025
921f4a0
fix: make `with_new_state` a trait method for `ExecutionPlan` (#16469)
geoffreyclaude Jun 24, 2025
59143c1
`TableProvider` to skip files in the folder which non relevant to sel…
comphead Jun 24, 2025
5400d71
Reuse `BaselineMetrics` in `UnnestMetrics` (#16497)
hendrikmakait Jun 24, 2025
63a10ba
Add some comments about adding new dependencies in datafusion-sql (#1…
alamb Jun 24, 2025
678c25d
Fix array_has to return false for empty arrays instead of null (#16529)
kosiew Jun 24, 2025
334d449
Minor: Add documentation to `AggregateWindowExpr::get_result_column` …
alamb Jun 24, 2025
969ed5e
Simplify predicates in `PushDownFilter` optimizer rule (#16362)
xudong963 Jun 25, 2025
7d3bf52
Add note for planning release in Upgrade Guides (#16534)
xudong963 Jun 25, 2025
20a723b
Fix WindowFrame::new with order_by (#16537)
findepi Jun 25, 2025
dd55e35
chore(deps): bump object_store from 0.12.1 to 0.12.2 (#16548)
dependabot[bot] Jun 25, 2025
a87d6f2
chore(deps): bump mimalloc from 0.1.46 to 0.1.47 (#16547)
dependabot[bot] Jun 25, 2025
e063a8d
Consolidate configuration sections in docs (#16544)
alamb Jun 25, 2025
dd2de91
Add support for Arrow Duration type in Substrait (#16503)
jkosh44 Jun 25, 2025
6f2747f
Allow unparser to override the alias name for the specific dialect (#…
goldmedal Jun 25, 2025
4a4ffd7
Avoid clones when calling find_window_exprs (#16551)
findepi Jun 25, 2025
1b36400
Move Pruning Logic to a Dedicated datafusion-pruning Crate for Improv…
kosiew Jun 25, 2025
fcb193b
Update spilled_bytes to report spill file size (#16535)
ding-young Jun 25, 2025
db13dd9
Minor: add clearer linl to the main website (#16556)
alamb Jun 26, 2025
0e48627
adapt filter expressions to file schema during parquet scan instead o…
adriangb Jun 26, 2025
b405380
Simplify AsyncScalarUdfImpl so it extends ScalarUdfImpl (#16523)
alamb Jun 26, 2025
3649dc8
datafusion-cli: Use correct S3 region if it is not specified (#16502)
liamzwbao Jun 26, 2025
0143b20
feat: support `map_entries` builtin function (#16557)
comphead Jun 26, 2025
0c30f13
Add nested struct casting support and integrate into SchemaAdapter (#…
kosiew Jun 27, 2025
9c6d6ee
Fix type of ExecutionOptions::time_zone (#16569)
findepi Jun 27, 2025
fffcd1f
Improve err message grammar (#16566)
findepi Jun 27, 2025
3839736
feat: add `array_min` scalar function and associated tests (#16574)
dharanad Jun 27, 2025
586a88c
refactor: move PruningPredicate into its own module (#16587)
adriangb Jun 27, 2025
d73f0e8
feat: Finalize support for `RightMark` join + `Mark` join swap (#16488)
jonathanc-n Jun 27, 2025
1884175
chore(deps): bump indexmap from 2.9.0 to 2.10.0 (#16582)
dependabot[bot] Jun 27, 2025
b4ba1c6
Skip re-pruning based on partition values and file level stats if the…
adriangb Jun 27, 2025
cce3f3f
fix: column indices in FFI partition evaluator (#16480)
timsaucer Jun 27, 2025
2999e41
Support timestamp and date arguments for `range` and `generate_series…
simonvandel Jun 27, 2025
8d34abb
Fix normalization of columns in JOIN ... USING. (#16560)
brunal Jun 27, 2025
1de4d0e
fix: support within_group (#16538)
chenkovsky Jun 27, 2025
8d772e5
Revert "feat: Finalize support for `RightMark` join + `Mark` join swa…
comphead Jun 27, 2025
9f3cc7b
move min_batch/max_batch to functions-aggregate-common (#16593)
adriangb Jun 28, 2025
d66d6b9
feat: Parquet modular encryption (#16351)
corwinjoy Jun 28, 2025
1c0dcad
fix: disallow specify both order_by and within_group (#16606)
watchingthewheelsgo Jun 28, 2025
ebf49b4
fix: format within_group error message (#16613)
watchingthewheelsgo Jun 29, 2025
649a36f
Allow usage of table functions in relations (#16571)
osipovartem Jun 30, 2025
f65da24
Update to arrow/parquet 55.2.0 (#16575)
alamb Jun 30, 2025
f4d1990
docs: Minor grammatical fixes for the scalar UDF docs (#16618)
ianthetechie Jun 30, 2025
9a1c09c
Improve field naming in first_value, last_value implementation (#16631)
findepi Jul 1, 2025
df9f096
Fix spurious failure in convert_batches test helper (#16627)
findepi Jul 1, 2025
7cdac33
Aggregate UDF cleanup (#16628)
findepi Jul 1, 2025
47f75ef
Avoid treating incomparable scalars as equal (#16624)
findepi Jul 1, 2025
e75eb7f
fix: reserved keywords in qualified column names (#16584)
crepererum Jul 1, 2025
9bb309c
restore topk pre-filtering of batches and make sort query fuzzer less…
alamb Jul 1, 2025
92f646c
Add support for Arrow Time types in Substrait (#16558)
jkosh44 Jul 1, 2025
17f1c9d
fix: support scalar function nested in get_field in Unparser (#16610)
chenkovsky Jul 1, 2025
72d6d32
chore(deps): bump substrait from 0.57.0 to 0.58.0 (#16640)
dependabot[bot] Jul 1, 2025
de79843
Support explain tree format debug for benchmark debug (#16604)
zhuqi-lucas Jul 1, 2025
25c2a07
Add microbenchmark for spilling with compression (#16512)
ding-young Jul 2, 2025
f03a8fd
Fix parquet filter_pushdown: respect parquet filter pushdown config i…
adriangb Jul 2, 2025
8366d6e
chore(deps): bump aws-config from 1.8.0 to 1.8.1 (#16651)
dependabot[bot] Jul 2, 2025
d4ef26d
Migrate core test to insta, part 2 (#16617)
Chen-Yuan-Lai Jul 2, 2025
705ea42
Update all spark SLT files (#16637)
findepi Jul 2, 2025
6870cc1
Add PhysicalExpr optimizer and cast unwrapping (#16530)
adriangb Jul 2, 2025
3ca09a6
benchmark: Support sort_tpch10 for benchmark (#16671)
zhuqi-lucas Jul 3, 2025
18f00a7
chore(deps): bump tokio from 1.45.1 to 1.46.0 (#16666)
dependabot[bot] Jul 3, 2025
06e5bbe
Fix TopK Sort incorrectly pushed down past operators that do not acce…
zhuqi-lucas Jul 3, 2025
50dc83a
Convert Option<Vec<sort expression>> to Vec<sort expression> (#16615)
ViggoC Jul 3, 2025
65242a6
Improve error message when ScalarValue fails to cast array (#16670)
findepi Jul 3, 2025
5a48857
Add an example of embedding indexes inside a parquet file (#16395)
zhuqi-lucas Jul 3, 2025
1cc67ab
`datafusion-cli`: Refactor statement execution logic (#16634)
liamzwbao Jul 3, 2025
3118b81
Implementation for regex_instr (#15928)
nirnayroy Jul 3, 2025
acf0bbe
Refactor error handling to use boxed errors for DataFusionError varia…
kosiew Jul 4, 2025
8b9c1f1
Add SchemaAdapterFactory Support for ListingTable with Schema Evoluti…
kosiew Jul 4, 2025
8c5d06d
Reuse Rows allocation in RowCursorStream (#16647)
Dandandan Jul 4, 2025
0185da6
Perf: fast CursorValues compare for StringViewArray using inline_key_…
zhuqi-lucas Jul 4, 2025
a715173
refactor: shrink `SchemaError` (#16653)
crepererum Jul 4, 2025
aadb79b
rustup version (#16663)
melroy12 Jul 4, 2025
c9a93c9
test how merge commit look
ding-young Jul 5, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
3 changes: 3 additions & 0 deletions .asf.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -50,6 +50,9 @@ github:
main:
required_pull_request_reviews:
required_approving_review_count: 1
pull_requests:
# enable updating head branches of pull requests
allow_update_branch: true

# publishes the content of the `asf-site` branch to
# https://datafusion.apache.org/
Expand Down
21 changes: 1 addition & 20 deletions .github/workflows/docs_pr.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -34,26 +34,7 @@ on:
workflow_dispatch:

jobs:
# Run doc tests
linux-test-doc:
name: cargo doctest (amd64)
runs-on: ubuntu-latest
container:
image: amd64/rust
steps:
- uses: actions/checkout@v4
with:
submodules: true
fetch-depth: 1
- name: Setup Rust toolchain
uses: ./.github/actions/setup-builder
with:
rust-version: stable
- name: Run doctests (embedded rust examples)
run: cargo test --doc --features avro,json
- name: Verify Working Directory Clean
run: git diff --exit-code


# Test doc build
linux-test-doc-build:
name: Test doc build
Expand Down
14 changes: 12 additions & 2 deletions .github/workflows/extended.yml
Original file line number Diff line number Diff line change
Expand Up @@ -101,7 +101,17 @@ jobs:
- name: Run tests (excluding doctests)
env:
RUST_BACKTRACE: 1
run: cargo test --profile ci --exclude datafusion-examples --exclude datafusion-benchmarks --workspace --lib --tests --bins --features avro,json,backtrace,extended_tests,recursive_protection
run: |
cargo test \
--profile ci \
--exclude datafusion-examples \
--exclude datafusion-benchmarks \
--exclude datafusion-cli \
--workspace \
--lib \
--tests \
--bins \
--features avro,json,backtrace,extended_tests,recursive_protection
- name: Verify Working Directory Clean
run: git diff --exit-code
- name: Cleanup
Expand All @@ -126,7 +136,7 @@ jobs:
- name: Run tests
run: |
cd datafusion
cargo test --profile ci --exclude datafusion-examples --exclude datafusion-benchmarks --exclude datafusion-sqllogictest --workspace --lib --tests --features=force_hash_collisions,avro
cargo test --profile ci --exclude datafusion-examples --exclude datafusion-benchmarks --exclude datafusion-sqllogictest --exclude datafusion-cli --workspace --lib --tests --features=force_hash_collisions,avro
cargo clean

sqllogictest-sqlite:
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -49,7 +49,7 @@ jobs:
uses: actions/labeler@v5.0.0
with:
repo-token: ${{ secrets.GITHUB_TOKEN }}
configuration-path: .github/workflows/dev_pr/labeler.yml
configuration-path: .github/workflows/labeler/labeler-config.yml
sync-labels: true

# TODO: Enable this when eps1lon/actions-label-merge-conflict is available.
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -41,7 +41,7 @@ physical-expr:

physical-plan:
- changed-files:
- any-glob-to-any-file: [datafusion/physical-plan/**/*']
- any-glob-to-any-file: ['datafusion/physical-plan/**/*']


catalog:
Expand Down Expand Up @@ -77,6 +77,10 @@ proto:
- changed-files:
- any-glob-to-any-file: ['datafusion/proto/**/*', 'datafusion/proto-common/**/*']

spark:
- changed-files:
- any-glob-to-any-file: ['datafusion/spark/**/*']

substrait:
- changed-files:
- any-glob-to-any-file: ['datafusion/substrait/**/*']
Expand Down
34 changes: 24 additions & 10 deletions .github/workflows/rust.yml
Original file line number Diff line number Diff line change
Expand Up @@ -39,14 +39,6 @@ on:
workflow_dispatch:

jobs:
# Check license header
license-header-check:
runs-on: ubuntu-latest
name: Check License Header
steps:
- uses: actions/checkout@v4
- uses: korandoru/hawkeye@v6

# Check crate compiles and base cargo check passes
linux-build-lib:
name: linux build test
Expand Down Expand Up @@ -401,8 +393,8 @@ jobs:
- name: Run tests with headless mode
working-directory: ./datafusion/wasmtest
run: |
wasm-pack test --headless --firefox
wasm-pack test --headless --chrome --chromedriver $CHROMEWEBDRIVER/chromedriver
RUSTFLAGS='--cfg getrandom_backend="wasm_js"' wasm-pack test --headless --firefox
RUSTFLAGS='--cfg getrandom_backend="wasm_js"' wasm-pack test --headless --chrome --chromedriver $CHROMEWEBDRIVER/chromedriver

# verify that the benchmark queries return the correct results
verify-benchmark-results:
Expand Down Expand Up @@ -476,6 +468,28 @@ jobs:
POSTGRES_HOST: postgres
POSTGRES_PORT: ${{ job.services.postgres.ports[5432] }}

sqllogictest-substrait:
name: "Run sqllogictest in Substrait round-trip mode"
needs: linux-build-lib
runs-on: ubuntu-latest
container:
image: amd64/rust
steps:
- uses: actions/checkout@v4
with:
submodules: true
fetch-depth: 1
- name: Setup Rust toolchain
uses: ./.github/actions/setup-builder
with:
rust-version: stable
- name: Run sqllogictest
# TODO: Right now several tests are failing in Substrait round-trip mode, so this
# command cannot be run for all the .slt files. Run it for just one that works (limit.slt)
# until most of the tickets in https://github.com/apache/datafusion/issues/16248 are addressed
# and this command can be run without filters.
run: cargo test --test sqllogictests -- --substrait-round-trip limit.slt

# Temporarily commenting out the Windows flow, the reason is enormously slow running build
# Waiting for new Windows 2025 github runner
# Details: https://github.com/apache/datafusion/issues/13726
Expand Down
Loading
Loading