perfect hash join #19411

UBarney · 2025-12-19T15:07:15Z

Which issue does this PR close?

Closes [EPIC]: Perfect Hash Join #17635.

Rationale for this change

This PR introduces a Perfect Hash Join optimization that uses an array-based direct mapping (ArrayMap) instead of a HashMap.
The array-based approach outperforms the standard Hash Join when the build-side keys are dense (i.e., the ratio count / (max - min + 1) is high) or when the key range (max - min) is sufficiently small.

The following results are from the hj.rs benchmark suite, run with the optimization enabled by setting DATAFUSION_EXECUTION_PERFECT_HASH_JOIN_MIN_KEY_DENSITY=0.1:


┏━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━┓
┃ Query                                                  ┃   base_hj ┃ density=0.1 ┃        Change ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━┩
│ QQuery 1_density=1_prob_hit=1_25*1.5M                  │   5.50 ms │     4.54 ms │ +1.21x faster │
│ QQuery 2_density=0.026_prob_hit=1_25*1.5M              │   6.13 ms │     5.43 ms │ +1.13x faster │
│ QQuery 3_density=1_prob_hit=1_100K*60M                 │ 132.59 ms │    97.42 ms │ +1.36x faster │
│ QQuery 4_density=1_prob_hit=0.1_100K*60M               │ 146.66 ms │    97.75 ms │ +1.50x faster │
│ QQuery 5_density=0.75_prob_hit=1_100K*60M              │ 139.85 ms │   103.82 ms │ +1.35x faster │
│ QQuery 6_density=0.75_prob_hit=0.1_100K*60M            │ 256.62 ms │   192.15 ms │ +1.34x faster │
│ QQuery 7_density=0.5_prob_hit=1_100K*60M               │ 136.27 ms │    91.64 ms │ +1.49x faster │
│ QQuery 8_density=0.5_prob_hit=0.1_100K*60M             │ 234.89 ms │   185.35 ms │ +1.27x faster │
│ QQuery 9_density=0.2_prob_hit=1_100K*60M               │ 132.76 ms │    98.44 ms │ +1.35x faster │
│ QQuery 10_density=0.2_prob_hit=0.1_100K*60M            │ 240.04 ms │   184.93 ms │ +1.30x faster │
│ QQuery 11_density=0.1_prob_hit=1_100K*60M              │ 133.02 ms │   108.11 ms │ +1.23x faster │
│ QQuery 12_density=0.1_prob_hit=0.1_100K*60M            │ 235.44 ms │   209.10 ms │ +1.13x faster │
│ QQuery 13_density=0.01_prob_hit=1_100K*60M             │ 135.64 ms │   132.52 ms │     no change │
│ QQuery 14_density=0.01_prob_hit=0.1_100K*60M           │ 235.88 ms │   234.62 ms │     no change │
│ QQuery 15_density=0.2_prob_hit=0.1_100K_(20%_dups)*60M │ 178.49 ms │   147.55 ms │ +1.21x faster │
└────────────────────────────────────────────────────────┴───────────┴─────────────┴───────────────┘
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃ Benchmark Summary          ┃           ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ Total Time (base_hj)       │ 2349.79ms │
│ Total Time (density=0.1)   │ 1893.37ms │
│ Average Time (base_hj)     │  156.65ms │
│ Average Time (density=0.1) │  126.22ms │
│ Queries Faster             │        13 │
│ Queries Slower             │         0 │
│ Queries with No Change     │         2 │
│ Queries with Failure       │         0 │
└────────────────────────────┴───────────┘

The following results are from tpch-sf10:

┏━━━━━━━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━━━━━┓
┃ Query        ┃       base ┃ perfect_hj ┃        Change ┃
┡━━━━━━━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━━━━━┩
│ QQuery 1     │  739.66 ms │  743.84 ms │     no change │
│ QQuery 2     │  315.94 ms │  317.53 ms │     no change │
│ QQuery 3     │  655.79 ms │  669.24 ms │     no change │
│ QQuery 4     │  215.48 ms │  218.79 ms │     no change │
│ QQuery 5     │ 1131.42 ms │ 1146.03 ms │     no change │
│ QQuery 6     │  202.32 ms │  190.83 ms │ +1.06x faster │
│ QQuery 7     │ 1734.06 ms │ 1710.50 ms │     no change │
│ QQuery 8     │ 1185.05 ms │ 1173.90 ms │     no change │
│ QQuery 9     │ 2036.76 ms │ 1994.30 ms │     no change │
│ QQuery 10    │  907.32 ms │  893.20 ms │     no change │
│ QQuery 11    │  306.63 ms │  275.46 ms │ +1.11x faster │
│ QQuery 12    │  404.00 ms │  381.95 ms │ +1.06x faster │
│ QQuery 13    │  531.67 ms │  498.58 ms │ +1.07x faster │
│ QQuery 14    │  317.63 ms │  303.04 ms │     no change │
│ QQuery 15    │  602.24 ms │  572.18 ms │     no change │
│ QQuery 16    │  200.00 ms │  201.68 ms │     no change │
│ QQuery 17    │ 1848.67 ms │ 1790.60 ms │     no change │
│ QQuery 18    │ 2130.63 ms │ 2179.84 ms │     no change │
│ QQuery 19    │  501.81 ms │  529.85 ms │  1.06x slower │
│ QQuery 20    │  637.91 ms │  661.72 ms │     no change │
│ QQuery 21    │ 1882.43 ms │ 1917.10 ms │     no change │
│ QQuery 22    │  130.68 ms │  141.76 ms │  1.08x slower │
└──────────────┴────────────┴────────────┴───────────────┘
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━┓
┃ Benchmark Summary         ┃            ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━┩
│ Total Time (base)         │ 18618.10ms │
│ Total Time (perfect_hj)   │ 18511.93ms │
│ Average Time (base)       │   846.28ms │
│ Average Time (perfect_hj) │   841.45ms │
│ Queries Faster            │          4 │
│ Queries Slower            │          2 │
│ Queries with No Change    │         16 │
│ Queries with Failure      │          0 │
└───────────────────────────┴────────────┘

What changes are included in this PR?

During the collect_left_input (build) phase, we now conditionally use an ArrayMap instead of a standard JoinHashMapType. This optimization is triggered only when the following conditions are met:
- There is exactly one join key.
- The join key can be any integer type convertible to u64 (excluding i128 and u128).
- The key distribution is sufficiently dense or the key range (max - min) is small enough to justify an array-based allocation.
- build_side.num_rows() < u32::MAX

The ArrayMap stores the minimum key as an offset and uses a Vec to map a key k directly to its build-side index via data[k - offset].

This PR also rewrites the Hash Join micro-benchmarks in benchmarks/src/hj.rs to evaluate ArrayMap and HashMap performance across varying key densities and probe hit rates.
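
The data[k - offset] lookup described above can be sketched as follows. This is a minimal illustration for unique keys only, not the PR's actual ArrayMap: the `DirectMap` type and its `build`/`get` methods are hypothetical names, and storing `row + 1` with 0 as the "not found" sentinel is an assumption based on the review snippet further down the thread.

```rust
/// Illustrative direct-mapped table (hypothetical; not the PR's `ArrayMap`).
struct DirectMap {
    /// Smallest build-side key; subtracted from every key to obtain a slot.
    offset: u64,
    /// `data[k - offset]` holds `build_row + 1`; 0 means "key not present".
    data: Vec<u32>,
}

impl DirectMap {
    /// Builds the table from unique build-side keys.
    /// Returns `None` for empty input or if the range or row count overflows.
    fn build(keys: &[u64]) -> Option<Self> {
        let min = *keys.iter().min()?;
        let max = *keys.iter().max()?;
        let range = usize::try_from(max - min).ok()?.checked_add(1)?;
        let mut data = vec![0u32; range];
        for (row, &k) in keys.iter().enumerate() {
            // Store row + 1 so that 0 can remain the sentinel.
            data[(k - min) as usize] = u32::try_from(row).ok()?.checked_add(1)?;
        }
        Some(Self { offset: min, data })
    }

    /// Returns the build-side row for `key`, if the key was seen during build.
    fn get(&self, key: u64) -> Option<usize> {
        let slot = usize::try_from(key.checked_sub(self.offset)?).ok()?;
        match self.data.get(slot) {
            Some(&v) if v != 0 => Some((v - 1) as usize),
            _ => None,
        }
    }
}
```

With keys 10, 12 and 15 the table has six slots; probing 12 yields row 1, while 11 (a gap) and 9 or 16 (outside the range) yield no match, all without hashing.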

Are these changes tested?

Yes

Are there any user-facing changes?

Yes, this PR introduces two new session configuration parameters to control the behavior of the Perfect Hash Join optimization:

- perfect_hash_join_small_build_threshold: This parameter defines the maximum key range (max_key - min_key) for the build side to be considered "small." If the key range is below this threshold, the array-based join will be triggered regardless of key density.
- perfect_hash_join_min_key_density: This parameter sets the minimum density (row_count / key_range) required to enable the perfect hash join optimization for larger key ranges.
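
How the two parameters interact can be sketched roughly as below. This is an assumption-laden illustration rather than the PR's code: the function name `use_array_map` is hypothetical, and the exact comparison operators (`<` for the range, `>=` for the density) and the use of `max - min + 1` as the slot count are inferred from the descriptions in this PR.

```rust
/// Hypothetical decision helper mirroring the two session parameters.
fn use_array_map(
    num_rows: usize,
    min_key: u64,
    max_key: u64,
    small_build_threshold: u64, // perfect_hash_join_small_build_threshold
    min_key_density: f64,       // perfect_hash_join_min_key_density
) -> bool {
    // An empty build side has nothing to map.
    if num_rows == 0 || max_key < min_key {
        return false;
    }
    // Build-side indices are stored as u32, so the row count must fit.
    if num_rows as u64 >= u32::MAX as u64 {
        return false;
    }
    let range = max_key - min_key;
    // Small key ranges qualify regardless of density.
    if range < small_build_threshold {
        return true;
    }
    // Larger ranges qualify only if the keys are dense enough.
    let slots = range as f64 + 1.0;
    (num_rows as f64) / slots >= min_key_density
}
```

For example, with an illustrative threshold of 1024 and a density of 0.15, a build side of 100K rows over keys 0..=99_999 qualifies (density 1.0), the same rows spread over 0..=9_999_999 do not (density 0.01), and 25 rows over 0..=900 qualify through the small-range rule alone. The value 1024 is made up for the example; the thread does not state the default.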

Dandandan · 2025-12-19T18:40:41Z

datafusion/physical-plan/src/joins/array_map.rs

+    ) -> Result<Self> {
+        // Initialize with 0 (sentinel for not found)
+        let mut data: Vec<u32> = vec![0; range];
+        let mut next: Option<Vec<u32>> = None;


Suggested change

let mut next: Option<Vec<u32>> = None;

let mut next: Vec<u32> = vec![];

I think this should work as well

Done, it's cleaner this way
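
A rough sketch of why a plain Vec works here: an empty Vec allocates nothing, so it can stand in for None until the first duplicate key appears. The `ChainedMap` type, its methods, and the chaining scheme (each row pointing at the previous row with the same key) are assumptions for illustration, not the code in array_map.rs.

```rust
/// Hypothetical duplicate-aware variant; not the PR's actual implementation.
struct ChainedMap {
    offset: u64,
    /// Head of each key's chain as `row + 1`; 0 means "not found".
    data: Vec<u32>,
    /// `next[row]` is the previous row with the same key as `row + 1`, 0 ends the chain.
    /// Stays empty (no allocation) while all keys are unique.
    next: Vec<u32>,
}

impl ChainedMap {
    /// Caller must ensure every key lies in `min..min + range`
    /// and that `keys.len() < u32::MAX`.
    fn build(keys: &[u64], min: u64, range: usize) -> Self {
        // Initialize with 0 (sentinel for not found).
        let mut data: Vec<u32> = vec![0; range];
        let mut next: Vec<u32> = vec![];
        for (row, &k) in keys.iter().enumerate() {
            let slot = (k - min) as usize;
            let prev = data[slot];
            if prev != 0 {
                // First duplicate seen: allocate the chain lazily.
                if next.is_empty() {
                    next = vec![0; keys.len()];
                }
                next[row] = prev;
            }
            data[slot] = row as u32 + 1;
        }
        Self { offset: min, data, next }
    }

    /// Returns all build-side rows for `key`, most recently inserted first.
    fn rows_for(&self, key: u64) -> Vec<usize> {
        let mut out = Vec::new();
        let mut cur = key
            .checked_sub(self.offset)
            .and_then(|s| usize::try_from(s).ok())
            .and_then(|s| self.data.get(s).copied())
            .unwrap_or(0);
        while cur != 0 {
            let row = (cur - 1) as usize;
            out.push(row);
            // An empty `next` simply ends every chain after one row.
            cur = self.next.get(row).copied().unwrap_or(0);
        }
        out
    }
}
```

Because `next.get(row)` returns `None` on an empty Vec, the probe path needs no separate branch for the "no duplicates" case, which is the simplification the suggestion is after.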

Dandandan · 2025-12-19T18:54:58Z

run benchmarks

alamb-ghbot · 2025-12-19T18:55:06Z

🤖 ./gh_compare_branch.sh gh_compare_branch.sh Running
Linux aal-dev 6.14.0-1018-gcp #19~24.04.1-Ubuntu SMP Wed Sep 24 23:23:09 UTC 2025 x86_64 x86_64 x86_64 GNU/Linux
Comparing perfect_hj (a442460) to 8550010 diff using: tpch_mem clickbench_partitioned clickbench_extended
Results will be posted here when complete

Dandandan · 2025-12-19T18:55:07Z

run benchmark tpcds

Dandandan · 2025-12-19T19:00:15Z

datafusion/common/src/config.rs

+        /// 
+        /// TODO: Currently only supports cases where left_side.num_rows() < u32::MAX.
+        /// Support for left_side.num_rows() >= u32::MAX will be added in the future.
+        pub perfect_hash_join_min_key_density: f64, default = 0.99


This seems very high? For a hashmap I believe the default is ~75% (plus it has some more overhead per key), so I think 75% could probably still be better overall?

That's a great point.
I'll add some benchmarks to compare the performance at different densities, including 75%, to find the optimal value for our use case. I'll update this based on the results.
Thanks for the suggestion!

Sounds good!

I think we can set it to 20%.

In terms of memory usage (run with this code: ), ArrayMap consumes less memory than JoinHashMap at a 20% density, even with duplicate keys.

Based on the hj.rs benchmark (results in the PR description), ArrayMap is also faster than JoinHashMap at 20% density, regardless of whether there are duplicate keys.

Memory Comparison Matrix (num_rows = 1000000)

| Density | Dup Rate | ArrayMap (MB) | JoinHashMap (MB) | Ratio (AM/JHM) |
|---------|----------|---------------|------------------|----------------|
| 100% | 0% | 3.81 | 37.81 | 0.10x |
| 100% | 25% | 7.63 | 37.81 | 0.20x |
| 100% | 50% | 7.63 | 37.81 | 0.20x |
| 100% | 75% | 7.63 | 37.81 | 0.20x |
| 75% | 0% | 5.09 | 37.81 | 0.13x |
| 75% | 25% | 8.90 | 37.81 | 0.24x |
| 75% | 50% | 8.90 | 37.81 | 0.24x |
| 75% | 75% | 8.90 | 37.81 | 0.24x |
| 50% | 0% | 7.63 | 37.81 | 0.20x |
| 50% | 25% | 11.44 | 37.81 | 0.30x |
| 50% | 50% | 11.44 | 37.81 | 0.30x |
| 50% | 75% | 11.44 | 37.81 | 0.30x |
| 20% | 0% | 19.07 | 37.81 | 0.50x |
| 20% | 25% | 22.89 | 37.81 | 0.61x |
| 20% | 50% | 22.89 | 37.81 | 0.61x |
| 20% | 75% | 22.89 | 37.81 | 0.61x |
| 10% | 0% | 38.15 | 37.81 | 1.01x |
| 10% | 25% | 41.96 | 37.81 | 1.11x |
| 10% | 50% | 41.96 | 37.81 | 1.11x |
| 10% | 75% | 41.96 | 37.81 | 1.11x |

What about ~15%?

Agreed. Setting it to ~15% works well: it not only improves performance but also reduces the memory footprint. I've updated the threshold.

Memory Comparison Matrix (num_rows = 1000000)

| Density | Dup Rate | ArrayMap (MB) | JoinHashMap (MB) | Ratio (AM/JHM) |
|---------|----------|---------------|------------------|----------------|
| ... | ... | ... | ... | ... |
| 15% | 0% | 25.43 | 37.81 | 0.67x |
| 15% | 25% | 29.25 | 37.81 | 0.77x |
| 15% | 50% | 29.25 | 37.81 | 0.77x |
| 15% | 75% | 29.25 | 37.81 | 0.77x |
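
The ArrayMap column in these matrices is consistent with a simple model: one 4-byte u32 slot per key in the range, plus one extra u32 per build row once any duplicates exist. The sketch below encodes that model. It is inferred from the reported numbers rather than taken from the measurement code, the function name `array_map_mib` is hypothetical, and the result is in MiB (2^20 bytes), which is what the reported "MB" values appear to be.

```rust
/// Estimated ArrayMap size in MiB under the assumed model:
/// `ceil(num_rows / density)` u32 slots, plus `num_rows` u32 chain
/// entries when the build side contains duplicate keys.
fn array_map_mib(num_rows: usize, density: f64, has_duplicates: bool) -> f64 {
    let slots = (num_rows as f64 / density).ceil();
    let chain_entries = if has_duplicates { num_rows as f64 } else { 0.0 };
    (slots + chain_entries) * 4.0 / (1024.0 * 1024.0)
}
```

For 1,000,000 rows at 15% density this gives about 25.43 without duplicates and about 29.25 with them, matching the rows above; at 10% density the slot array alone (about 38.15) already exceeds the 37.81 reported for JoinHashMap, which is where the break-even sits.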

alamb-ghbot · 2025-12-19T19:34:36Z

🤖: Benchmark completed

Details

Comparing HEAD and perfect_hj
--------------------
Benchmark clickbench_extended.json
--------------------
┏━━━━━━━━━━━━━━┳━━━━━━━━━━━━━┳━━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃ Query        ┃        HEAD ┃  perfect_hj ┃    Change ┃
┡━━━━━━━━━━━━━━╇━━━━━━━━━━━━━╇━━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ QQuery 0     │  2728.43 ms │  2746.77 ms │ no change │
│ QQuery 1     │  1259.76 ms │  1286.32 ms │ no change │
│ QQuery 2     │  2442.97 ms │  2404.14 ms │ no change │
│ QQuery 3     │  1146.01 ms │  1175.40 ms │ no change │
│ QQuery 4     │  2324.58 ms │  2297.95 ms │ no change │
│ QQuery 5     │ 28847.71 ms │ 28651.86 ms │ no change │
│ QQuery 6     │  3925.44 ms │  4071.92 ms │ no change │
│ QQuery 7     │  3480.92 ms │  3513.25 ms │ no change │
└──────────────┴─────────────┴─────────────┴───────────┘
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━┓
┃ Benchmark Summary         ┃            ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━┩
│ Total Time (HEAD)         │ 46155.84ms │
│ Total Time (perfect_hj)   │ 46147.61ms │
│ Average Time (HEAD)       │  5769.48ms │
│ Average Time (perfect_hj) │  5768.45ms │
│ Queries Faster            │          0 │
│ Queries Slower            │          0 │
│ Queries with No Change    │          8 │
│ Queries with Failure      │          0 │
└───────────────────────────┴────────────┘
--------------------
Benchmark clickbench_partitioned.json
--------------------
┏━━━━━━━━━━━━━━┳━━━━━━━━━━━━━┳━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━┓
┃ Query        ┃        HEAD ┃  perfect_hj ┃        Change ┃
┡━━━━━━━━━━━━━━╇━━━━━━━━━━━━━╇━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━┩
│ QQuery 0     │     2.55 ms │     2.28 ms │ +1.12x faster │
│ QQuery 1     │    49.94 ms │    49.72 ms │     no change │
│ QQuery 2     │   136.75 ms │   135.77 ms │     no change │
│ QQuery 3     │   157.55 ms │   156.38 ms │     no change │
│ QQuery 4     │  1127.59 ms │  1099.95 ms │     no change │
│ QQuery 5     │  1513.28 ms │  1540.48 ms │     no change │
│ QQuery 6     │     2.09 ms │     2.18 ms │     no change │
│ QQuery 7     │    54.92 ms │    54.54 ms │     no change │
│ QQuery 8     │  1435.36 ms │  1429.02 ms │     no change │
│ QQuery 9     │  1826.50 ms │  1818.96 ms │     no change │
│ QQuery 10    │   362.05 ms │   359.18 ms │     no change │
│ QQuery 11    │   413.64 ms │   415.45 ms │     no change │
│ QQuery 12    │  1346.02 ms │  1355.55 ms │     no change │
│ QQuery 13    │  2003.64 ms │  2030.93 ms │     no change │
│ QQuery 14    │  1258.62 ms │  1264.24 ms │     no change │
│ QQuery 15    │  1236.44 ms │  1220.63 ms │     no change │
│ QQuery 16    │  2660.06 ms │  2679.40 ms │     no change │
│ QQuery 17    │  2646.48 ms │  2669.57 ms │     no change │
│ QQuery 18    │  5032.89 ms │  5243.53 ms │     no change │
│ QQuery 19    │   119.08 ms │   120.08 ms │     no change │
│ QQuery 20    │  1909.83 ms │  1955.97 ms │     no change │
│ QQuery 21    │  2219.56 ms │  2228.45 ms │     no change │
│ QQuery 22    │  3754.10 ms │  3742.17 ms │     no change │
│ QQuery 23    │ 12284.83 ms │ 12304.70 ms │     no change │
│ QQuery 24    │   200.89 ms │   216.80 ms │  1.08x slower │
│ QQuery 25    │   459.79 ms │   477.52 ms │     no change │
│ QQuery 26    │   217.46 ms │   226.68 ms │     no change │
│ QQuery 27    │  2733.98 ms │  2714.81 ms │     no change │
│ QQuery 28    │ 24706.02 ms │ 23446.26 ms │ +1.05x faster │
│ QQuery 29    │   983.86 ms │   951.63 ms │     no change │
│ QQuery 30    │  1325.66 ms │  1334.96 ms │     no change │
│ QQuery 31    │  1361.59 ms │  1319.42 ms │     no change │
│ QQuery 32    │  4654.91 ms │  4894.57 ms │  1.05x slower │
│ QQuery 33    │  5895.01 ms │  5580.74 ms │ +1.06x faster │
│ QQuery 34    │  5921.45 ms │  5968.72 ms │     no change │
│ QQuery 35    │  1944.76 ms │  1894.39 ms │     no change │
│ QQuery 36    │    66.49 ms │    65.88 ms │     no change │
│ QQuery 37    │    45.71 ms │    47.05 ms │     no change │
│ QQuery 38    │    66.55 ms │    67.85 ms │     no change │
│ QQuery 39    │   105.16 ms │   104.80 ms │     no change │
│ QQuery 40    │    28.03 ms │    27.95 ms │     no change │
│ QQuery 41    │    23.88 ms │    24.08 ms │     no change │
│ QQuery 42    │    19.99 ms │    20.73 ms │     no change │
└──────────────┴─────────────┴─────────────┴───────────────┘
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━┓
┃ Benchmark Summary         ┃            ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━┩
│ Total Time (HEAD)         │ 94314.96ms │
│ Total Time (perfect_hj)   │ 93263.97ms │
│ Average Time (HEAD)       │  2193.37ms │
│ Average Time (perfect_hj) │  2168.93ms │
│ Queries Faster            │          3 │
│ Queries Slower            │          2 │
│ Queries with No Change    │         38 │
│ Queries with Failure      │          0 │
└───────────────────────────┴────────────┘
--------------------
Benchmark tpch_mem_sf1.json
--------------------
┏━━━━━━━━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━━━━━┓
┃ Query        ┃      HEAD ┃ perfect_hj ┃        Change ┃
┡━━━━━━━━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━━━━━┩
│ QQuery 1     │ 139.34 ms │  130.00 ms │ +1.07x faster │
│ QQuery 2     │  27.17 ms │   23.45 ms │ +1.16x faster │
│ QQuery 3     │  40.09 ms │   38.79 ms │     no change │
│ QQuery 4     │  29.24 ms │   28.73 ms │     no change │
│ QQuery 5     │  89.41 ms │   88.45 ms │     no change │
│ QQuery 6     │  20.11 ms │   19.98 ms │     no change │
│ QQuery 7     │ 233.64 ms │  193.84 ms │ +1.21x faster │
│ QQuery 8     │  40.29 ms │   38.67 ms │     no change │
│ QQuery 9     │ 108.11 ms │  102.06 ms │ +1.06x faster │
│ QQuery 10    │  64.69 ms │   64.74 ms │     no change │
│ QQuery 11    │  18.64 ms │   11.46 ms │ +1.63x faster │
│ QQuery 12    │  51.76 ms │   49.96 ms │     no change │
│ QQuery 13    │  47.87 ms │   48.78 ms │     no change │
│ QQuery 14    │  14.03 ms │   14.30 ms │     no change │
│ QQuery 15    │  25.20 ms │   25.20 ms │     no change │
│ QQuery 16    │  25.60 ms │   24.53 ms │     no change │
│ QQuery 17    │ 156.37 ms │  151.59 ms │     no change │
│ QQuery 18    │ 283.58 ms │  283.26 ms │     no change │
│ QQuery 19    │  35.96 ms │   38.98 ms │  1.08x slower │
│ QQuery 20    │  48.62 ms │   50.36 ms │     no change │
│ QQuery 21    │ 312.85 ms │  309.67 ms │     no change │
│ QQuery 22    │  17.67 ms │   18.01 ms │     no change │
└──────────────┴───────────┴────────────┴───────────────┘
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃ Benchmark Summary         ┃           ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ Total Time (HEAD)         │ 1830.21ms │
│ Total Time (perfect_hj)   │ 1754.80ms │
│ Average Time (HEAD)       │   83.19ms │
│ Average Time (perfect_hj) │   79.76ms │
│ Queries Faster            │         5 │
│ Queries Slower            │         1 │
│ Queries with No Change    │        16 │
│ Queries with Failure      │         0 │
└───────────────────────────┴───────────┘

alamb-ghbot · 2025-12-19T19:34:41Z

🤖 ./gh_compare_branch.sh gh_compare_branch.sh Running
Linux aal-dev 6.14.0-1018-gcp #19~24.04.1-Ubuntu SMP Wed Sep 24 23:23:09 UTC 2025 x86_64 x86_64 x86_64 GNU/Linux
Comparing perfect_hj (a442460) to 8550010 diff using: tpcds
Results will be posted here when complete

alamb-ghbot · 2025-12-19T19:43:35Z

🤖: Benchmark completed

Details

Comparing HEAD and perfect_hj
--------------------
Benchmark tpcds_sf1.json
--------------------
┏━━━━━━━━━━━━━━┳━━━━━━━━━━━━━┳━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━┓
┃ Query        ┃        HEAD ┃  perfect_hj ┃        Change ┃
┡━━━━━━━━━━━━━━╇━━━━━━━━━━━━━╇━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━┩
│ QQuery 1     │    64.32 ms │    64.03 ms │     no change │
│ QQuery 2     │   202.76 ms │   209.93 ms │     no change │
│ QQuery 3     │   160.17 ms │   160.30 ms │     no change │
│ QQuery 4     │  2076.33 ms │  2016.14 ms │     no change │
│ QQuery 5     │   271.72 ms │   267.86 ms │     no change │
│ QQuery 6     │  1539.80 ms │  1528.76 ms │     no change │
│ QQuery 7     │   501.34 ms │   500.09 ms │     no change │
│ QQuery 8     │   172.26 ms │   172.88 ms │     no change │
│ QQuery 9     │   283.58 ms │   274.82 ms │     no change │
│ QQuery 10    │   169.65 ms │   172.34 ms │     no change │
│ QQuery 11    │  1414.43 ms │  1436.15 ms │     no change │
│ QQuery 12    │    74.91 ms │    73.81 ms │     no change │
│ QQuery 13    │   554.37 ms │   557.49 ms │     no change │
│ QQuery 14    │  2016.91 ms │  1803.46 ms │ +1.12x faster │
│ QQuery 15    │    28.50 ms │    27.30 ms │     no change │
│ QQuery 16    │    56.60 ms │    57.60 ms │     no change │
│ QQuery 17    │   354.64 ms │   357.88 ms │     no change │
│ QQuery 18    │   191.69 ms │   190.50 ms │     no change │
│ QQuery 19    │   228.28 ms │   229.67 ms │     no change │
│ QQuery 20    │    23.36 ms │    23.50 ms │     no change │
│ QQuery 21    │    33.71 ms │    32.15 ms │     no change │
│ QQuery 22    │   985.29 ms │   906.05 ms │ +1.09x faster │
│ QQuery 23    │  1845.20 ms │  1713.30 ms │ +1.08x faster │
│ QQuery 24    │   642.89 ms │   636.64 ms │     no change │
│ QQuery 25    │   523.12 ms │   516.07 ms │     no change │
│ QQuery 26    │   123.94 ms │   126.15 ms │     no change │
│ QQuery 27    │   504.93 ms │   500.48 ms │     no change │
│ QQuery 28    │   303.81 ms │   302.04 ms │     no change │
│ QQuery 29    │   444.98 ms │   447.70 ms │     no change │
│ QQuery 30    │    65.01 ms │    63.94 ms │     no change │
│ QQuery 31    │   306.15 ms │   306.66 ms │     no change │
│ QQuery 32    │    77.86 ms │    78.00 ms │     no change │
│ QQuery 33    │   193.25 ms │   194.57 ms │     no change │
│ QQuery 34    │   159.04 ms │   162.10 ms │     no change │
│ QQuery 35    │   169.34 ms │   171.45 ms │     no change │
│ QQuery 36    │   296.09 ms │   286.26 ms │     no change │
│ QQuery 37    │   257.06 ms │   254.49 ms │     no change │
│ QQuery 38    │   155.26 ms │   151.49 ms │     no change │
│ QQuery 39    │   216.46 ms │   194.61 ms │ +1.11x faster │
│ QQuery 40    │   197.40 ms │   172.47 ms │ +1.14x faster │
│ QQuery 41    │    16.30 ms │    16.66 ms │     no change │
│ QQuery 42    │   142.70 ms │   145.52 ms │     no change │
│ QQuery 43    │   126.26 ms │   129.13 ms │     no change │
│ QQuery 44    │    15.68 ms │    14.96 ms │     no change │
│ QQuery 45    │    83.14 ms │    80.86 ms │     no change │
│ QQuery 46    │   329.53 ms │   329.04 ms │     no change │
│ QQuery 47    │  1279.95 ms │  1186.17 ms │ +1.08x faster │
│ QQuery 48    │   416.35 ms │   420.97 ms │     no change │
│ QQuery 49    │   355.14 ms │   354.37 ms │     no change │
│ QQuery 50    │   338.48 ms │   343.17 ms │     no change │
│ QQuery 51    │   298.83 ms │   294.59 ms │     no change │
│ QQuery 52    │   147.14 ms │   143.27 ms │     no change │
│ QQuery 53    │   149.53 ms │   146.40 ms │     no change │
│ QQuery 54    │   207.43 ms │   216.19 ms │     no change │
│ QQuery 55    │   144.21 ms │   144.60 ms │     no change │
│ QQuery 56    │   193.16 ms │   190.45 ms │     no change │
│ QQuery 57    │   319.98 ms │   302.28 ms │ +1.06x faster │
│ QQuery 58    │   515.69 ms │   482.75 ms │ +1.07x faster │
│ QQuery 59    │   285.93 ms │   288.57 ms │     no change │
│ QQuery 60    │   199.98 ms │   199.54 ms │     no change │
│ QQuery 61    │   240.08 ms │   235.47 ms │     no change │
│ QQuery 62    │  1378.44 ms │  1398.12 ms │     no change │
│ QQuery 63    │   149.43 ms │   148.88 ms │     no change │
│ QQuery 64    │  1142.94 ms │  1161.26 ms │     no change │
│ QQuery 65    │   368.38 ms │   363.41 ms │     no change │
│ QQuery 66    │   389.40 ms │   393.14 ms │     no change │
│ QQuery 67    │   646.76 ms │   627.61 ms │     no change │
│ QQuery 68    │   383.87 ms │   384.78 ms │     no change │
│ QQuery 69    │   167.41 ms │   170.00 ms │     no change │
│ QQuery 70    │   526.93 ms │   528.35 ms │     no change │
│ QQuery 71    │   183.79 ms │   184.70 ms │     no change │
│ QQuery 72    │  2474.00 ms │  2083.02 ms │ +1.19x faster │
│ QQuery 73    │   156.03 ms │   157.55 ms │     no change │
│ QQuery 74    │   893.90 ms │   899.19 ms │     no change │
│ QQuery 75    │   394.28 ms │   389.15 ms │     no change │
│ QQuery 76    │   184.12 ms │   182.03 ms │     no change │
│ QQuery 77    │   265.99 ms │   262.24 ms │     no change │
│ QQuery 78    │   939.94 ms │   953.61 ms │     no change │
│ QQuery 79    │   341.30 ms │   340.35 ms │     no change │
│ QQuery 80    │   496.20 ms │   493.37 ms │     no change │
│ QQuery 81    │    43.41 ms │    40.77 ms │ +1.06x faster │
│ QQuery 82    │   296.71 ms │   283.45 ms │     no change │
│ QQuery 83    │    67.95 ms │    62.11 ms │ +1.09x faster │
│ QQuery 84    │    64.72 ms │    65.66 ms │     no change │
│ QQuery 85    │   214.94 ms │   226.66 ms │  1.05x slower │
│ QQuery 86    │    60.18 ms │    57.66 ms │     no change │
│ QQuery 87    │   151.20 ms │   157.81 ms │     no change │
│ QQuery 88    │   243.17 ms │   250.08 ms │     no change │
│ QQuery 89    │   174.02 ms │   172.62 ms │     no change │
│ QQuery 90    │    36.80 ms │    37.36 ms │     no change │
│ QQuery 91    │    95.95 ms │    92.97 ms │     no change │
│ QQuery 92    │    78.25 ms │    77.30 ms │     no change │
│ QQuery 93    │   270.50 ms │   263.82 ms │     no change │
│ QQuery 94    │    85.61 ms │    84.43 ms │     no change │
│ QQuery 95    │   260.54 ms │   228.76 ms │ +1.14x faster │
│ QQuery 96    │   111.78 ms │   110.86 ms │     no change │
│ QQuery 97    │   186.45 ms │   187.03 ms │     no change │
│ QQuery 98    │   235.54 ms │   237.35 ms │     no change │
│ QQuery 99    │ 14895.30 ms │ 15020.29 ms │     no change │
└──────────────┴─────────────┴─────────────┴───────────────┘
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━┓
┃ Benchmark Summary         ┃            ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━┩
│ Total Time (HEAD)         │ 52748.01ms │
│ Total Time (perfect_hj)   │ 51783.83ms │
│ Average Time (HEAD)       │   532.81ms │
│ Average Time (perfect_hj) │   523.07ms │
│ Queries Faster            │         12 │
│ Queries Slower            │          1 │
│ Queries with No Change    │         86 │
│ Queries with Failure      │          0 │
└───────────────────────────┴────────────┘

UBarney · 2025-12-21T14:22:54Z

benchmarks/compare.py

    comparison = BenchmarkRun.load_from_file(comparison_path)

-    console = Console()
+    console = Console(width=200)


I've increased the console width to 200. I added more information, such as 'density', to the queryName, which made it longer and caused it to be cut off in the output.

UBarney · 2025-12-21T14:22:56Z

benchmarks/compare.py

-    baseline_header = baseline_path.parent.stem
-    comparison_header = comparison_path.parent.stem
+    baseline_header = baseline_path.parent.name
+    comparison_header = comparison_path.parent.name


Before, a path like .../density=0.1/... was incorrectly shortened to density=0, because .stem drops everything after the last dot. Using .parent.name returns the full directory name, density=0.1.
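
Rust's std::path has the same stem-versus-name distinction, so the pitfall can be illustrated in the language used by the rest of this PR. This is only an analogue of the Python change above; the helper `dir_labels` and the example path are hypothetical.

```rust
use std::path::Path;

/// Returns the parent directory's (stem, full name) for a result file.
/// `file_stem` drops everything after the last dot; `file_name` keeps it.
fn dir_labels(result_file: &Path) -> Option<(String, String)> {
    let dir = result_file.parent()?;
    let stem = dir.file_stem()?.to_string_lossy().into_owned();
    let name = dir.file_name()?.to_string_lossy().into_owned();
    Some((stem, name))
}
```

For a path such as results/density=0.1/hj.json, the stem of the parent directory is density=0 while its name is density=0.1, which is why the header needs the full name.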

Dandandan · 2026-01-05T20:23:38Z

Can you solve the conflicts?

UBarney · 2026-01-06T09:49:40Z

Can you solve the conflicts?

I've resolved the conflicts. Once #19602 is merged, I will update ArrayMap::mark_existing_probes to follow logic similar to JoinHashMapType::contain_hashes.

Dandandan · 2026-01-06T10:13:10Z

Can you solve the conflicts?

I've resolved the conflicts. Once #19602 is merged, I will update ArrayMap::mark_existing_probes to follow logic similar to JoinHashMapType::contain_hashes.

Done

UBarney · 2026-01-07T09:50:06Z

Can you solve the conflicts?

I've resolved the conflicts. Once #19602 is merged, I will update ArrayMap::mark_existing_probes to follow logic similar to JoinHashMapType::contain_hashes.

Done

I have added contain_keys to ArrayMap and updated the logic accordingly.

Dandandan · 2026-01-07T10:22:41Z

run benchmarks

alamb-ghbot · 2026-01-07T10:22:47Z

🤖 ./gh_compare_branch.sh gh_compare_branch.sh Running
Linux aal-dev 6.14.0-1018-gcp #19~24.04.1-Ubuntu SMP Wed Sep 24 23:23:09 UTC 2025 x86_64 x86_64 x86_64 GNU/Linux
Comparing perfect_hj (e9b35e5) to 1037f0a diff using: tpch_mem clickbench_partitioned clickbench_extended
Results will be posted here when complete

Dandandan · 2026-01-07T10:22:50Z

run benchmark tpch

Dandandan · 2026-01-07T10:22:59Z

run benchmark tpcds

alamb-ghbot · 2026-01-07T11:03:05Z

🤖: Benchmark completed

Details

Comparing HEAD and perfect_hj
--------------------
Benchmark clickbench_extended.json
--------------------
┏━━━━━━━━━━━━━━┳━━━━━━━━━━━━━┳━━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃ Query        ┃        HEAD ┃  perfect_hj ┃    Change ┃
┡━━━━━━━━━━━━━━╇━━━━━━━━━━━━━╇━━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ QQuery 0     │  2361.73 ms │  2300.54 ms │ no change │
│ QQuery 1     │   934.19 ms │   912.49 ms │ no change │
│ QQuery 2     │  1885.10 ms │  1860.90 ms │ no change │
│ QQuery 3     │  1202.09 ms │  1180.71 ms │ no change │
│ QQuery 4     │  2306.91 ms │  2298.40 ms │ no change │
│ QQuery 5     │ 28041.32 ms │ 28094.82 ms │ no change │
│ QQuery 6     │  3849.98 ms │  3872.67 ms │ no change │
│ QQuery 7     │  3794.60 ms │  3815.13 ms │ no change │
└──────────────┴─────────────┴─────────────┴───────────┘
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━┓
┃ Benchmark Summary         ┃            ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━┩
│ Total Time (HEAD)         │ 44375.93ms │
│ Total Time (perfect_hj)   │ 44335.67ms │
│ Average Time (HEAD)       │  5546.99ms │
│ Average Time (perfect_hj) │  5541.96ms │
│ Queries Faster            │          0 │
│ Queries Slower            │          0 │
│ Queries with No Change    │          8 │
│ Queries with Failure      │          0 │
└───────────────────────────┴────────────┘
--------------------
Benchmark clickbench_partitioned.json
--------------------
┏━━━━━━━━━━━━━━┳━━━━━━━━━━━━━┳━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━┓
┃ Query        ┃        HEAD ┃  perfect_hj ┃        Change ┃
┡━━━━━━━━━━━━━━╇━━━━━━━━━━━━━╇━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━┩
│ QQuery 0     │     1.44 ms │     1.46 ms │     no change │
│ QQuery 1     │    48.94 ms │    49.13 ms │     no change │
│ QQuery 2     │   131.83 ms │   132.88 ms │     no change │
│ QQuery 3     │   154.94 ms │   151.89 ms │     no change │
│ QQuery 4     │  1129.27 ms │  1094.93 ms │     no change │
│ QQuery 5     │  1364.06 ms │  1368.14 ms │     no change │
│ QQuery 6     │     1.41 ms │     1.42 ms │     no change │
│ QQuery 7     │    52.53 ms │    53.49 ms │     no change │
│ QQuery 8     │  1469.70 ms │  1415.32 ms │     no change │
│ QQuery 9     │  1804.95 ms │  1800.38 ms │     no change │
│ QQuery 10    │   344.22 ms │   345.91 ms │     no change │
│ QQuery 11    │   387.44 ms │   396.02 ms │     no change │
│ QQuery 12    │  1237.81 ms │  1286.77 ms │     no change │
│ QQuery 13    │  1912.18 ms │  1972.25 ms │     no change │
│ QQuery 14    │  1233.69 ms │  1247.50 ms │     no change │
│ QQuery 15    │  1254.11 ms │  1243.60 ms │     no change │
│ QQuery 16    │  2567.98 ms │  2570.61 ms │     no change │
│ QQuery 17    │  2550.81 ms │  2580.33 ms │     no change │
│ QQuery 18    │  5612.71 ms │  4913.15 ms │ +1.14x faster │
│ QQuery 19    │   120.74 ms │   119.55 ms │     no change │
│ QQuery 20    │  1913.07 ms │  1904.79 ms │     no change │
│ QQuery 21    │  2146.63 ms │  2192.36 ms │     no change │
│ QQuery 22    │  3703.55 ms │  3729.43 ms │     no change │
│ QQuery 23    │ 22049.38 ms │ 12242.00 ms │ +1.80x faster │
│ QQuery 24    │   207.87 ms │   203.23 ms │     no change │
│ QQuery 25    │   456.99 ms │   469.41 ms │     no change │
│ QQuery 26    │   214.05 ms │   204.73 ms │     no change │
│ QQuery 27    │  2767.10 ms │  2680.87 ms │     no change │
│ QQuery 28    │ 22195.62 ms │ 23141.30 ms │     no change │
│ QQuery 29    │   991.32 ms │   939.56 ms │ +1.06x faster │
│ QQuery 30    │  1349.48 ms │  1319.89 ms │     no change │
│ QQuery 31    │  1368.06 ms │  1339.89 ms │     no change │
│ QQuery 32    │  5091.91 ms │  5138.39 ms │     no change │
│ QQuery 33    │  5512.57 ms │  5484.79 ms │     no change │
│ QQuery 34    │  5638.93 ms │  6020.19 ms │  1.07x slower │
│ QQuery 35    │  1992.03 ms │  1922.04 ms │     no change │
│ QQuery 36    │    63.99 ms │    64.41 ms │     no change │
│ QQuery 37    │    43.36 ms │    45.51 ms │     no change │
│ QQuery 38    │    66.10 ms │    64.66 ms │     no change │
│ QQuery 39    │   100.63 ms │   101.04 ms │     no change │
│ QQuery 40    │    28.11 ms │    26.02 ms │ +1.08x faster │
│ QQuery 41    │    21.33 ms │    22.12 ms │     no change │
│ QQuery 42    │    18.14 ms │    19.02 ms │     no change │
└──────────────┴─────────────┴─────────────┴───────────────┘
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━┓
┃ Benchmark Summary         ┃             ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━┩
│ Total Time (HEAD)         │ 101320.99ms │
│ Total Time (perfect_hj)   │  92020.38ms │
│ Average Time (HEAD)       │   2356.30ms │
│ Average Time (perfect_hj) │   2140.01ms │
│ Queries Faster            │           4 │
│ Queries Slower            │           1 │
│ Queries with No Change    │          38 │
│ Queries with Failure      │           0 │
└───────────────────────────┴─────────────┘
--------------------
Benchmark tpch_mem_sf1.json
--------------------
┏━━━━━━━━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━━━━━┓
┃ Query        ┃      HEAD ┃ perfect_hj ┃        Change ┃
┡━━━━━━━━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━━━━━┩
│ QQuery 1     │ 120.09 ms │  112.47 ms │ +1.07x faster │
│ QQuery 2     │  29.95 ms │   25.31 ms │ +1.18x faster │
│ QQuery 3     │  36.86 ms │   36.60 ms │     no change │
│ QQuery 4     │  28.68 ms │   29.61 ms │     no change │
│ QQuery 5     │  85.84 ms │   86.63 ms │     no change │
│ QQuery 6     │  19.61 ms │   19.47 ms │     no change │
│ QQuery 7     │ 243.13 ms │  145.00 ms │ +1.68x faster │
│ QQuery 8     │  37.03 ms │   31.49 ms │ +1.18x faster │
│ QQuery 9     │ 117.04 ms │   95.96 ms │ +1.22x faster │
│ QQuery 10    │  75.18 ms │   63.06 ms │ +1.19x faster │
│ QQuery 11    │  19.93 ms │   12.30 ms │ +1.62x faster │
│ QQuery 12    │  49.38 ms │   49.85 ms │     no change │
│ QQuery 13    │  46.20 ms │   45.37 ms │     no change │
│ QQuery 14    │  13.04 ms │   13.40 ms │     no change │
│ QQuery 15    │  23.47 ms │   23.69 ms │     no change │
│ QQuery 16    │  24.37 ms │   23.86 ms │     no change │
│ QQuery 17    │ 148.67 ms │  147.84 ms │     no change │
│ QQuery 18    │ 284.66 ms │  272.35 ms │     no change │
│ QQuery 19    │  35.58 ms │   37.97 ms │  1.07x slower │
│ QQuery 20    │  48.85 ms │   51.08 ms │     no change │
│ QQuery 21    │ 295.14 ms │  184.30 ms │ +1.60x faster │
│ QQuery 22    │  17.43 ms │   17.36 ms │     no change │
└──────────────┴───────────┴────────────┴───────────────┘
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃ Benchmark Summary         ┃           ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ Total Time (HEAD)         │ 1800.12ms │
│ Total Time (perfect_hj)   │ 1524.97ms │
│ Average Time (HEAD)       │   81.82ms │
│ Average Time (perfect_hj) │   69.32ms │
│ Queries Faster            │         8 │
│ Queries Slower            │         1 │
│ Queries with No Change    │        13 │
│ Queries with Failure      │         0 │
└───────────────────────────┴───────────┘
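
The Change column appears to be the ratio of the two timings, with differences below roughly 5% reported as "no change". A minimal Rust sketch that reproduces the labels is shown here. The 5% band is an assumption inferred from rows such as Q1 and Q18 above, not taken from the comparison script, and `change_label` is a hypothetical helper name.

```rust
/// Formats a "Change" cell from baseline and candidate timings in milliseconds.
/// ASSUMPTION: differences under 5% are treated as noise; this band is
/// inferred from the tables in this thread, not from the comparison script.
fn change_label(base_ms: f64, new_ms: f64) -> String {
    const NOISE_BAND: f64 = 1.05;
    if base_ms / new_ms >= NOISE_BAND {
        // Candidate is faster: report how many times faster.
        format!("+{:.2}x faster", base_ms / new_ms)
    } else if new_ms / base_ms >= NOISE_BAND {
        // Candidate is slower: report how many times slower.
        format!("{:.2}x slower", new_ms / base_ms)
    } else {
        "no change".to_string()
    }
}
```

With the Q7 timings above (243.13 ms and 145.00 ms) this helper returns `+1.68x faster`.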

alamb-ghbot · 2026-01-07T11:03:09Z

🤖 ./gh_compare_branch.sh gh_compare_branch.sh Running
Linux aal-dev 6.14.0-1018-gcp #19~24.04.1-Ubuntu SMP Wed Sep 24 23:23:09 UTC 2025 x86_64 x86_64 x86_64 GNU/Linux
Comparing perfect_hj (e9b35e5) to 1037f0a diff using: tpch
Results will be posted here when complete

alamb-ghbot · 2026-01-07T11:03:46Z

🤖: Benchmark completed

Comparing HEAD and perfect_hj
--------------------
Benchmark tpch_sf1.json
--------------------
┏━━━━━━━━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━━━━━┓
┃ Query        ┃      HEAD ┃ perfect_hj ┃        Change ┃
┡━━━━━━━━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━━━━━┩
│ QQuery 1     │ 191.42 ms │  194.17 ms │     no change │
│ QQuery 2     │  92.90 ms │   79.24 ms │ +1.17x faster │
│ QQuery 3     │ 122.93 ms │  118.45 ms │     no change │
│ QQuery 4     │  76.58 ms │   76.39 ms │     no change │
│ QQuery 5     │ 169.79 ms │  167.60 ms │     no change │
│ QQuery 6     │  65.74 ms │   62.99 ms │     no change │
│ QQuery 7     │ 213.34 ms │  204.62 ms │     no change │
│ QQuery 8     │ 159.39 ms │  158.60 ms │     no change │
│ QQuery 9     │ 224.42 ms │  220.08 ms │     no change │
│ QQuery 10    │ 184.10 ms │  187.50 ms │     no change │
│ QQuery 11    │  73.24 ms │   57.11 ms │ +1.28x faster │
│ QQuery 12    │ 116.93 ms │  116.71 ms │     no change │
│ QQuery 13    │ 218.62 ms │  214.30 ms │     no change │
│ QQuery 14    │  94.08 ms │   89.32 ms │ +1.05x faster │
│ QQuery 15    │ 115.42 ms │  117.40 ms │     no change │
│ QQuery 16    │  55.32 ms │   56.42 ms │     no change │
│ QQuery 17    │ 269.67 ms │  269.94 ms │     no change │
│ QQuery 18    │ 318.44 ms │  316.34 ms │     no change │
│ QQuery 19    │ 131.59 ms │  133.16 ms │     no change │
│ QQuery 20    │ 121.70 ms │  124.62 ms │     no change │
│ QQuery 21    │ 259.46 ms │  252.28 ms │     no change │
│ QQuery 22    │  41.13 ms │   36.72 ms │ +1.12x faster │
└──────────────┴───────────┴────────────┴───────────────┘
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃ Benchmark Summary         ┃           ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ Total Time (HEAD)         │ 3316.23ms │
│ Total Time (perfect_hj)   │ 3253.95ms │
│ Average Time (HEAD)       │  150.74ms │
│ Average Time (perfect_hj) │  147.91ms │
│ Queries Faster            │         4 │
│ Queries Slower            │         0 │
│ Queries with No Change    │        18 │
│ Queries with Failure      │         0 │
└───────────────────────────┴───────────┘

alamb-ghbot · 2026-01-07T11:03:48Z

🤖 ./gh_compare_branch.sh gh_compare_branch.sh Running
Linux aal-dev 6.14.0-1018-gcp #19~24.04.1-Ubuntu SMP Wed Sep 24 23:23:09 UTC 2025 x86_64 x86_64 x86_64 GNU/Linux
Comparing perfect_hj (e9b35e5) to 1037f0a diff using: tpcds
Results will be posted here when complete

alamb-ghbot · 2026-01-07T11:03:50Z

Benchmark script failed with exit code 1.

Last 10 lines of output:

BRANCH_NAME: HEAD
DATA_DIR: /home/alamb/arrow-datafusion/benchmarks/data
RESULTS_DIR: /home/alamb/arrow-datafusion/benchmarks/results/HEAD
CARGO_COMMAND: cargo run --release
PREFER_HASH_JOIN: true
***************************

Please prepare TPC-DS data first by following instructions:
  ./bench.sh data tpcds

Dandandan · 2026-01-07T11:08:08Z

┏━━━━━━━━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━━━━━┓
┃ Query        ┃      HEAD ┃ perfect_hj ┃        Change ┃
┡━━━━━━━━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━━━━━┩
│ QQuery 1     │ 120.09 ms │  112.47 ms │ +1.07x faster │
│ QQuery 2     │  29.95 ms │   25.31 ms │ +1.18x faster │
│ QQuery 3     │  36.86 ms │   36.60 ms │     no change │
│ QQuery 4     │  28.68 ms │   29.61 ms │     no change │
│ QQuery 5     │  85.84 ms │   86.63 ms │     no change │
│ QQuery 6     │  19.61 ms │   19.47 ms │     no change │
│ QQuery 7     │ 243.13 ms │  145.00 ms │ +1.68x faster │
│ QQuery 8     │  37.03 ms │   31.49 ms │ +1.18x faster │
│ QQuery 9     │ 117.04 ms │   95.96 ms │ +1.22x faster │
│ QQuery 10    │  75.18 ms │   63.06 ms │ +1.19x faster │
│ QQuery 11    │  19.93 ms │   12.30 ms │ +1.62x faster │
│ QQuery 12    │  49.38 ms │   49.85 ms │     no change │
│ QQuery 13    │  46.20 ms │   45.37 ms │     no change │
│ QQuery 14    │  13.04 ms │   13.40 ms │     no change │
│ QQuery 15    │  23.47 ms │   23.69 ms │     no change │
│ QQuery 16    │  24.37 ms │   23.86 ms │     no change │
│ QQuery 17    │ 148.67 ms │  147.84 ms │     no change │
│ QQuery 18    │ 284.66 ms │  272.35 ms │     no change │
│ QQuery 19    │  35.58 ms │   37.97 ms │  1.07x slower │
│ QQuery 20    │  48.85 ms │   51.08 ms │     no change │
│ QQuery 21    │ 295.14 ms │  184.30 ms │ +1.60x faster │
│ QQuery 22    │  17.43 ms │   17.36 ms │     no change │
└──────────────┴───────────┴────────────┴───────────────┘
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃ Benchmark Summary         ┃           ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ Total Time (HEAD)         │ 1800.12ms │
│ Total Time (perfect_hj)   │ 1524.97ms │
│ Average Time (HEAD)       │   81.82ms │
│ Average Time (perfect_hj) │   69.32ms │
│ Queries Faster            │         8 │
│ Queries Slower            │         1 │
│ Queries with No Change    │        13 │
│ Queries with Failure      │         0 │
└───────────────────────────┴───────────┘

Very nice!

Dandandan · 2026-01-07T12:40:06Z

run benchmark hj

alamb-ghbot · 2026-01-07T12:40:08Z

🤖 Hi @Dandandan, thanks for the request (#19411 (comment)).

scrape_comments.py only supports whitelisted benchmarks.

Standard: clickbench_1, clickbench_extended, clickbench_partitioned, clickbench_pushdown, external_aggr, tpcds, tpch, tpch10, tpch_mem, tpch_mem10
Criterion: aggregate_query_sql, aggregate_vectorized, case_when, character_length, in_list, range_and_generate_series, sort, sql_planner, strpos, substr_index, with_hashes

Please choose one or more of these with run benchmark <name> or run benchmark <name1> <name2>...
Unsupported benchmarks: hj.

Copilot

Pull request overview

This PR introduces a "perfect hash join" optimization for DataFusion's hash join implementation. When join keys are dense integers within a limited range, an array-based direct mapping (ArrayMap) is used instead of a general-purpose HashMap, resulting in significant performance improvements.
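
To illustrate the idea of direct array indexing, here is a minimal Rust sketch, not the PR's `ArrayMap`. `DenseKeyMap` and its methods are hypothetical names, keys are assumed to be already converted to `u64`, and duplicate build keys are rejected, whereas the PR handles them through shared chain traversal.

```rust
use std::convert::TryFrom;

/// Hypothetical direct-mapped lookup table for unique integer keys.
struct DenseKeyMap {
    min: u64,
    /// slots[key - min] stores build-row index + 1; 0 means "no row".
    slots: Vec<u32>,
}

impl DenseKeyMap {
    /// Builds the table; returns None for empty input, duplicate keys,
    /// or a key range that does not fit in memory-addressable size.
    fn build(keys: &[u64]) -> Option<Self> {
        let min = *keys.iter().min()?;
        let max = *keys.iter().max()?;
        let size = usize::try_from(max - min).ok()?.checked_add(1)?;
        let mut slots = vec![0u32; size];
        for (row, &key) in keys.iter().enumerate() {
            let slot = &mut slots[(key - min) as usize];
            if *slot != 0 {
                return None; // duplicate key: out of scope for this sketch
            }
            *slot = u32::try_from(row).ok()?.checked_add(1)?;
        }
        Some(Self { min, slots })
    }

    /// Probes with a single subtraction and array read; no hashing involved.
    fn probe(&self, key: u64) -> Option<usize> {
        let offset = key.checked_sub(self.min)?;
        let slot = *self.slots.get(usize::try_from(offset).ok()?)?;
        if slot == 0 {
            None
        } else {
            Some((slot - 1) as usize)
        }
    }
}
```

Building from the keys `[100, 103, 101]` allocates four slots; probing `103` yields row 1, while probing `102` or any key outside the range yields no match.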

Key Changes:

- Introduces the `ArrayMap` struct for dense integer join key mapping using direct array indexing
- Adds two configuration parameters: `perfect_hash_join_small_build_threshold` (default: 1024) and `perfect_hash_join_min_key_density` (default: 0.2)
- Refactors the hash join code to conditionally use `ArrayMap` or `HashMap` via a new `Map` enum
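
As a rough illustration of how the two parameters could interact, the following Rust sketch decides between the array and the hash map. It assumes the small-build threshold is compared against the key range and the density is `num_rows / (max - min + 1)`, as the PR description suggests. The function name and exact rule are hypothetical and may differ from the PR.

```rust
/// Hypothetical decision helper; the PR's actual rule may differ.
/// ASSUMPTION: `small_build_threshold` bounds the key range, and
/// density is defined as num_rows / (max_key - min_key + 1).
fn should_use_array_map(
    num_rows: usize,
    min_key: u64,
    max_key: u64,
    small_build_threshold: usize,
    min_key_density: f64,
) -> bool {
    if num_rows == 0 || max_key < min_key {
        return false;
    }
    // Number of array slots needed; bail out if it overflows u64.
    let slots = match (max_key - min_key).checked_add(1) {
        Some(s) => s,
        None => return false,
    };
    // A small key range makes the array cheap regardless of density.
    if slots <= small_build_threshold as u64 {
        return true;
    }
    // Otherwise require the keys to fill enough of the range.
    (num_rows as f64) / (slots as f64) >= min_key_density
}
```

Under these assumptions and the defaults above, 100,000 rows over keys 0 to 99,999 qualify (density 1.0), while 10 rows spread over keys 0 to 1,000,000 do not.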

Reviewed changes

Copilot reviewed 17 out of 18 changed files in this pull request and generated 4 comments.

File	Description
datafusion/physical-plan/src/joins/array_map.rs	New file implementing `ArrayMap` for dense integer join keys with support for signed/unsigned integers via wrapping arithmetic
datafusion/physical-plan/src/joins/chain.rs	Extracted chain traversal logic shared between `ArrayMap` and `JoinHashMap`
datafusion/physical-plan/src/joins/mod.rs	Adds `Map` enum to unify `HashMap` and `ArrayMap` interfaces
datafusion/physical-plan/src/joins/hash_join/exec.rs	Implements `try_create_array_map` logic and integrates perfect hash join into build phase
datafusion/physical-plan/src/joins/hash_join/stream.rs	Updates stream processing to handle both `HashMap` and `ArrayMap` variants
datafusion/physical-plan/src/joins/hash_join/shared_bounds.rs	Updates bounds tracking to use `Map` instead of `JoinHashMapType`
datafusion/physical-plan/src/joins/hash_join/partitioned_hash_eval.rs	Updates `HashTableLookupExpr` to support both map types
datafusion/physical-plan/src/joins/join_hash_map.rs	Changes `JoinHashMapOffset` to `MapOffset` and extracts chain traversal
datafusion/physical-plan/src/joins/stream_join_utils.rs	Updates to use new `MapOffset` type
datafusion/common/src/config.rs	Adds two new configuration parameters for perfect hash join tuning
docs/source/user-guide/configs.md	Documents the new configuration parameters
datafusion/sqllogictest/test_files/information_schema.slt	Updates expected configuration output
datafusion/proto/tests/cases/roundtrip_physical_plan.rs	Updates serialization tests to handle new `Map` enum
datafusion/physical-plan/Cargo.toml	Adds `num-traits` dependency for numeric type conversions
benchmarks/src/hj.rs	Rewrites hash join benchmarks to use TPC-H data with varying densities
benchmarks/bench.sh	Updates to generate TPC-H SF10 data for hash join benchmarks
benchmarks/compare.py	Formatting improvements for wider query names

Copilot · 2026-01-07T12:47:39Z

datafusion/physical-plan/src/joins/array_map.rs

+    pub(crate) fn try_new(array: &ArrayRef, min_val: u64, max_val: u64) -> Result<Self> {
+        let range = max_val.wrapping_sub(min_val);
+        let size = (range + 1) as usize;


The cast `(range + 1) as usize` on line 163 could potentially overflow if `range` is close to `u64::MAX`. While the threshold checks in `try_create_array_map` should prevent extremely large ranges, there's no explicit check here. Consider adding a bounds check or documenting the assumption that `range` is validated before calling this function.

I've added a check to prevent potential overflow when calculating size.
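
One way such a check could look is sketched below in Rust. It is not the code added in the PR, and `checked_slot_count` is a hypothetical helper. It keeps the `wrapping_sub` from the snippet above and replaces the unchecked `+ 1` and `as usize` cast with checked operations.

```rust
use std::convert::TryFrom;

/// Hypothetical helper: number of slots needed for keys in
/// [min_val, max_val], or None if the count overflows.
fn checked_slot_count(min_val: u64, max_val: u64) -> Option<usize> {
    // Same range computation as the quoted snippet.
    let range = max_val.wrapping_sub(min_val);
    // `range + 1` overflows u64 when range == u64::MAX.
    let size = range.checked_add(1)?;
    // On 32-bit targets the value may not fit in usize.
    usize::try_from(size).ok()
}
```

A caller can then fall back to the regular hash map whenever this returns `None`.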

Copilot · 2026-01-07T12:47:39Z

datafusion/physical-plan/src/joins/array_map.rs

+    pub fn estimate_memory_size(min_val: u64, max_val: u64, num_rows: usize) -> usize {
+        let range = Self::calculate_range(min_val, max_val);
+        let size = (range + 1) as usize;


Similar to `try_new`, the cast `(range + 1) as usize` on line 148 could overflow if `range` is close to `u64::MAX`. Since this function is used to estimate memory before allocation, an overflow could lead to allocating much less memory than needed or wrapping to a very small value. Consider adding validation or using checked arithmetic.

I've added a check to prevent potential overflow when calculating size.

Copilot · 2026-01-07T12:47:39Z

datafusion/physical-plan/src/joins/hash_join/exec.rs

+            session_config
+                .options_mut()
+                .execution
+                .perfect_hash_join_min_key_density = 1.0 / 0.0;


The division by zero `1.0 / 0.0` is used to create positive infinity for disabling perfect hash join. While this works in Rust (floating-point division by zero produces infinity, not a panic), it's unclear and unconventional. Consider using `f64::INFINITY` for better readability and clearer intent.

Suggested change

-                .perfect_hash_join_min_key_density = 1.0 / 0.0;
+                .perfect_hash_join_min_key_density = f64::INFINITY;

Accepted. Using f64::INFINITY is indeed much clearer.
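
A small Rust sketch of why an infinite threshold disables the density path: both spellings yield the same value, and no finite density can reach it. `density_passes` is a hypothetical stand-in for whatever comparison the PR performs.

```rust
/// Hypothetical check: the array path is taken only when the observed
/// density reaches the configured minimum.
fn density_passes(density: f64, min_key_density: f64) -> bool {
    density >= min_key_density
}

/// Both spellings of "disabled" produce positive infinity.
fn disabled_thresholds() -> (f64, f64) {
    (1.0_f64 / 0.0, f64::INFINITY)
}
```

Since density is at most 1.0 for unique keys, `density_passes(1.0, f64::INFINITY)` is false, so the setting acts as an off switch for that condition.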

Copilot · 2026-01-07T12:47:40Z

datafusion/physical-plan/src/joins/hash_join/exec.rs

+/// (also known as a "perfect hash join") instead of a general-purpose hash map.
+/// This optimization is used when:
+/// 1. There is exactly one join key.
+/// 2. The join key can be any integer type convertible to u64 (excluding i128 and u128).


The documentation refers to "excluding i128 and u128" types, but these types are not actually checked or handled anywhere in the code. The `is_supported_type` function in `array_map.rs` only supports up to 64-bit integers (`Int8`-`Int64`, `UInt8`-`UInt64`). Consider removing the mention of `i128`/`u128` from the documentation or clarifying that they're not supported due to design limitations, not just current implementation.

Suggested change

-/// 2. The join key can be any integer type convertible to u64 (excluding i128 and u128).
+/// 2. The join key is an integer type up to 64 bits wide that can be losslessly converted
+/// to `u64` (128-bit integer types such as `i128` and `u128` are not supported).

I've applied the suggested change
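
The file summary above mentions that signed and unsigned keys are supported via wrapping arithmetic. A minimal Rust sketch of how that can work for `i64` keys follows. `slot_offset` is a hypothetical helper, and the PR's actual conversion may differ.

```rust
/// Hypothetical helper: distance of `key` from `min` after both are
/// reinterpreted as u64. Wrapping subtraction gives the correct
/// non-negative offset even when one or both values are negative.
fn slot_offset(key: i64, min: i64) -> u64 {
    (key as u64).wrapping_sub(min as u64)
}
```

For example, with `min = -10` the key `-5` lands at offset 5, and the full `i64` span from `i64::MIN` to `i64::MAX` maps to offset `u64::MAX`, which is why the range needs an overflow check before it is used as an array size.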

github-actions bot added the common and physical-plan labels Dec 19, 2025

Dandandan reviewed Dec 19, 2025

UBarney force-pushed the perfect_hj branch from a442460 to 31efb2d Compare December 21, 2025 14:15

UBarney commented Dec 21, 2025

github-actions bot added the proto, documentation, and sqllogictest labels Dec 23, 2025

UBarney force-pushed the perfect_hj branch 4 times, most recently from 8044bce to 954f1df Compare December 28, 2025 08:02

UBarney marked this pull request as ready for review December 28, 2025 08:02

UBarney force-pushed the perfect_hj branch from 954f1df to 35018b6 Compare December 31, 2025 06:48

UBarney force-pushed the perfect_hj branch 2 times, most recently from 37cd7ad to 2580507 Compare January 6, 2026 09:49

UBarney added 2 commits January 7, 2026 09:25

feat: support perfect hash join

b847020

perfect_hash_join_min_key_density=0.2

e9b35e5

UBarney force-pushed the perfect_hj branch from 2580507 to e9b35e5 Compare January 7, 2026 09:42

fix ci

9ae24a1

Dandandan requested a review from Copilot January 7, 2026 12:41

Copilot started reviewing on behalf of Dandandan January 7, 2026 12:41

Copilot AI reviewed Jan 7, 2026

address comment

c772070

-	let mut next: Option<Vec<u32>> = None;
+	let mut next: Vec<u32> = vec![];

