-
Notifications
You must be signed in to change notification settings - Fork 260
WIP: Add support for remote Parquet HDFS writer with openDAL #2929
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
Codecov Report❌ Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## main #2929 +/- ##
============================================
+ Coverage 56.12% 59.59% +3.46%
- Complexity 976 1377 +401
============================================
Files 119 167 +48
Lines 11743 15494 +3751
Branches 2251 2570 +319
============================================
+ Hits 6591 9233 +2642
- Misses 4012 4961 +949
- Partials 1140 1300 +160 ☔ View full report in Codecov by Sentry. 🚀 New features to boost your workflow:
|
e2dfaa1 to
cc21ac4
Compare
|
Filed #2971 for double writes |
cc21ac4 to
27c4d54
Compare
| fn parse_hdfs_url(url: &Url) -> Result<(Box<dyn ObjectStore>, Path), object_store::Error> { | ||
| // Creates an HDFS object store from a URL using the native HDFS implementation | ||
| #[cfg(all(feature = "hdfs", not(feature = "hdfs-opendal")))] | ||
| fn create_hdfs_object_store( |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
renamed from parse_hdfs_url to reflect more sense
| if (!cmd.outputPath.toString.startsWith("file:")) { | ||
| return Unsupported(Some("Only local filesystem output paths are supported")) | ||
| if (!cmd.outputPath.toString.startsWith("file:") && !cmd.outputPath.toString | ||
| .startsWith("hdfs:")) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
hdfs-opendal can support more schemes, see: #2272
| sync::Arc, | ||
| }; | ||
|
|
||
| use opendal::{services::Hdfs, Operator}; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is hdfs-opendal always enabled?

Which issue does this PR close?
Running experiments to use openDAL with HDFS writes on local and remote clusters
Closes #2890 .
Rationale for this change
What changes are included in this PR?
How are these changes tested?