Replies: 21 comments 6 replies
-
Why do we disallow sorting on binary attributes?
-
In the PR for #3774 we discussed sorting on binary attributes. The decision was based on the fact that the expected order may be different for every user. What do you think?
-
I do need to sort binary data. For example, I can have the numbers 1, 2, 3, and I want to sort them as binary. We don't need to limit the possible use cases.
-
What do you mean by sorting 1, 2, 3 as binary? Aren't they sorted as long or some other type?
-
Yes, I think that is also unexpected for some users, and it may confuse real users. For example: a user has a CSV with a column containing scientific-notation numbers mixed with normal floats. If I sort in my Python code with the pandas defaults, pandas will automatically cast the column to float and the sort will be right. In Texera, right now, when I read my CSV and then sort by that column, the values are sorted as strings, which gives a totally different output from my Python code. Since there is no hint in the UI that the column is a string, the user will have a bad experience.

Extra question: on the JVM (Java/Scala), byte/Byte is a signed 8-bit two's-complement value (-128..127). Isn't this another source of problems when sorting bytes? I know similar things can apply to strings, but managing bytes is more complex, which means that for users who need byte sorting we should provide more parameters for how to cast and handle the bytes, right?
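To make the CSV scenario above concrete, here is a minimal sketch in plain Python (no pandas, so it runs anywhere). The column values are made up for illustration; the point is only that sorting the same values as strings and as floats gives different orders.

```python
# Hypothetical column values as they might appear in a CSV file:
# scientific notation mixed with ordinary decimals.
raw_values = ["1e3", "250", "2.5e1", "9.75", "1.2e-2"]

# Sorting the raw strings compares them character by character.
sorted_as_strings = sorted(raw_values)

# Casting each value to float first gives the numeric order.
sorted_as_floats = sorted(raw_values, key=float)

print(sorted_as_strings)  # ['1.2e-2', '1e3', '2.5e1', '250', '9.75']
print(sorted_as_floats)   # ['1.2e-2', '9.75', '2.5e1', '250', '1e3']
```

Neither result is wrong in itself. They differ because the sort key differs, which is why the inferred column type matters before the sort runs.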
-
All the potential issues you described have nothing to do with sorting; they are more about how to propagate the type correctly before sorting. The sort operator should be general and capable of processing all data types. How to make the result correct is a separate issue. I don't think we should disable a function because we can't, or don't want to, spend time making the result correct. Sorting on binary types is supported in pretty much all database systems, so I think we should support it. I don't have much insight into the technical difficulty since I didn't review the PR, but as a user, I would like to be able to sort binary data.
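On the signed-byte question raised earlier in the thread, the sketch below shows how the choice of byte interpretation changes the order. It is an illustration only, not the Texera implementation: the helper names (`to_signed`, `unsigned_key`, `signed_key`) and the sample values are hypothetical. Python compares `bytes` lexicographically with each byte treated as unsigned (0..255); the signed key simulates a per-byte comparison on JVM-style signed bytes (-128..127).

```python
def to_signed(b: int) -> int:
    """Reinterpret an unsigned byte (0..255) as a signed byte (-128..127)."""
    return b - 256 if b > 127 else b


def unsigned_key(value: bytes) -> tuple:
    """Sort key: lexicographic order, each byte read as 0..255."""
    return tuple(value)


def signed_key(value: bytes) -> tuple:
    """Sort key: lexicographic order, each byte read as -128..127."""
    return tuple(to_signed(b) for b in value)


# Hypothetical binary values, chosen to straddle the 0x7F/0x80 boundary.
values = [
    bytes([0x01]),
    bytes([0x7F]),
    bytes([0x80]),
    bytes([0xFF]),
    bytes([0x01, 0x00]),
]

unsigned_order = sorted(values, key=unsigned_key)
signed_order = sorted(values, key=signed_key)

print([v.hex() for v in unsigned_order])  # ['01', '0100', '7f', '80', 'ff']
print([v.hex() for v in signed_order])    # ['80', 'ff', '01', '0100', '7f']
```

The two orders agree for bytes below 0x80 and disagree above it, so whichever convention a sort operator picks would need to be stated explicitly.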
-
You can't say they have nothing to do with sorting if in the end they affect the sorting, right? I don't think we will reach an agreement. If @chenlica agrees with you, then I will close this issue and add binary sorting for the Scala version. @Yicong-Huang also agrees with you.
-
My understanding is that both perspectives are right.
-
Each perspective has its own merits. To help us decide on a design, can we evaluate existing systems and look at their design decisions?
-
How about RapidMiner?
-
Hello, good evening. I think RapidMiner does not have a binary array data type: https://docs.rapidminer.com/9.8/studio/getting-started/important-terms.html#data-types
Also, once the tables are loaded, it does not restrict sorting based on any data type.
-
In this case, I suggest we support sorting on binary attributes. @aglinxinyuan @aicam @carloea2 please share your thoughts so that we can finish this issue.
-
After reviewing other platforms, I agree we should enable sorting on binary attributes. KNIME supports it, and it should be a small change on our side.
-
I agree
-
I converted this issue to a discussion. It seems we all agreed to add support for sorting binary attributes, which is good. @carloea2 please create another task issue to track it.
-
@carloea2 Are we done with this discussion? If so, we can close it.
-
Make sure to mention related issue(s) and PR(s).




-
Task Summary
Add a frontend message explaining that sorting on binary attributes is not supported, and briefly mention alternatives such as extracting sortable features from them beforehand.
Priority
P1 – Low
Task Type