-
Notifications
You must be signed in to change notification settings - Fork 245
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[FEA] Take a closer look at JsonTuple #10405
Comments
|
#10420 disables json_tuple by default. Ideally we should look at pulling in Spark unit tests for json_tuple to make sure that we can match what they are doing. At a minimum we need to look at adding tests around single quotes, white space normalization, number normalization, mixed types, etc. |
@revans2 I had a look, and none of those tests seem like a blocker to me to enable
Let me know what you think. |
Sure those tests sound find to me. I think we already have tests for get_json_object that cover them, but having ones explicitly for json_tuple would be good. |
This PR adds tests as specified in #10405. Specifically, the cases addressed are: dictionary of ints, \r\n whitespace in strings, and garbage at the end of input lines (e.g. '{"a":100} this is valid according to spark'). Signed-off-by: Warrick He <[email protected]>
Is your feature request related to a problem? Please describe.
We recently disabled
get_json_object
andfrom_json
by default because of errors that can crop up when using them. But we didn't do the same kind of thing forjson_tuple
. Ourjson_tuple
is implemented in terms ofget_json_object
. It is not perfect, but it probably has the same issues with escaping thatget_json_object
also has, which caused it to be disabled by default. We should spend some more time testing it and see if we need to also disable it by default. At least until the issues withget_json_object
are fixed.The text was updated successfully, but these errors were encountered: