[#142] Sort the columns in exploded_df_a and exploded_df_b when selecting them out

Previously we were selecting with a set, so the columns got all mixed up. Let's
sort them so that they are easier to work with. The order of the columns should
not affect the results.
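As a rough illustration of the ordering problem described above, the sketch below uses plain Python rather than Spark. The column names are hypothetical stand-ins, not taken from hlink. It shows that a set carries no guaranteed iteration order, while sorted() always returns the same list.

```python
# Hypothetical column names for illustration only; the real names come from
# the hlink configuration and are not shown in this commit.
all_column_names = {"id", "namefrst", "namelast", "birthyr"}

# Iterating over a set yields the names in an arbitrary order. For strings,
# that order can differ between interpreter runs because hashing is
# randomized by default.
unsorted_columns = list(all_column_names)

# sorted() returns a new list in lexicographic order, regardless of how the
# set happens to iterate.
sorted_columns = sorted(all_column_names)
print(sorted_columns)  # ['birthyr', 'id', 'namefrst', 'namelast']
```

Because sorted() returns a list, the changed call passes a single list to select() where the old call unpacked the set with `*`. PySpark's DataFrame.select accepts either form.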
riley-harper committed Aug 13, 2024
1 parent e57ac93 commit 71b9db5
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion hlink/linking/matching/link_step_explode.py
@@ -153,7 +153,7 @@ def _explode(
         # be a breaking change to remove this. We'd have to look into the
         # ramifications.
         if len(all_exploding_columns) > 0:
-            exploded_df = exploded_df.select(*all_column_names)
+            exploded_df = exploded_df.select(sorted(all_column_names))

         return exploded_df
