Problem Description
Columns of type id should be dropped when computing metrics. ID columns are not synthetically generated, so they should not contribute to the metric calculation. Currently they are classified as categorical, which is incorrect.
Expected behavior
DetectionMetric should use the provided metadata to drop id columns from the real and synthetic data when computing the metric. This logic should also be applied to any other relevant metrics.
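The expected behavior could be sketched roughly as follows. This is only an illustration, not the library's implementation: it assumes the metadata is a dict of the form {'fields': {column_name: {'type': ...}}}, that the real and synthetic data are pandas DataFrames, and the helper names get_id_columns and drop_id_columns are hypothetical.

```python
def get_id_columns(metadata):
    """Return the names of all fields whose declared type is 'id'.

    Assumes metadata looks like {'fields': {name: {'type': ...}}}.
    """
    fields = metadata.get('fields', {})
    return [name for name, field in fields.items() if field.get('type') == 'id']


def drop_id_columns(real_data, synthetic_data, metadata):
    """Return copies of both tables without the id columns.

    real_data and synthetic_data are assumed to be pandas DataFrames.
    Columns listed in the metadata but absent from a table are ignored.
    """
    id_columns = set(get_id_columns(metadata))
    keep_real = [col for col in real_data.columns if col not in id_columns]
    keep_synthetic = [col for col in synthetic_data.columns if col not in id_columns]
    return real_data[keep_real], synthetic_data[keep_synthetic]
```

A metric would call drop_id_columns before fitting or scoring, so that id columns never reach the step that treats unknown columns as categorical.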
Some metrics are already implemented with the correct behavior. For example, any MultiSingleColumn metric (such as KSTest) declares the set of field_types it is compatible with, and only fields of those types are selected when computing the metric.
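As a rough illustration of that allow-list approach (again assuming the {'fields': {name: {'type': ...}}} metadata layout; select_fields and ExampleMetric are hypothetical names, not the library's API):

```python
def select_fields(metadata, field_types):
    """Return the names of fields whose type is one of field_types."""
    fields = metadata.get('fields', {})
    return [name for name, field in fields.items() if field.get('type') in field_types]


class ExampleMetric:
    # Hypothetical metric that only supports numerical columns.
    field_types = ('numerical',)

    @classmethod
    def compatible_fields(cls, metadata):
        # Fields of type 'id' are never selected because 'id'
        # is not among the declared field_types.
        return select_fields(metadata, cls.field_types)
```

Because the selection is an allow-list, id columns are excluded without any special handling.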