Skip to content

Add a fuzzy version of the AggJoiner #1289

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
rcap107 opened this issue Apr 7, 2025 · 0 comments
Open

Add a fuzzy version of the AggJoiner #1289

rcap107 opened this issue Apr 7, 2025 · 0 comments
Labels
enhancement New feature or request

Comments

@rcap107
Copy link
Contributor

rcap107 commented Apr 7, 2025

Problem Description

The current implementation of the AggJoiner executes an exact left join between the main table and the aux table. It would be useful to have a "fuzzy" version of this that allows to aggregate the "fuzzy" groups that are found using the machinery implemented in the Joiner.

This would be very useful with geographical data, for example: by using latitude and longitude as join keys, it would become possible to aggregate new features based on the neighborhood of a sample in the main table.

Feature Description

This could be either a new object (FuzzyAggJoiner?), or be implemented as a flag in the current AggJoiner.
At join time, rather than using the current implementation, the groups identified by the Joiner should be used to find the aggregations and the new features should be added.

Alternative Solutions

No response

Additional Context

No response

@rcap107 rcap107 added the enhancement New feature or request label Apr 7, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant