Adding in synthetic dataset generators based on the original FedProx paper #238

Merged
merged 7 commits into from
Sep 26, 2024

emersodb
Collaborator

PR Type

Feature

Short Description

Clickup Ticket(s): Link

This small PR implements synthetic dataset generators based on the original FedProx paper, intended for use in future experiments and replication studies. The generation process includes a few small modifications, but it is largely inspired by the original implementation.

Note: These synthetic datasets end up with pretty skewed labels, which is a bit disappointing. See the tests for additional illustration. However, they will still be useful. It's just something to be aware of.
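
For readers unfamiliar with the recipe, here is a minimal, dependency-free sketch of the non-IID Synthetic(alpha, beta) generation for a single client, as I understand it from the FedProx paper. It is not the code in this PR: the function name `generate_client_data`, its parameters, and the choice to treat `alpha` and `beta` as standard deviations are all assumptions made for illustration. The IID variant would instead share one affine map and one input distribution across all clients.

```python
import random
from collections import Counter


def generate_client_data(n_samples, alpha, beta, rng, input_dim=60, n_classes=10):
    """Illustrative sketch of Synthetic(alpha, beta) data for ONE client.

    Hypothetical helper, not the fl4health API. alpha and beta are treated
    as standard deviations here, which is an assumption.
    """
    # Client-level heterogeneity: alpha shifts the labeling model,
    # beta shifts the input distribution.
    model_mean = rng.gauss(0.0, alpha)
    input_mean_center = rng.gauss(0.0, beta)

    # Client-specific affine map W_k x + b_k, entries drawn around model_mean.
    weights = [[rng.gauss(model_mean, 1.0) for _ in range(input_dim)]
               for _ in range(n_classes)]
    biases = [rng.gauss(model_mean, 1.0) for _ in range(n_classes)]

    # Inputs: per-feature means drawn around input_mean_center, with a
    # diagonal covariance whose j-th entry (1-indexed) is j^(-1.2).
    feature_means = [rng.gauss(input_mean_center, 1.0) for _ in range(input_dim)]
    feature_stds = [((j + 1) ** -1.2) ** 0.5 for j in range(input_dim)]

    inputs, labels = [], []
    for _ in range(n_samples):
        x = [rng.gauss(m, s) for m, s in zip(feature_means, feature_stds)]
        logits = [sum(w * xi for w, xi in zip(row, x)) + b
                  for row, b in zip(weights, biases)]
        # The argmax of the softmax equals the argmax of the raw logits.
        labels.append(max(range(n_classes), key=logits.__getitem__))
        inputs.append(x)
    return inputs, labels


# Label histogram for one hypothetical client.
_, client_labels = generate_client_data(200, alpha=1.0, beta=1.0, rng=random.Random(0))
print(Counter(client_labels).most_common())
```

Counting labels per client this way is a quick check on how concentrated the label distribution is for a given client. Because labels come from an argmax over a single random affine map, a few classes can plausibly dominate.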

Tests Added

Several tests are implemented to check the correctness of both generators (IID and non-IID).

Collaborator

@fatemetkl fatemetkl left a comment

LGTM! Just some minor comments/questions.

fl4health/utils/data_generation.py (3 review threads, all resolved)
@emersodb emersodb merged commit 9e1097c into main Sep 26, 2024
6 checks passed
@emersodb emersodb deleted the dbe/fedprox_synthetic_data_generator branch September 26, 2024 15:44
2 participants