For federated learning to be trusted in real-world applications, particularly where avoiding bias is of critical importance and where data protection legislation such as GDPR limits the ability of the server to validate decisions directly against datasets, the implications of federated learning for fairness must be better understood. Current state-of-the-art surveys cite the lack of clear definitions and metrics for quantifying fairness in federated learning as a key open problem. This project proposes definitions for a number of notions of fairness, with corresponding metrics to quantify each. The metrics are used to benchmark a number of existing approaches, offering a unique insight into fairness performance and improving the explainability and transparency of systems without violating data privacy.
NOTE: This repository demonstrates the output of my final-year Electrical and Electronic Engineering MEng Individual Research Project at the University of Bristol.
The project would not be possible without the fantastic array of open-source tools and datasets that facilitate federated learning research. This project utilises:
- Flower
- Hugging Face Datasets, including NSL-KDD and CIFAR-10
- PyTorch
- TensorFlow Federated Datasets, used to exploit the natural partitioning of the LEAF dataset FEMNIST (see the sketch after this list)
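A minimal sketch of pulling the naturally partitioned FEMNIST data via TensorFlow Federated (variable names here are my own; the repository's actual loading is in source/FEMNIST_loading.ipynb):

```python
import tensorflow_federated as tff

# FEMNIST (federated EMNIST) ships with its natural per-writer partitioning.
train_data, test_data = tff.simulation.datasets.emnist.load_data(only_digits=False)

# One client corresponds to one writer; materialise the first writer's examples.
first_writer = train_data.client_ids[0]
writer_dataset = train_data.create_tf_dataset_for_client(first_writer)
```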
The project uses Flower simulations and was run on an NVIDIA A40 GPU.
$\mathcal{J}$ represents Jain's Fairness Index, a common measure of uniformity over datapoints:

$$\mathcal{J}(x_1, \dots, x_n) = \frac{\left(\sum_{i=1}^{n} x_i\right)^2}{n \sum_{i=1}^{n} x_i^2}$$

where $x_i$ is the measured value (for example, accuracy) for participant $i$ and $n$ is the number of participants. $\mathcal{J}$ equals 1 when all values are equal and tends towards $1/n$ as a single value dominates.
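For illustration, a minimal NumPy implementation (the function name and example values are my own, not taken from this repository):

```python
import numpy as np

def jains_index(x):
    """Jain's Fairness Index of a vector of non-negative values.

    Returns 1.0 when all values are equal and tends towards 1/n
    as a single value dominates the rest.
    """
    x = np.asarray(x, dtype=float)
    return x.sum() ** 2 / (len(x) * np.square(x).sum())

print(jains_index([0.8, 0.8, 0.8, 0.8]))  # 1.0 - perfectly uniform
print(jains_index([0.9, 0.1, 0.1, 0.1]))  # ~0.43 - one client dominates
```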
General fairness in federated learning is broken down into the following four symptomatic notions:

1. Do all clients perform proportionately to their contribution?
2. Do subgroups of the population that exhibit sensitive attributes perform equivalently to those without?
3. Are clients rewarded proportionately to their contributions, and within equal timeframes?
4. Does the server succeed in its role of orchestrating a learning ecosystem that maximises the objective function?
General fairness is proposed as the weighted sum of the above notions. In the case that each notion is weighted equally, the following expression arises to define general fairness $F$:

$$F = \frac{1}{4} \sum_{i=1}^{4} F_i$$

where $F_i$ is the metric score for the $i$-th notion above.
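As a minimal sketch of this combination (assuming each notion has already been scored on a common $[0, 1]$ scale; the parameter names below are descriptive placeholders, not identifiers from this repository):

```python
def general_fairness(contribution, group, incentive, orchestration, weights=None):
    """General fairness as the weighted sum of the four notion scores.

    Each score is assumed to lie in [0, 1]; equal weights by default.
    """
    scores = [contribution, group, incentive, orchestration]
    weights = weights or [0.25] * len(scores)
    return sum(w * s for w, s in zip(weights, scores))
```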
The project benchmarks use the following variables:
- Approach/strategy - FedAvg, q-FedAvg, Ditto and FedMinMax
- Datasets and number of clients - CIFAR-10 with 10 clients, CIFAR-10 with 100 clients, NSL-KDD with 100 clients, Federated-EMNIST (FEMNIST) with 205 clients.
- Heterogeneity - each dataset is simulated with both IID and non-IID partitioning between clients. The Dirichlet partitioner is used to emulate the non-IID setting for NSL-KDD and CIFAR-10, with values of $\alpha$ varying between datasets (see the sketch below). The non-IID case for the FEMNIST dataset is achieved using natural partitioning per writer.
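For example, a Dirichlet partition of CIFAR-10 over 100 clients can be produced with Flower Datasets (a minimal sketch; the $\alpha$ value here is illustrative, and the repository's own loaders live in source/load_cifar.py and source/load_nslkdd.py):

```python
from flwr_datasets import FederatedDataset
from flwr_datasets.partitioner import DirichletPartitioner, IidPartitioner

# Non-IID: each client's label distribution is drawn from Dir(alpha).
# Lower alpha means more heterogeneous clients; alpha=0.5 is illustrative.
niid = DirichletPartitioner(num_partitions=100, partition_by="label", alpha=0.5)
iid = IidPartitioner(num_partitions=100)  # IID baseline for comparison

fds = FederatedDataset(dataset="cifar10", partitioners={"train": niid})
client_0 = fds.load_partition(0)  # one client's Hugging Face Dataset
```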
Clone the repository and install the necessary dependencies:
```
pip install -r requirements.txt
```
Select an experiment of choice from the root directory; conditions are indicated by the filename, and the config parameters at the top of each script may be adjusted. Run the experiment from the root, for example:
```
python fedavg_cifar_iid_100c_v1.py
```
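Each experiment script follows the same broad shape. The sketch below is illustrative only: the client factory is hypothetical (the repository's client implementation lives in source/client.py), and the participation fraction and round count are read off the filename conventions rather than copied from any script:

```python
import flwr as fl

def client_fn(cid: str):
    # Hypothetical factory: construct the Flower client that trains on
    # the data partition identified by `cid`.
    ...

strategy = fl.server.strategy.FedAvg(
    fraction_fit=0.05,        # e.g. 5% of clients sampled per round
    min_available_clients=100,
)

fl.simulation.start_simulation(
    client_fn=client_fn,
    num_clients=100,                               # "100c" in the filename
    config=fl.server.ServerConfig(num_rounds=30),  # rounds per experiment
    strategy=strategy,
)
```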
NOTE: for FEMNIST simulations, a .pickle file containing a copy of the appropriately partitioned PyTorch dataloaders must be provided. This is because the dataset used was sourced as a TensorFlow Federated Dataset before the release of Flower Datasets. An alternative would be to use Flower and Hugging Face datasets to implement the EMNIST partitioning, similarly to what has been achieved for NSL-KDD and CIFAR-10 (as in source/load_cifar.py, for example).
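A minimal sketch of supplying the pickles (the internal layout of the unpickled object is an assumption; source/FEMNIST_loading.ipynb shows how the files were produced):

```python
import pickle

# Load the pre-partitioned PyTorch dataloaders for the non-IID setting.
with open("femnist/femnist_niid_loaded.pickle", "rb") as f:
    femnist_loaders = pickle.load(f)
```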
Using the plotting scripts provided, results such as those below can be obtained:
Federated learning comes in many flavours. In order to constrain the scope, this research is applicable to federated learning systems with the following characteristics:
- Centrally orchestrated – there exists a single central entity that organises the learning and is responsible for initiating training rounds, aggregating the models and selecting clients. This entity is referred to interchangeably as the server and the orchestrator, and is assumed to be trustworthy. Implementations using distributed or blockchain control are out of scope for this project.
- Horizontal – horizontal federated learning is assumed, where the feature space is consistent across clients, as is the case in most federated learning applications, including Google's Gboard. This simplification keeps the focus on fairness without the problem of entity alignment that arises in vertical federated learning.
- Task Agnostic – this work does not have a specific application in mind, and optimisation of the model is considered out of scope. Configurations that achieve satisfactory performance in centralised settings are selected and deployed to the clients.
- Known Sensitive Attributes – the labels corresponding to protected groups must be known by the clients and the server in order to be measured.
- Blackbox Clients – no information is available about the clients, for example regarding database size, dataset distribution, intended participation rate, communication capability or processing ability. However, it can be assumed that each client is capable of processing the model in question.
```
├── LICENSE
├── README.md
├── requirements.txt
├── Results
│   ├── Ditto_CIFAR_iid_100C_5PC_10E_30R_v1.json
│   ├── ...
│   │   <.json results files - 3 per experiment>
│   ├── ...
│   ├── q_FedAvg_NSLKDD_niid_100C_5PC_5E_30R_v3.json
│   └── Plots
│       ├── Ditto_CIFAR_iid_100C_bar.png
│       ├── ...
│       │   <.png result images>
│       ├── ...
│       └── q_FedAvg_NSLKDD_niid_100C_tS.png
├── ditto_cifar_iid_100c_v1.py
├── ...
│   <individual experiment .py files>
├── ...
├── q_fedavg_nslkdd_niid_100c_v1.py
├── plotter_v1.py
├── plotter_v2.py
├── femnist
│   ├── femnist_iid_loaded.pickle
│   └── femnist_niid_loaded.pickle
└── source
    ├── cifar_net.py
    ├── client.py
    ├── ditto.py
    ├── fedminmax.py
    ├── femnist_net.py
    ├── load_cifar.py
    ├── load_nslkdd.py
    ├── nslkdd_net.py
    ├── FEMNIST_loading.ipynb
    └── shapley.py
```
Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at
http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.