Enable federated XGBoost using bootstrap aggregation in Task Runner #1151

kta-intel · 2024-11-15T19:40:33Z

This PR enables a TaskRunner-based federated XGBoost using the bootstrap aggregation

Specifically this PR:

creates an xgb_higgs task runner workspace to train on the higgs dataset [ref] with all required code (i.e. src/taskrunner.py, src/dataloader.py ,plan/*.yaml, etc.
adds a tasks_xgb.yaml to enable new FedBaggingXGBoost aggregation when running xgb training workloads
adds delta_updates parameter to Aggregator in order to bypass delta updating (for deep learning models getting weight deltas makes sense since the model size should stay relatively consistent, for tree-based algorithms, this makes less sense because more trees are added over time)
- delta_updates is set to true by default to preserve normal behavior. xgboost taskrunner explicitly sets it to false to bypass it
introduces new loader_xgb.py as the backend / superclass to src/dataloader.py
introduces new runner_xgb.py as the backend / superclass to src/taskrunner.py
introduces new federated boostrap algorithm for xgboost in aggregation_function.fed_bagging which bags the latest trees to a global model, consistent with currently accept federated xgboost algorithms in the industry

Signed-off-by: kta-intel <[email protected]>

This reverts commit d3937ef. Signed-off-by: kta-intel <[email protected]>

Signed-off-by: kta-intel <[email protected]>

MasterSkepticista · 2024-11-18T07:22:39Z

openfl-workspace/workspace/plan/defaults/tasks_xgb.yaml

General question: Have you used specific formatters for yaml files?

I copied over the yaml from other workspaces as a template then ran bash shell/format.sh in the whole repo. Is there something additional that you recommend?

openfl-workspace/xgb_higgs/.workspace

openfl-workspace/xgb_higgs/plan/cols.yaml

openfl-workspace/xgb_higgs/src/dataloader.py

openfl-workspace/xgb_higgs/src/setup_data.py

openfl/component/aggregator/aggregator.py

openfl/federated/task/runner_xgb.py

openfl-workspace/xgb_higgs/plan/defaults

openfl-workspace/xgb_higgs/plan/data.yaml

teoparvanov

Awesome work, thanks @kta-intel! I have a couple of questions and comments, but overall the PR looks in excellent shape for such a sizeable new feature.

PS: is there an easy way to add a CI job that covers at least the "happy path" of an XGBoost-based federation?

openfl-workspace/xgb_higgs/src/setup_data.py

openfl/federated/data/loader_xgb.py

openfl/federated/task/runner_xgb.py

openfl/interface/aggregation_functions/fed_bagging.py

Signed-off-by: kta-intel <[email protected]>

kta-intel · 2024-11-18T21:13:10Z

PS: is there an easy way to add a CI job that covers at least the "happy path" of an XGBoost-based federation?

Thanks for the review Teo! I'm not sure what you mean by this, would this just be a toy sanity check CI?

Signed-off-by: kta-intel <[email protected]>

openfl-workspace/xgb_higgs/src/dataloader.py

teoparvanov · 2024-11-19T07:29:46Z

PS: is there an easy way to add a CI job that covers at least the "happy path" of an XGBoost-based federation?

Thanks for the review Teo! I'm not sure what you mean by this, would this just be a toy sanity check CI?

Yes, I mean a very basic CI job that does an E2E run of the XGBoost workspace. Like here, but using xgb_higgs as the template, and with a reduced test matrix (f.e. just ubuntu + python3.10). However, if the effort for this turns out to be significant, you can also consider doing it in a separate PR.

Other than that, the PR is ready to be merged IMO. Thanks, @kta-intel !

kta-intel · 2024-11-19T18:10:34Z

Yes, I mean a very basic CI job that does an E2E run of the XGBoost workspace. Like here, but using the xgb_higgs as a template, and with a reduced test matrix (f.e. just ubuntu + python3.10). However, if the effort for this turns out to be significant, you can also consider doing it in a separate PR.

I see! This is a good idea, but lets save it as a separate PR. Thanks for the review and suggestion(s) @teoparvanov!

kta-intel added 15 commits November 8, 2024 09:05

initial xgboost workspace commit

33d304f

Signed-off-by: kta-intel <[email protected]>

updating taskrunner and aggregation function

93dc8b4

Signed-off-by: kta-intel <[email protected]>

runner updates

52fea84

Signed-off-by: kta-intel <[email protected]>

logic for loader

1275fd6

Signed-off-by: kta-intel <[email protected]>

enabling work

49f5cdf

Signed-off-by: kta-intel <[email protected]>

further enabling work

ddece36

Signed-off-by: kta-intel <[email protected]>

fix first round local validation

c7e2d76

Signed-off-by: kta-intel <[email protected]>

remove need to convert to float64

9d385a7

Signed-off-by: kta-intel <[email protected]>

fix model save

ce4b34f

Signed-off-by: kta-intel <[email protected]>

remove set_trace and fix spacing

70e4171

Signed-off-by: kta-intel <[email protected]>

rename workspace and fix plan

3d2df78

Signed-off-by: kta-intel <[email protected]>

fix lint

54cdc5e

Signed-off-by: kta-intel <[email protected]>

more formatting fixes

51a0afa

Signed-off-by: kta-intel <[email protected]>

revert space removal

d3937ef

Signed-off-by: kta-intel <[email protected]>

Revert "revert space removal"

dd2027c

This reverts commit d3937ef. Signed-off-by: kta-intel <[email protected]>

kta-intel force-pushed the xgboost-fedbagging branch from 6c79178 to dd2027c Compare November 15, 2024 21:50

kta-intel added 3 commits November 15, 2024 13:53

revert changes on interface.plan

e008e4a

Signed-off-by: kta-intel <[email protected]>

remove from history. unchanged

3cbd5e5

Signed-off-by: kta-intel <[email protected]>

reverting back to fresh state for interface.plan

051d8fc

Signed-off-by: kta-intel <[email protected]>

kta-intel changed the title ~~[WIP] Enable federated XGBoost using bootstrap aggregation in Task Runner~~ Enable federated XGBoost using bootstrap aggregation in Task Runner Nov 15, 2024

kta-intel marked this pull request as ready for review November 15, 2024 22:14

kta-intel requested review from MasterSkepticista, teoparvanov and psfoley November 15, 2024 22:14

kta-intel and others added 4 commits November 15, 2024 17:18

Merge branch 'securefederatedai:develop' into xgboost-fedbagging

58172c1

move delta_updates below assigner in args

a8d9b59

Signed-off-by: kta-intel <[email protected]>

add delta_update default to True, remove from yaml

5f1d909

Signed-off-by: kta-intel <[email protected]>

enable modin pandas

3670bd0

Signed-off-by: kta-intel <[email protected]>

MasterSkepticista requested changes Nov 18, 2024

View reviewed changes

teoparvanov reviewed Nov 18, 2024

View reviewed changes

openfl/interface/aggregation_functions/fed_bagging.py Show resolved Hide resolved

kta-intel added 8 commits November 18, 2024 08:04

add DO NOT EDIT notice

dcfdd70

Signed-off-by: kta-intel <[email protected]>

added docstrings

bd03eac

Signed-off-by: kta-intel <[email protected]>

set DEFAULT_PATH to cwd

326069d

Signed-off-by: kta-intel <[email protected]>

fix docstrings and remove commented out lines

8a75cc5

Signed-off-by: kta-intel <[email protected]>

change to use_delta_updates for readibility

450d8c3

Signed-off-by: kta-intel <[email protected]>

split test data for collaborators

eecffe0

Signed-off-by: kta-intel <[email protected]>

clean up methods

238448f

Signed-off-by: kta-intel <[email protected]>

clean up taskrunner

16cd7e1

Signed-off-by: kta-intel <[email protected]>

teoparvanov approved these changes Nov 18, 2024

View reviewed changes

kta-intel added 4 commits November 18, 2024 10:24

remove conditional for unused condition

4c03932

Signed-off-by: kta-intel <[email protected]>

add conversion check

6aa9838

Signed-off-by: kta-intel <[email protected]>

set global model attribute to np array for consistency

ac2a925

Signed-off-by: kta-intel <[email protected]>

raise value error when model is empty when trying to set tensor dict

d65def1

Signed-off-by: kta-intel <[email protected]>

kta-intel added 7 commits November 18, 2024 13:23

remove conversion checker to avoid circular import issue

63be874

Signed-off-by: kta-intel <[email protected]>

add docstring and more descriptive comments

b346b24

Signed-off-by: kta-intel <[email protected]>

formatting fix

809b69b

Signed-off-by: kta-intel <[email protected]>

fixing import sorting

cf67f62

Signed-off-by: kta-intel <[email protected]>

format fix

acb89d5

Signed-off-by: kta-intel <[email protected]>

remove unnecessarly files

5794b70

Signed-off-by: kta-intel <[email protected]>

format fix, comparing datatype

34f7d8a

Signed-off-by: kta-intel <[email protected]>

noopurintel reviewed Nov 19, 2024

View reviewed changes

openfl-workspace/xgb_higgs/src/dataloader.py Show resolved Hide resolved

Merge branch 'securefederatedai:develop' into xgboost-fedbagging

837031b

teoparvanov merged commit 3c983ef into securefederatedai:develop Nov 20, 2024
29 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enable federated XGBoost using bootstrap aggregation in Task Runner #1151

Enable federated XGBoost using bootstrap aggregation in Task Runner #1151

kta-intel commented Nov 15, 2024 •

edited

Loading

MasterSkepticista Nov 18, 2024

kta-intel Nov 19, 2024

teoparvanov left a comment

kta-intel commented Nov 18, 2024

teoparvanov commented Nov 19, 2024 •

edited

Loading

kta-intel commented Nov 19, 2024

Enable federated XGBoost using bootstrap aggregation in Task Runner #1151

Enable federated XGBoost using bootstrap aggregation in Task Runner #1151

Conversation

kta-intel commented Nov 15, 2024 • edited Loading

MasterSkepticista Nov 18, 2024

Choose a reason for hiding this comment

kta-intel Nov 19, 2024

Choose a reason for hiding this comment

teoparvanov left a comment

Choose a reason for hiding this comment

kta-intel commented Nov 18, 2024

teoparvanov commented Nov 19, 2024 • edited Loading

kta-intel commented Nov 19, 2024

kta-intel commented Nov 15, 2024 •

edited

Loading

teoparvanov commented Nov 19, 2024 •

edited

Loading