You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+8-4Lines changed: 8 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -28,7 +28,9 @@ This is a **binary classification task** at the client level.
28
28
You are **not** given:
29
29
- Raw client data
30
30
- Client behavior labels
31
-
- Descriptions of malicious strategies
31
+
- Descriptions of malicious strategies
32
+
- Intermediate client updates
33
+
32
34
33
35
---
34
36
@@ -38,6 +40,8 @@ For each client, predict whether it is:
38
40
-`honest`
39
41
-`malicious`
40
42
43
+
Clients labeled as malicious include any that deviated from the protocol, such as by poisoning updates, free-riding, or otherwise behaving non-compliantly.
44
+
41
45
---
42
46
43
47
## Submission Format
@@ -72,7 +76,7 @@ Participants are ranked by accuracy, with Macro F1-score used to break ties.
72
76
73
77
## Rules and Constraints
74
78
75
-
- You may **not retrain** the provided model.
79
+
--You may not retrain, fine-tune, or modify the provided model.
76
80
- Only the provided artifacts may be used.
77
81
- No access to raw client data is allowed.
78
82
- All computation must be performed offline.
@@ -82,7 +86,7 @@ Participants are ranked by accuracy, with Macro F1-score used to break ties.
82
86
## Ground Truth
83
87
84
88
Ground truth labels are defined by how each client was implemented in the federated learning simulation.
85
-
Each client is unambiguously labeled as either **honest** or **malicious**.
89
+
Each client is unambiguously labeled as either honest or malicious based on its behavior during training.
86
90
87
91
---
88
92
@@ -97,7 +101,7 @@ Each client is unambiguously labeled as either **honest** or **malicious**.
97
101
98
102
## Goal
99
103
100
-
This task reflects a realistic and high-impact security challenge: **detecting malicious behavior in large-scale federated learning systems after training has completed**.
104
+
This task reflects a realistic and high-impact security challenge: **auditing federated learning systems to detect malicious or non-compliant clients after training has completed, without access to raw data or client updates.**.
0 commit comments