Skip to content

Commit 930d812

Browse files
committed
refined version
1 parent 644739d commit 930d812

3 files changed

Lines changed: 12 additions & 6 deletions

File tree

README.md

Lines changed: 8 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -28,7 +28,9 @@ This is a **binary classification task** at the client level.
2828
You are **not** given:
2929
- Raw client data
3030
- Client behavior labels
31-
- Descriptions of malicious strategies
31+
- Descriptions of malicious strategies
32+
- Intermediate client updates
33+
3234

3335
---
3436

@@ -38,6 +40,8 @@ For each client, predict whether it is:
3840
- `honest`
3941
- `malicious`
4042

43+
Clients labeled as malicious include any that deviated from the protocol, such as by poisoning updates, free-riding, or otherwise behaving non-compliantly.
44+
4145
---
4246

4347
## Submission Format
@@ -72,7 +76,7 @@ Participants are ranked by accuracy, with Macro F1-score used to break ties.
7276

7377
## Rules and Constraints
7478

75-
- You may **not retrain** the provided model.
79+
- - You may not retrain, fine-tune, or modify the provided model.
7680
- Only the provided artifacts may be used.
7781
- No access to raw client data is allowed.
7882
- All computation must be performed offline.
@@ -82,7 +86,7 @@ Participants are ranked by accuracy, with Macro F1-score used to break ties.
8286
## Ground Truth
8387

8488
Ground truth labels are defined by how each client was implemented in the federated learning simulation.
85-
Each client is unambiguously labeled as either **honest** or **malicious**.
89+
Each client is unambiguously labeled as either honest or malicious based on its behavior during training.
8690

8791
---
8892

@@ -97,7 +101,7 @@ Each client is unambiguously labeled as either **honest** or **malicious**.
97101

98102
## Goal
99103

100-
This task reflects a realistic and high-impact security challenge: **detecting malicious behavior in large-scale federated learning systems after training has completed**.
104+
This task reflects a realistic and high-impact security challenge: **auditing federated learning systems to detect malicious or non-compliant clients after training has completed, without access to raw data or client updates.**.
101105

102106

103107

baseline.py

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,3 +1,5 @@
1+
#FYI: This baseline ignores all the model information and predicts all the clients as honest.
2+
13
import json
24
import csv
35
import torch

metadata.json

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -18,9 +18,9 @@
1818
},
1919

2020
"malicious_behavior": {
21-
"type": "label_flipping",
22-
"description": "Malicious clients replace true labels with random labels during local training"
21+
"description": "Some clients deviated from the protocol in unknown ways"
2322
},
23+
2424

2525
"task_definition": {
2626
"task_type": "binary_classification",

0 commit comments

Comments
 (0)