Commit 07157bd

fix: handle None test_ds in skipped messages count (#232)
Co-authored-by: Lancer <maruixiang6688@gmail.com>
1 parent ec6f0d9 commit 07157bd

1 file changed

Lines changed: 2 additions & 1 deletion

scripts/prepare_data.py

@@ -191,8 +191,9 @@ def process_and_save_ds(train_ds, test_ds, output_path, proc_fn, dataset_name):
             f.write(json.dumps(row) + "\n")
 
     if total_skipped_count > 0:
+        total_messages = len(train_ds) + (len(test_ds) if test_ds is not None else 0)
         print(
-            f"Skipped {total_skipped_count}/{len(train_ds)+len(test_ds)} messages for {dataset_name}"
+            f"Skipped {total_skipped_count}/{total_messages} messages for {dataset_name}"
         )
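
The change guards the total against `test_ds` being `None`, because `len(None)` raises `TypeError`. A minimal standalone sketch of the same counting logic follows. The helpers `count_total_messages` and `format_skipped` are hypothetical names used only for illustration (they are not part of `scripts/prepare_data.py`), and plain lists stand in for the datasets.

```python
from typing import Optional, Sized


def count_total_messages(train_ds: Sized, test_ds: Optional[Sized]) -> int:
    """Hypothetical helper: total rows across train and an optional test split."""
    # len(None) raises TypeError, so test_ds is only counted when it is present.
    return len(train_ds) + (len(test_ds) if test_ds is not None else 0)


def format_skipped(total_skipped_count: int, total_messages: int, dataset_name: str) -> str:
    """Hypothetical helper: build the same summary message the script prints."""
    return f"Skipped {total_skipped_count}/{total_messages} messages for {dataset_name}"


if __name__ == "__main__":
    train = ["a", "b", "c"]  # stand-in for a real train split
    # With no test split, the total falls back to the train count alone.
    print(format_skipped(1, count_total_messages(train, None), "demo"))
    # With a test split, both lengths are summed as before the fix.
    print(format_skipped(1, count_total_messages(train, ["d"]), "demo"))
```

Computing the total once in a named variable also keeps the f-string free of nested expressions, which is the shape the commit adopts.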