Skip to content

Commit 0d7c747

Browse files
committed
check to make sure huggingface tokenizer is used
1 parent b05c3b0 commit 0d7c747

1 file changed

Lines changed: 1 addition & 1 deletion

File tree

src/maxtext/input_pipeline/grain_data_processing.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -312,7 +312,7 @@ def sft_preprocessing_pipeline(
312312
tokenizer_model, pad_id = data_processing_utils.get_tokenizer_and_pad_id(config)
313313
base_tokenizer_model = tokenizer_model
314314

315-
tokenizer_model = tokenizer_model.tokenizer
315+
tokenizer_model = getattr(tokenizer_model, "tokenizer", tokenizer_model)
316316

317317
data_processing_utils.validate_and_configure_sft_columns(
318318
data_columns, tokenizer_model, getattr(config, "chat_template", None)

0 commit comments

Comments
 (0)