Skip to content

Commit b901202

Browse files
committed
check to make sure huggingface tokenizer is used
1 parent 731ea61 commit b901202

1 file changed

Lines changed: 1 addition & 1 deletion

File tree

src/maxtext/input_pipeline/grain_data_processing.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -295,7 +295,7 @@ def sft_preprocessing_pipeline(
295295
tokenizer_model, pad_id = data_processing_utils.get_tokenizer_and_pad_id(config)
296296
base_tokenizer_model = tokenizer_model
297297

298-
tokenizer_model = tokenizer_model.tokenizer
298+
tokenizer_model = getattr(tokenizer_model, "tokenizer", tokenizer_model)
299299

300300
data_processing_utils.validate_and_configure_sft_columns(
301301
data_columns, tokenizer_model, getattr(config, "chat_template", None)

0 commit comments

Comments
 (0)