Update tokenizer_config.json

by ybelkada - opened Jan 19, 2023

←

BigScience Workshop org Jan 19, 2023

No description provided.

BigScience Workshop org Jan 19, 2023

Muennighoff changed pull request status to merged Jan 19, 2023

BigScience Workshop org Jan 19, 2023

It's annoying that we're missing indent here ... Whoever is writing this file should probably have indent option set.

BigScience Workshop org Jan 19, 2023

+100 (and cc @Narsil )

BigScience Workshop org Jan 19, 2023

Just to be sure, I think tokenizer_config.json comes from transformers, and it's supposed to be fixed for quite a while: https://github.com/huggingface/transformers/blame/862888a35834527fed61beaf42373423ffdbd216/src/transformers/tokenization_utils_base.py#L2155 @ybelkada did you perhaps change this manually instead of going through transformers?

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment