Comments (4)
cc @itazap that would indeed be a good addition! More and more people pre-allocate some tokens, and we don't have a `replace_token` method.
from transformers.
PS: you can already replace directly in the vocab and the added_vocab (since these tokens are part of both).
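Replacing a pre-allocated token directly in a saved `tokenizer.json` can be sketched like this. This is a minimal sketch, not transformers API: the `replace_token` helper, the file path, and the token names are all assumptions; it only edits the model vocab and the `added_tokens` list in place.

```python
import json

# Hypothetical helper (an assumption, not part of transformers):
# rename a pre-allocated token in a saved tokenizer.json, both in
# the model vocab and in the added_tokens list, keeping its id.
def replace_token(tokenizer_json_path, old_token, new_token):
    with open(tokenizer_json_path, encoding="utf-8") as f:
        tok = json.load(f)

    # Swap the entry in the model vocab; the id stays the same.
    vocab = tok["model"]["vocab"]
    vocab[new_token] = vocab.pop(old_token)

    # Mirror the change in added_tokens, since pre-allocated tokens
    # appear in both places.
    for entry in tok.get("added_tokens", []):
        if entry["content"] == old_token:
            entry["content"] = new_token

    with open(tokenizer_json_path, "w", encoding="utf-8") as f:
        json.dump(tok, f, ensure_ascii=False, indent=2)
```

After rewriting the file, the tokenizer can be reloaded with `AutoTokenizer.from_pretrained` pointed at the containing directory.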
Hey @ArthurZucker,
I tried replacing a token in the vocab (not the added_tokens) in the tokenizer.json file. But when I try to load the tokenizer back up with `new_tokenizer = AutoTokenizer.from_pretrained('path/to/tokenizer')`, I get the following error:
"Exception: data did not match any variant of untagged enum ModelWrapper at line 356367 column 3"
Do you know what the problem might be?
You can ignore this, sorry. I found the issue: if you change the vocab in any way, you also need to update the merges accordingly.
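For a BPE tokenizer, the fix described above can be sketched as follows: rename the vocab entry and rewrite every merge rule that mentions the old token. This is a hedged sketch assuming the common `tokenizer.json` layout where merges are stored as `"left right"` strings; `rename_with_merges` is a hypothetical helper, not transformers API.

```python
import json

# Illustrative sketch (names and layout are assumptions): when a
# BPE vocab entry is renamed, merge rules referencing it must be
# rewritten too, or deserialization fails with the
# "untagged enum ModelWrapper" error seen above.
def rename_with_merges(path, old_token, new_token):
    with open(path, encoding="utf-8") as f:
        tok = json.load(f)

    # Rename the vocab entry, keeping its id.
    vocab = tok["model"]["vocab"]
    vocab[new_token] = vocab.pop(old_token)

    # Each merge is a "left right" pair; rename any occurrence of
    # the old token on either side so merges stay consistent.
    tok["model"]["merges"] = [
        " ".join(new_token if part == old_token else part
                 for part in merge.split(" "))
        for merge in tok["model"]["merges"]
    ]

    with open(path, "w", encoding="utf-8") as f:
        json.dump(tok, f, ensure_ascii=False, indent=2)
```

Note that newer tokenizer.json files may store merges as two-element lists rather than space-joined strings, in which case the rewrite step would need adjusting.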
Related Issues (20)
- Inconsistent special_token addition in EncoderDecoderModel forward pass
- Cannot find the best model after training
- MPS support broken for T5 models
- Pass `HFQuantizer` to `from_pretrained` kwargs
- [i18n-<languageCode>] Translating docs to <languageName>
- NumPy 2.0 support
- Can I use "attn_implementation" in model config file
- Encountering an error while loading a model using state_dict and quantization simultaneously
- Fix 'Can't infer missing attention mask on `mps` device'
- might be a waste of resources
- Tensors' device passed to a model is not correct when ACCELERATE_TORCH_DEVICE is privateuseone
- Support sdpa for RoBERTa and XLM-RoBERTa models
- Converting gguf fp16 & bf16 to hf is not supported.
- Dead code, `cache_kwargs`
- The conversion of the llama3 model back from gguf seems weird.
- Train on logits instead of one hot vectors
- 'tf_keras' has no attribute 'activations'
- Bug in whisper word-level timestamps (`tokenizer._decode_asr`)
- RobertaForClassification throws an error because of dimension mismatch
- Fix Bug: Gemma2 the `past_key_value.update()` function has added a new parameter "sliding_window" to support the `_sliding_update` function.