You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello,
I noticed that the llama3 tokenizer loaded with hf transformers.AutoTokenizer only add a token when call the encode function. May I ask during llama3 pretraining, which behavior is taken? only add token or add both and tokens for each training document.
The text was updated successfully, but these errors were encountered:
Hello,
I noticed that the llama3 tokenizer loaded with hf transformers.AutoTokenizer only add a token when call the encode function. May I ask during llama3 pretraining, which behavior is taken? only add token or add both and tokens for each training document.
The text was updated successfully, but these errors were encountered: