LLM tokenizer padding side question #1236

Open
Kang9779 opened this issue Nov 17, 2024 · 1 comment

Comments

@Kang9779

For LLMs with a decoder-only architecture, should tokenizer.padding_side always be set to left?

@hanhainebula
Collaborator

Hi @Kang9779. You can also set it to right; the code here will still pool correctly. That said, current LLM-based embedding models generally set it to left.
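
To illustrate why either padding side can work, here is a minimal, framework-free sketch. It assumes last-token pooling (taking the hidden state of the final non-padding token), which is an assumption about the pooling strategy and not a copy of this repository's code. The function name `last_token_pool` and the toy data are hypothetical. With left padding the last real token always sits at the final position; with right padding it has to be located through the attention mask.

```python
from typing import List, Sequence

Vector = List[float]


def last_token_pool(
    hidden_states: Sequence[Sequence[Vector]],
    attention_mask: Sequence[Sequence[int]],
) -> List[Vector]:
    """Return, per sequence, the hidden state of its last non-padding token.

    hidden_states: [batch][seq_len][hidden_size]
    attention_mask: [batch][seq_len], 1 for real tokens and 0 for padding.
    (Hypothetical helper for illustration only.)
    """
    pooled = []
    for states, mask in zip(hidden_states, attention_mask):
        if len(states) != len(mask):
            raise ValueError("hidden states and mask must have the same length")
        real_positions = [i for i, m in enumerate(mask) if m == 1]
        if not real_positions:
            raise ValueError("sequence contains only padding")
        # Left padding: this index is always seq_len - 1.
        # Right padding: this index is (number of real tokens) - 1.
        pooled.append(list(states[real_positions[-1]]))
    return pooled


# Toy batch: two sequences with 3 and 2 real tokens, hidden size 1.
left_hidden = [[[1.0], [2.0], [3.0]], [[0.0], [4.0], [5.0]]]
left_mask = [[1, 1, 1], [0, 1, 1]]

right_hidden = [[[1.0], [2.0], [3.0]], [[4.0], [5.0], [0.0]]]
right_mask = [[1, 1, 1], [1, 1, 0]]

left_result = last_token_pool(left_hidden, left_mask)
right_result = last_token_pool(right_hidden, right_mask)
```

In this sketch both calls select the same vectors, because the mask is what identifies the last real token. Left padding simply makes that lookup trivial, since the last position is always a real token.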
