Support for Qwen2 (Llama2 based) #101
base: release-2.20
Does this work for Qwen2.5? I tried Code to reproduce:
Errors:
Yes, I remember testing 2.5 as well. Does the original Qwen2 config work for you? What TP degree did you use?
My TP degree in the test code is 1. You can take a look at my code above for the full configuration.
OK, I figured out the configuration error for
For A little weird to me... Is there an explanation for this configuration?
Adding a Qwen2 model module that has been tested with Qwen2-7B.
The Qwen2 module is based on the Llama2 module and differs in the following points:
- `bias=True` in the QKV projection
- `load_weights` function of the model

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.
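To illustrate why the two differences above go together, here is a minimal sketch (not the code from this PR). It assumes the parameter naming used by Hugging Face checkpoints (`model.layers.N.self_attn.q_proj.weight` and so on); the helper `attention_param_names` is hypothetical and exists only for this example. With bias enabled on the Q, K and V projections, a Qwen2 checkpoint carries three extra tensors per layer that a Llama2-style loader would not expect, which is presumably why `load_weights` had to change.

```python
def attention_param_names(layer_idx, qkv_bias):
    """Hypothetical helper: list the attention tensors expected for one layer.

    Naming follows the Hugging Face checkpoint convention (an assumption here,
    not taken from the PR itself).
    """
    prefix = f"model.layers.{layer_idx}.self_attn"
    # Both architectures have weight matrices for q, k, v and the output projection.
    names = [f"{prefix}.{proj}.weight" for proj in ("q_proj", "k_proj", "v_proj", "o_proj")]
    if qkv_bias:
        # Qwen2 adds a bias to q, k and v only; the output projection has none.
        names += [f"{prefix}.{proj}.bias" for proj in ("q_proj", "k_proj", "v_proj")]
    return names


llama2_names = attention_param_names(0, qkv_bias=False)
qwen2_names = attention_param_names(0, qkv_bias=True)

# Tensors a Llama2-style loader would leave unhandled when given Qwen2 weights.
extra = sorted(set(qwen2_names) - set(llama2_names))
for name in extra:
    print(name)
```

Running the sketch prints the three bias tensor names for layer 0, which are the entries the weight loader needs to map in addition to the Llama2 ones.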