Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

模型转hf的部分,没有把inv_freq写进pytorch_model.bin.index.json #9

Open
wzh1994 opened this issue Jul 14, 2023 · 1 comment

Comments

@wzh1994
Copy link

wzh1994 commented Jul 14, 2023

https://github.com/OpenLMLab/OpenChineseLLaMA/blob/main/tools/llama_colossalai.py#L1111C6-L1111C6
这里附近,需要加上对inv_freq的映射,否则转出的模型会出现如下错误:
image

@wzh1994
Copy link
Author

wzh1994 commented Jul 14, 2023

此外由于集群不稳定,经常save到一半崩掉,这里建议加上断点续存的能力,也是这个位置附近
image

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant