-
Notifications
You must be signed in to change notification settings - Fork 582
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
微调过程中的收敛问题 #1265
Comments
loss突然变得很高,中间模型似乎训崩了。 |
感谢,我降低学习率解决了。看起来bge-m3-v2-rerank是一个已经收敛的模型,不能用太大的学习率,要不然会跳出LOSS极小值的区域。 |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
我在bge-m3-v2-ranker上增加我自己的数据微调模型。使用脚本提供的默认参数进行训练。
我的LOSS图开起来不太正常,LOSS除了最开始波动之外,后续训练过程中一直在1.3左右。
望解答
The text was updated successfully, but these errors were encountered: