Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

如何获取comparison data? #7

Open
yjh0410 opened this issue Nov 10, 2023 · 0 comments
Open

如何获取comparison data? #7

yjh0410 opened this issue Nov 10, 2023 · 0 comments

Comments

@yjh0410
Copy link

yjh0410 commented Nov 10, 2023

您好,很感谢作者团队公布了UltraFeedback数据集,我目前在尝试使用这个数据集去训练Reward model,但遇到了一个问题。

数据集共包含64K的指令,256K的response,依照论文的设定,从这些数据集能生成340K的comparisons,请问这个是怎么生成的?我没有在项目代码中找到这一功能。如果项目代码里有的话,是在下面的路经中吗?

https://github.com/OpenBMB/UltraFeedback/tree/main/src/comparison_data_generation

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant