Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

training time #3

Open
RyanG41 opened this issue Sep 23, 2024 · 1 comment
Open

training time #3

RyanG41 opened this issue Sep 23, 2024 · 1 comment

Comments

@RyanG41
Copy link

RyanG41 commented Sep 23, 2024

Hi,

May I ask about the training time for retrieval task?

I noticed that the code is using 6 GPUs, while I am currently using 4*RTX4090 and it went fast on the 30 epochs of base model but seems very slow on the following 300 epochs. May I know about the total training time on your side?

Thanks a lot.

Best Regard

@auniquesun
Copy link
Owner

@RyanG41 I think it is normal. The following 300 epochs train a much large Transformer model than previous 30 epochs. I think it takes about 20 hours on 6 2080Ti GPUs if I remember correctly.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants