Replies: 1 comment 2 replies
-
|
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I'm following the t5 notebook and I have two questions regarding the warmup stage.
Which data should be passed to the t5 model during warmup? One sample? One batch? All the data?
The example says "IRL, encoder and decoder should be warmed each on their own" - what does that mean and how it should be done?
Thanks!
Beta Was this translation helpful? Give feedback.
All reactions