
Question about the OLMo2 Stage 2 training procedures: was the optimizer state from Stage 1 used during the training of Stage 2? #758

Open
Taoer1996 opened this issue Nov 29, 2024 · 2 comments
Labels
type/question An issue that's a question

Comments

@Taoer1996

❓ The question

Thank you for fully open-sourcing OLMo 2; it's really amazing research work!

We have a small question regarding the Stage 2 training process: was the optimizer state from Stage 1 used during the training of Stage 2? If so, will the full checkpoint, including the optimizer state, be open-sourced?

@Taoer1996 Taoer1996 added the type/question An issue that's a question label Nov 29, 2024
@aman-17
Member

aman-17 commented Dec 3, 2024

Hey @Taoer1996, we loaded the final checkpoint from Stage 1 for Stage 2, and the optimizer state wasn't reset. As for open-sourcing the full checkpoints with the optimizer state, we are planning to release them; I'll keep you posted if there are any updates on this.
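For readers unfamiliar with what "the optimizer state wasn't reset" means in practice, here is a minimal PyTorch sketch (not the actual OLMo training code, and the model/filenames are hypothetical): resuming Stage 2 from a full checkpoint means restoring both the model weights and the optimizer's state dict (e.g., Adam's moment estimates and step counts), rather than constructing a fresh optimizer with zeroed state.

```python
import torch

# Hypothetical stand-in for the Stage-1 model and optimizer.
model = torch.nn.Linear(4, 4)
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)

# A single step stands in for Stage-1 training, so the optimizer
# accumulates some state (Adam's exp_avg / exp_avg_sq moments).
loss = model(torch.randn(2, 4)).sum()
loss.backward()
optimizer.step()

# Save a *full* checkpoint: weights plus optimizer state.
checkpoint = {
    "model": model.state_dict(),
    "optimizer": optimizer.state_dict(),
}
torch.save(checkpoint, "stage1_final.pt")

# Stage 2: load both, so optimization resumes with the Stage-1
# moments instead of a freshly initialized (reset) optimizer.
model2 = torch.nn.Linear(4, 4)
optimizer2 = torch.optim.AdamW(model2.parameters(), lr=1e-4)
ckpt = torch.load("stage1_final.pt")
model2.load_state_dict(ckpt["model"])
optimizer2.load_state_dict(ckpt["optimizer"])
```

If only the model weights were released, one could still continue training, but the optimizer would warm up from empty state, which is what the question above is getting at.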

@Taoer1996
Author

> Hey @Taoer1996, we loaded the final checkpoint from Stage-1 for Stage-2, and the optimizer state wasn't reset. As for open-sourcing the full checkpoints with the optimizer state, we are planning to release them, I'll keep you posted if there are any updates on this.

Thanks for clarifying that! Really excited to see what comes next.

2 participants