Question regarding the training of the llama2 version #337
-
Thank you for your work!
-
Hello! The training data and training strategy are exactly the same as in the vicuna version. One difference is that the old vicuna version used BLIP-2's Q-Former; in the llama2 version we removed it. The linear layer now directly maps the output of CLIP's vision encoder to the LLM's input.
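For readers unfamiliar with the design, here is a minimal sketch (not the repo's actual code) of what the reply describes: the Q-Former is gone, and a single linear layer projects the vision encoder's patch features directly into the LLM's input embedding space. The dimensions are illustrative assumptions (1408 for an EVA-CLIP ViT-g feature size, 4096 for a LLaMA-2-7B hidden size, 257 tokens = 256 patches + CLS).

```python
import torch
import torch.nn as nn

vision_dim = 1408   # assumed ViT-g feature dimension (illustrative)
llm_dim = 4096      # assumed LLaMA-2-7B hidden size (illustrative)

# The only trainable bridge in this sketch: one linear projection.
vision_to_llm = nn.Linear(vision_dim, llm_dim)

# Stand-in for the vision encoder's output for one image:
# (batch, num_patch_tokens, vision_dim)
image_features = torch.randn(1, 257, vision_dim)

# Each patch token becomes one "visual token" in the LLM's embedding space;
# these would be concatenated with text token embeddings before the LLM forward pass.
visual_tokens = vision_to_llm(image_features)   # (1, 257, llm_dim)
print(visual_tokens.shape)                      # torch.Size([1, 257, 4096])
```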
-
By the way, may I ask what enhancements you found in the llama2 version?