Skip to content
This repository has been archived by the owner on Aug 30, 2024. It is now read-only.

Commit

Permalink
Update chatglm.h
Browse files Browse the repository at this point in the history
  • Loading branch information
intellinjun authored Mar 7, 2024
1 parent 77da4e2 commit a70ffc1
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion neural_speed/models/chatglm/chatglm.h
Original file line number Diff line number Diff line change
Expand Up @@ -26,7 +26,7 @@ enum chatglm_model {
static const model_scratch chatglm_mem_req(int n_layers) {
switch (n_layers) {
case 28:
return {2048ull * MB, 2048ull * MB, 4096ull * MB};
return {4096ull * MB, 4096ull * MB, 8192ull * MB};
// TODO(hengyu): add more variants besides 6B
default:
MODEL_ASSERT(false);
Expand Down

0 comments on commit a70ffc1

Please sign in to comment.