Skip to content
This repository has been archived by the owner on Aug 30, 2024. It is now read-only.

Commit

Permalink
support falcon40b
Browse files Browse the repository at this point in the history
  • Loading branch information
Zhenzhong1 committed Mar 13, 2024
1 parent 1c018ec commit 8fe6c74
Showing 1 changed file with 1 addition and 4 deletions.
5 changes: 1 addition & 4 deletions neural_speed/convert/convert_quantized_falcon.py
Original file line number Diff line number Diff line change
Expand Up @@ -54,10 +54,7 @@ def main(args_in: Optional[List[str]] = None) -> None:
ftype = 1

# 1. write hparams
#
# "n_head": 71,
# "n_layer": 32,
n_head_kv = hparams.get("num_kv_heads", 1)
n_head_kv = hparams.get("n_head_kv", 1)
n_head = hparams["n_head"]
head_dim = hparams["hidden_size"] // n_head

Expand Down

0 comments on commit 8fe6c74

Please sign in to comment.