This repository has been archived by the owner on Aug 30, 2024. It is now read-only.

[Model Enabling] Support ChatGLM3 #182

Merged
merged 15 commits into main on Mar 21, 2024

Conversation

@Zhenzhong1 (Contributor) commented on Mar 20, 2024

Type of Change

New Feature.

Description

Support ChatGLM3 in Neural Speed.

Expected Behavior & Potential Risk

N/A

How has this PR been tested?

numactl -m 0 -C 0-55 python scripts/run.py /home/zhenzhong/model/chatglm3-6b/ -p "你好" --model_type=chatglm3
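For reference, the following is a rough Python-API equivalent of the command above, following the usage pattern shown in the Neural Speed README. The dtype arguments and generation length are illustrative, and exact argument names may vary across versions, so treat it as a sketch rather than the exact test that was run.

```python
# Sketch only: mirrors the run.py smoke test through the Neural Speed Python API
# (README-style usage); argument names and defaults may differ across versions.
from transformers import AutoTokenizer, TextStreamer
from neural_speed import Model

model_path = "/home/zhenzhong/model/chatglm3-6b/"  # local ChatGLM3 checkpoint
prompt = "你好"

tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)
inputs = tokenizer(prompt, return_tensors="pt").input_ids
streamer = TextStreamer(tokenizer)

model = Model()
# Converts and quantizes the checkpoint, then loads it for inference; the model
# type is normally inferred from the checkpoint config.
model.init(model_path, weight_dtype="int4", compute_dtype="int8")
outputs = model.generate(inputs, streamer=streamer, max_new_tokens=128)
```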

Dependency Change?

N/A

@Zhenzhong1 marked this pull request as ready for review on March 21, 2024 at 03:22
@a32543254 (Contributor) left a comment

LGTM

@a32543254 (Contributor)

Could you also add an extension test and post the benchmark data here?

@a32543254 (Contributor)

> Could you also add an extension test and post the benchmark data here?

No need; it seems ChatGLM3 shares the same structure as ChatGLM2, so we can treat them as one.

@Zhenzhong1 (Contributor, Author) commented on Mar 21, 2024

> Could you also add an extension test and post the benchmark data here?
>
> No need; it seems ChatGLM3 shares the same structure as ChatGLM2, so we can treat them as one.

Yes, that's true. I have added the extension test anyway; it also lets us check the convert / quantize / inference pipeline for ChatGLM3.

https://github.com/intel-innersource/frameworks.ai.lpot.lpot-validation/pull/623/files
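The linked extension test lives in an internal repository, so it is not reproduced here. As an illustrative sketch only, a minimal smoke test along these lines could drive the same convert / quantize / inference pipeline by wrapping the run.py command from the test section above; the test name, model path, and timeout are placeholders.

```python
# Hypothetical smoke test (name, path, and timeout are placeholders); it wraps
# the documented run.py command, which performs convert, quantize, and inference.
import subprocess
import sys

def test_chatglm3_pipeline():
    cmd = [
        sys.executable, "scripts/run.py",
        "/home/zhenzhong/model/chatglm3-6b/",
        "-p", "你好",
        "--model_type=chatglm3",
    ]
    result = subprocess.run(cmd, capture_output=True, text=True, timeout=1800)
    # The pipeline is considered healthy if the end-to-end run exits cleanly.
    assert result.returncode == 0, result.stderr

if __name__ == "__main__":
    test_chatglm3_pipeline()
```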

@VincyZhang merged commit 94e74d7 into main on Mar 21, 2024
10 of 11 checks passed
4 participants