Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[CPU]Check runtime_options from IR model #27778

Open
wants to merge 3 commits into
base: releases/2024/5
Choose a base branch
from

Conversation

zhangYiIntel
Copy link
Contributor

Details:

  • Check runtim_options from IR model
  • Set KV_CACHE_PRECISION & DYNAMIC_QUANTIZATION_GROUP_SIZE from runtim_options of IR model
  • Example IR model with runtim_options
        <rt_info>
                <runtime_options>
                        <KV_CACHE_PRECISION value="f16" />
                </runtime_options>
        </rt_info>

Tickets:

@zhangYiIntel zhangYiIntel requested review from a team as code owners November 28, 2024 02:12
@github-actions github-actions bot added the category: CPU OpenVINO CPU plugin label Nov 28, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
category: CPU OpenVINO CPU plugin
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant