-
Notifications
You must be signed in to change notification settings - Fork 430
Issues: InternLM/lmdeploy
[Benchmark] benchmarks on different cuda architecture with mo...
#815
opened Dec 11, 2023 by
lvhan028
Open
9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[Bug] Does PytorchEngine Visual Model Support Prefix Caching?
#2789
opened Nov 21, 2024 by
OftenDream
3 tasks
[Bug] Llama-3.2-1B-Instruct and InternVL2-1B does not supported kvin4, is that expected?
#2786
opened Nov 21, 2024 by
zhulinJulia24
3 tasks
[Bug] Response of converted Qwen2-57B-A14B-Instruct-GPTQ-Int4 returns garbled characters
#2785
opened Nov 21, 2024 by
zhulinJulia24
3 tasks
[Bug] SystemExit: 1 asyncio.exceptions.TimeoutError
#2782
opened Nov 20, 2024 by
LIUKAI0815
2 of 3 tasks
[Bug] Qwen2.5无法跑通tools call(官方案例代码)
awaiting response
#2775
opened Nov 20, 2024 by
turkeymz
3 tasks done
[Bug] The quantization process of Qwen/Qwen2-VL-7B-Instruct is getting killed without throwing error.
#2770
opened Nov 19, 2024 by
vjaideep08
3 tasks done
[Bug] 昇腾910B通过lmdeploy镜像,使用qwen2-vl-7b模型,推理过程报错: call aclnnBatchMatMul failed
#2769
opened Nov 18, 2024 by
fusmile0101
1 of 3 tasks
How can I specify the rope scaling type when starting the API server?
#2768
opened Nov 18, 2024 by
snachx
[Bug] The script "profile_generation.py" went haywire and crashed
#2760
opened Nov 15, 2024 by
yuchiwang
3 tasks done
Why do video frames in lmdeploy need to be converted into base64 encoding?
#2759
opened Nov 15, 2024 by
AmazDeng
[Bug] Cannot install torch-npu==2.3.1, torch==2.3.1 and torchvision==0.18.1 because these package versions have conflicting dependencies.
#2745
opened Nov 13, 2024 by
jiabao-wang
3 tasks
[Feature] The
cache-max-entry-count
working off percentages makes it difficult to setup multiple servers
#2732
opened Nov 9, 2024 by
mrakgr
[Bug] Accuracy of W8A8 is big different from that of the original model
#2730
opened Nov 9, 2024 by
HelloCard
3 tasks done
[Bug] Deployment of Llama3.1-70b getting struck
#2724
opened Nov 7, 2024 by
pulkitmehtaworkmetacube
3 tasks done
Previous Next
ProTip!
Follow long discussions with comments:>50.