InternLM / lmdeploy Public

Notifications You must be signed in to change notification settings
Fork 256
Star 2.9k

Code
Issues 140
Pull requests 27
Discussions
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security
Insights

Issues: InternLM/lmdeploy

[Benchmark] benchmarks on different cuda architecture with mo...

#815 opened Dec 11, 2023 by lvhan028

Open 6

报名参加书生·浦语大模型实战营——两周带你玩转微调部署评测全链路

#890 opened Dec 26, 2023 by vansin

Open

Labels 32 Milestones 0

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

140 Open 769 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Issues list

[Bug] 为什么pipeline输出只有一个1个token？

#1766 opened Jun 12, 2024 by Axiaozhu1

2 tasks

[Feature] 请问支持ChatGLM3吗

#1764 opened Jun 12, 2024 by Franklin-L

[Feature] 多模态的模型支持在线serving吗？ awaiting response

#1762 opened Jun 12, 2024 by CSEEduanyu

[Feature] 使用已经构建好的input使用lmdeploy来进行推理 awaiting response

#1760 opened Jun 12, 2024 by KooSung

[Bug] ImageEncoder INFO 日志耗时统计不准确

#1759 opened Jun 12, 2024 by DefTruth

2 tasks

[Bug] Turbomind 后端显存占用翻倍

#1758 opened Jun 11, 2024 by QwertyJack

2 tasks done

[Bug] 判断条件检查

#1757 opened Jun 11, 2024 by seetimee

1 of 2 tasks

[Bug] Key Error loading OpenGVLab/Mini-InternVL-Chat-4B-V1-5

#1756 opened Jun 11, 2024 by HaoLiuHust

2 tasks done

[Bug] tp=4 tp=8 no response

#1755 opened Jun 11, 2024 by zeroleavebaoyang

2 tasks done

[Bug] Official image doesn't work for 4090 on CUDA 12.3 (but works for all other CUDA versions, and works for 12.3 on other GPU types)

#1750 opened Jun 11, 2024 by josephrocca

2 tasks done

[Feature] Low priority: Allow specifying HuggingFace model/repo name in lmdeploy convert

#1749 opened Jun 10, 2024 by josephrocca

[Feature] Support for compact Vision-Language models

#1748 opened Jun 10, 2024 by vody-am

[Bug] xcomposer 4khd lora weight error in lmdeploy

#1747 opened Jun 8, 2024 by ztfmars

2 tasks done

[Feature] min_p sampling parameter

#1745 opened Jun 8, 2024 by josephrocca

[Bug] Many concurrent requests with --enable-prefix-caching AND --quant-policy 8 crashes with: CUDA runtime error: an illegal memory access was encountered /opt/lmdeploy/src/turbomind/utils/allocator.h:231

#1744 opened Jun 8, 2024 by josephrocca

2 tasks done

[Bug] Space is incorrectly removed from start of generated text for /v1/completion endpoint

#1743 opened Jun 8, 2024 by josephrocca

2 tasks done

logits输出有问题[Bug]

#1742 opened Jun 8, 2024 by GZL11

2 tasks done

[Docs] Guidance on setting num_tokens_per_iter and max_prefill_iters to optimal values

#1740 opened Jun 8, 2024 by josephrocca

[Bug] detokenize_incrementally: OverflowError: out of range integral type conversion attempted

#1739 opened Jun 7, 2024 by josephrocca

2 tasks done

[Feature] Speculative Decoding

#1738 opened Jun 7, 2024 by josephrocca

[Bug] 量化模型时无输出

#1735 opened Jun 7, 2024 by NB-Group

2 tasks done

[Feature Request] OpenAI-compatible stop param

#1731 opened Jun 7, 2024 by josephrocca

[Bug] 部署cogvlm2运行时，接受的多个并发之间存在干扰，后面的请求使用前面请求传的图像 bug

Something isn't working

#1730 opened Jun 7, 2024 by LRHstudy

1 of 2 tasks

High GPU memory for running InternVL-Chat-V1-5-AWQ awaiting response

#1728 opened Jun 7, 2024 by tairen99

[Feature] Support for THUDM/glm-4v-9b planned feature

#1726 opened Jun 6, 2024 by Iven2132

Previous 1 2 3 4 5 6 Next

Previous Next

ProTip! Follow long discussions with comments:>50.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly