Issues: mlc-ai/mlc-llm
Issues list
Unable to serve Mistral-7B-Instruct-v0.3
bug
Confirmed bugs
#2447
opened May 28, 2024 by
swamysrivathsan
[Doc] Python API KV/memory reset details absent
documentation
Improvements or additions to documentation
#2426
opened May 26, 2024 by
federicoparra
[Feature Request] phi-3 small released -> performs two times better than Phi-3 mini
feature request
New feature or request
#2420
opened May 26, 2024 by
sebastienbo
Phi-2 q4f16_1 runs faster when compiled without tvm.relax.transform.FuseOps() and tvm.relax.transform.FuseTIR() transformations
bug
Confirmed bugs
#2405
opened May 24, 2024 by
MMuzzammil1
Fail to build tvm-unity from source on orin[Bug]
bug
Confirmed bugs
#2389
opened May 23, 2024 by
Louym
[Bug] java.lang.NullPointerException: Attempt to invoke virtual method 'org.apache.tvm.TVMValue org.apache.tvm.Function.invoke()' on a null object reference
bug
Confirmed bugs
#2366
opened May 21, 2024 by
View999888
[Question] Single forward pass through ChatModule
question
Question about the usage
#2354
opened May 17, 2024 by
caenopy
[Feature Request] Implement AttentionStore
feature request
New feature or request
#2353
opened May 16, 2024 by
kripper
[Question] mlc_llm serve fails with --speculative-mode, does it require certain hardware?
question
Question about the usage
#2350
opened May 16, 2024 by
0xDEADFED5
[Question] Can MLC quantize multimodal models?
question
Question about the usage
#2349
opened May 16, 2024 by
LJ-Hao
[Question] Deployment of Pruned Models
question
Question about the usage
#2338
opened May 14, 2024 by
qianjyM
[Question] Parallel computations using multiple streams?
question
Question about the usage
#2332
opened May 13, 2024 by
taegeonum
[Bug] InternalError: Check failed: (res == VK_SUCCESS) is false: Vulkan Error, code=-4: VK_ERROR_DEVICE_LOST
bug
Confirmed bugs
#2328
opened May 11, 2024 by
aaaaaad333
[Tracking] Create a CPU Compatible PagedKVCache
status: tracking
Tracking work in progress
#2325
opened May 11, 2024 by
tqchen
1 task
[Tracking] Sentence Embedding Model
status: tracking
Tracking work in progress
#2324
opened May 11, 2024 by
tqchen
1 task
[Bug] mlc_llm package failed once, and I can't run it again
bug
Confirmed bugs
#2323
opened May 11, 2024 by
CallMeTkt
[Feature Request] Medusa support
feature request
New feature or request
#2319
opened May 10, 2024 by
EmilioZhao
[Bug] Support multiple "system" messages in REST API
bug
Confirmed bugs
#2311
opened May 10, 2024 by
bayley
[Model Request] can we get Aryanne/Calypso-3B-alpha-v2-gguf
new-models
#2293
opened May 7, 2024 by
Louis654