I ran the Docker image ghcr.io/collabora/whisperbot-base:latest and started the server, but when I sent a request from the client I got:

client:
[INFO]: * recording
[INFO]: Waiting for server ready ...
[INFO]: Opened connection
Message from Server: TensorRT-LLM not supported on Server yet. Reverting to available backend: 'faster_whisper'
[INFO]: Websocket connection closed: 1000:
server:
[03/04/2024-12:11:15] TensorRT-LLM not supported: [TensorRT-LLM][ERROR] CUDA runtime error in cub::DeviceSegmentedRadixSort::SortPairsDescending(nullptr, cubTempStorageSize, logProbs, (T*) nullptr, idVals, (int*) nullptr, vocabSize * batchSize, batchSize, beginOffsetBuf, offsetBuf + 1, 0, sizeof(T) * 8, stream): no kernel image is available for execution on the device (/root/TensorRT-LLM/cpp/tensorrt_llm/kernels/samplingTopPKernels.cu:322)
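For context, a "no kernel image is available for execution on the device" error typically means the TensorRT-LLM binaries in the image were compiled for a different GPU architecture (SM version) than the GPU they are running on. A quick way to check which architecture the local GPU reports, sketched in Python (the `cuda_arch_from_compute_cap` helper and the no-dot value format are illustrative assumptions, not part of the project):

```python
import subprocess

def cuda_arch_from_compute_cap(compute_cap: str) -> str:
    """Turn a compute capability like '8.9' into the SM number '89'.

    Hypothetical helper: the exact value format the TensorRT-LLM build
    expects is an assumption; check the project's build docs.
    """
    major, minor = compute_cap.strip().split(".")
    return f"{major}{minor}"

def local_gpu_arch() -> str:
    """Query the first GPU's compute capability via nvidia-smi."""
    out = subprocess.run(
        ["nvidia-smi", "--query-gpu=compute_cap", "--format=csv,noheader"],
        capture_output=True, text=True, check=True,
    ).stdout
    return cuda_arch_from_compute_cap(out.splitlines()[0])
```

An RTX 4090 reports compute capability 8.9 (SM 89); a GPU reporting anything else would hit this error with binaries built only for the 4090.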
@Rodenhhh Yeah, the Docker image is only supposed to work on a 4090; unfortunately we missed that part and it isn't mentioned anywhere. Sorry for the trouble.
As for a solution, stay tuned: we will push a docker-compose setup to make the TensorRT-LLM setup straightforward.
Thanks
@Rodenhhh You can test whether the docker compose setup builds and works as expected. Just make sure to pass the right CUDA_ARCH to docker compose build so that tensorrt-llm builds successfully.
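A compose file wiring CUDA_ARCH through as a build arg might look like this sketch (the service name, build context, and value format are assumptions, not the repo's actual file):

```yaml
# docker-compose.yml sketch -- names and arg format are assumptions
services:
  whisperbot:
    build:
      context: .
      args:
        # SM compute capability of the target GPU, e.g. 89 for an
        # RTX 4090 (8.9); must match the card you run on
        CUDA_ARCH: "89"
```

With a file like this, `docker compose build` would pass the architecture into the TensorRT-LLM build stage so the kernels match the local GPU.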