Open
Labels
- Infra: <NV>automated tests, build checks, github actions, system stability & efficiency.
- bug: Something isn't working
- waiting for feedback
Description
System Info
In the 1.2.0rc1 Docker image, the file cudaDriverWrapper.h is missing from /app/tensorrt_llm. As a result, the required include header files have to be copied in manually from the source code.
In addition, nvidia-modelopt is installed incorrectly in the image, which causes the export of the quantized KV cache to fail.
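Both observations can be checked from inside the container with a short script. This is only a rough sketch: the helper names find_header and installed_version are made up for illustration, it assumes the image ships a Python interpreter, and it only reports whether the header exists under /app/tensorrt_llm and which nvidia-modelopt version pip has recorded. It does not diagnose why the KV cache export fails.

```python
from importlib import metadata
from pathlib import Path


def find_header(root, name="cudaDriverWrapper.h"):
    """Return every path under root whose file name equals name."""
    root = Path(root)
    if not root.is_dir():
        # A missing root directory is reported the same way as a missing header.
        return []
    return sorted(root.rglob(name))


def installed_version(dist="nvidia-modelopt"):
    """Return the version recorded in the package metadata, or None if absent."""
    try:
        return metadata.version(dist)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    # /app/tensorrt_llm is the path named in this report.
    hits = find_header("/app/tensorrt_llm")
    print("cudaDriverWrapper.h:", [str(p) for p in hits] or "not found")
    print("nvidia-modelopt:", installed_version() or "no package metadata found")
```

An empty result for the header search matches the first symptom. A missing or unexpected nvidia-modelopt version would be a starting point for looking into the second one.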
Who can help?
No response
Information
- The official example scripts
- My own modified scripts
Tasks
- An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
- My own task or dataset (give details below)
Reproduction
1
Expected behavior
1
actual behavior
1
additional notes
1
Before submitting a new issue...
- Make sure you already searched for relevant issues, and checked the documentation and examples for answers to frequently asked questions.