Skip to content

Conversation

@soodoshll
Copy link

@soodoshll soodoshll commented Feb 5, 2026

New Requirements:

  • nvidia-nvjpeg
  • nvidia-nvimgcodec-cu13==0.7.0.11

nvimgcodec is enabled by default. Note that this change does not affect the image loader in dynamo.vllm, which has its own image processs logic. Will come up with a PR to dynamo later.

Enable tensor ipc with
--multimodal-tensor-ipc=torch_shm

**Note: ** images sent to the api server will be decoded even before being added to the waiting queue, so please throttle the max concurrency at the client side (now I use 384 * num_backend), which potentially requires modifying task.py.

cc @wangshangsam

soodoshll and others added 21 commits February 3, 2026 13:30
Make tensor IPC datapath optional/config-based

Signed-off-by: Brandon Pelfrey <bpelfrey@nvidia.com>
Signed-off-by: Brandon Pelfrey <bpelfrey@nvidia.com>
Missed as part of rebase. This suggestion makes sense

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Signed-off-by: Brandon Pelfrey <brandonpelfrey@gmail.com>
Signed-off-by: Brandon Pelfrey <bpelfrey@nvidia.com>
Signed-off-by: Brandon Pelfrey <bpelfrey@nvidia.com>
Signed-off-by: Brandon Pelfrey <bpelfrey@nvidia.com>
Signed-off-by: Brandon Pelfrey <bpelfrey@nvidia.com>
Signed-off-by: Brandon Pelfrey <bpelfrey@nvidia.com>
Signed-off-by: Brandon Pelfrey <bpelfrey@nvidia.com>
Signed-off-by: Brandon Pelfrey <bpelfrey@nvidia.com>
Signed-off-by: Brandon Pelfrey <bpelfrey@nvidia.com>
Signed-off-by: Brandon Pelfrey <bpelfrey@nvidia.com>
Signed-off-by: Brandon Pelfrey <bpelfrey@nvidia.com>
Signed-off-by: Brandon Pelfrey <bpelfrey@nvidia.com>
Signed-off-by: Brandon Pelfrey <bpelfrey@nvidia.com>
Signed-off-by: Brandon Pelfrey <bpelfrey@nvidia.com>
Signed-off-by: Brandon Pelfrey <bpelfrey@nvidia.com>
Signed-off-by: Brandon Pelfrey <bpelfrey@nvidia.com>
Signed-off-by: Brandon Pelfrey <bpelfrey@nvidia.com>
Signed-off-by: Brandon Pelfrey <bpelfrey@nvidia.com>
@soodoshll
Copy link
Author

soodoshll commented Feb 5, 2026

Oops, need to pick out some irrelevant commits.
Done

Signed-off-by: Brandon Pelfrey <bpelfrey@nvidia.com>
Signed-off-by: Brandon Pelfrey <bpelfrey@nvidia.com>
Signed-off-by: Brandon Pelfrey <bpelfrey@nvidia.com>
Signed-off-by: Brandon Pelfrey <bpelfrey@nvidia.com>
Signed-off-by: Brandon Pelfrey <bpelfrey@nvidia.com>
…atype, cleanup on scheduler finished_req_ids

Signed-off-by: Brandon Pelfrey <bpelfrey@nvidia.com>
Signed-off-by: Brandon Pelfrey <bpelfrey@nvidia.com>
@wangshangsam wangshangsam merged commit bf71fe5 into CentML:mlperf-inf-mm-q3vl-v6.0 Feb 9, 2026
1 check passed
wangshangsam added a commit that referenced this pull request Feb 10, 2026
wangshangsam added a commit that referenced this pull request Feb 10, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants